Analysis and Prediction of Diabetes Disease Using Machine Learning Methods

IF 0.6 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Decision Support System Technology Pub Date : 2022-01-01 DOI:10.4018/ijdsst.303943
Sarra Samet, Mohamed Ridda Laouar, Issam Bendib, Sean B. Eom
{"title":"Analysis and Prediction of Diabetes Disease Using Machine Learning Methods","authors":"Sarra Samet, Mohamed Ridda Laouar, Issam Bendib, Sean B. Eom","doi":"10.4018/ijdsst.303943","DOIUrl":null,"url":null,"abstract":"To increase healthcare quality, early illness prediction helps patients prevent potentially life-threatening health issues before it is too late. Artificial intelligence is a rapidly evolving area, and its applications to diabetes, a worldwide epidemic, have the potential to revolutionize the way diabetes is diagnosed and managed. A total of six supervised machine learning algorithms based on patient data were used and compared to predict the diagnosis of diabetes mellitus. For experiments, the Pima Indians Diabetes Database was used, and their missing values were carefully handled by different techniques. For random train-test splits, the Random Forest classification algorithm achieved an accuracy rate of 92 percent. This model outperforms other state-of-the-art approaches due to the application of a combination of techniques for dealing with missing values (the mixture of imputing missing values techniques) that is proposed. With this approach, the models of this manuscript achieved better accuracy than prior work done with the Pima diabetes data.","PeriodicalId":42414,"journal":{"name":"International Journal of Decision Support System Technology","volume":"60 1","pages":"1-19"},"PeriodicalIF":0.6000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Decision Support System Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijdsst.303943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 1

Abstract

To increase healthcare quality, early illness prediction helps patients prevent potentially life-threatening health issues before it is too late. Artificial intelligence is a rapidly evolving area, and its applications to diabetes, a worldwide epidemic, have the potential to revolutionize the way diabetes is diagnosed and managed. A total of six supervised machine learning algorithms based on patient data were used and compared to predict the diagnosis of diabetes mellitus. For experiments, the Pima Indians Diabetes Database was used, and their missing values were carefully handled by different techniques. For random train-test splits, the Random Forest classification algorithm achieved an accuracy rate of 92 percent. This model outperforms other state-of-the-art approaches due to the application of a combination of techniques for dealing with missing values (the mixture of imputing missing values techniques) that is proposed. With this approach, the models of this manuscript achieved better accuracy than prior work done with the Pima diabetes data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用机器学习方法分析和预测糖尿病疾病
为了提高医疗保健质量,早期疾病预测可以帮助患者在为时已晚之前预防可能危及生命的健康问题。人工智能是一个快速发展的领域,它在全球流行病糖尿病中的应用有可能彻底改变糖尿病的诊断和管理方式。总共使用了六种基于患者数据的监督机器学习算法,并对其进行了比较,以预测糖尿病的诊断。在实验中,使用了皮马印第安人糖尿病数据库,并通过不同的技术仔细处理了它们的缺失值。对于随机训练-测试分割,随机森林分类算法的准确率达到92%。该模型优于其他最先进的方法,因为应用了处理缺失值的技术组合(输入缺失值技术的混合)。通过这种方法,该手稿的模型比以前用皮马糖尿病数据完成的工作取得了更好的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Decision Support System Technology
International Journal of Decision Support System Technology COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
2.20
自引率
18.20%
发文量
40
期刊最新文献
A Novel Query Method for Spatial Database Based on Improved K-Nearest Neighbor Algorithm Analysis and Evaluation of Roadblocks Hindering Lean-Green and Industry 4.0 Practices in Indian Manufacturing Industries Developing Fuzzy-AHP-Integrated Hybrid MCDM System of COPRAS-ARAS for Solving an Industrial Robot Selection Problem Generalized Parametric Intuitionistic Fuzzy Measures Based on Trigonometric Functions for Improved Decision-Making Problem An Efficient Method to Decide the Malicious Traffic
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1