Mohammad Maydanchi, Mehrbod Ziaei, Mehrdad Mohammadi, Armin Ziaei, Mina Basiry, Fatemeh Haji, Kazhal Gharibi
{"title":"A Comparative Analysis of the Machine Learning Methods for Predicting Diabetes","authors":"Mohammad Maydanchi, Mehrbod Ziaei, Mehrdad Mohammadi, Armin Ziaei, Mina Basiry, Fatemeh Haji, Kazhal Gharibi","doi":"10.31181/jopi21202421","DOIUrl":null,"url":null,"abstract":"Diabetes can lead to various health problems and complications, such as cardiovascular disease, kidney damage (nephropathy), eye issues, neuropathy, and foot ailments. Therefore, early diagnosis of diabetes can be immensely beneficial in preventing the development of these conditions. Utilizing machine learning is one method to detect diabetes in individuals at an early stage. In this study, we compare the performance of nine machine-learning classification models in predicting diabetes. These models include XGBoost, gradient boosting, AdaBoost, logistic regression, decision tree, KNN, perceptron, random forest, and naïve bayes. We utilize several evaluation metrics, focusing on the f1-score, area under the curve (AUC), and computational runtime. Our comparison reveals that complex tree-based models exhibit the highest f1-score and AUC, albeit with longer execution times.","PeriodicalId":515345,"journal":{"name":"Journal of Operations Intelligence","volume":"105 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Operations Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31181/jopi21202421","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Diabetes can lead to various health problems and complications, such as cardiovascular disease, kidney damage (nephropathy), eye issues, neuropathy, and foot ailments. Therefore, early diagnosis of diabetes can be immensely beneficial in preventing the development of these conditions. Utilizing machine learning is one method to detect diabetes in individuals at an early stage. In this study, we compare the performance of nine machine-learning classification models in predicting diabetes. These models include XGBoost, gradient boosting, AdaBoost, logistic regression, decision tree, KNN, perceptron, random forest, and naïve bayes. We utilize several evaluation metrics, focusing on the f1-score, area under the curve (AUC), and computational runtime. Our comparison reveals that complex tree-based models exhibit the highest f1-score and AUC, albeit with longer execution times.
糖尿病可导致各种健康问题和并发症,如心血管疾病、肾损伤(肾病)、眼部问题、神经病变和足部疾病。因此,早期诊断糖尿病对预防这些病症的发生大有裨益。利用机器学习是早期检测糖尿病的一种方法。在本研究中,我们比较了九种机器学习分类模型在预测糖尿病方面的性能。这些模型包括 XGBoost、梯度提升、AdaBoost、逻辑回归、决策树、KNN、感知器、随机森林和天真贝叶斯。我们采用了多个评估指标,重点关注 f1 分数、曲线下面积(AUC)和计算运行时间。比较结果表明,基于复杂树的模型表现出最高的 f1 分数和 AUC,尽管执行时间较长。