{"title":"Analysis of machine learning methods to determine the best data analysis method for diabetes prediction","authors":"Nondumiso Sihlangu, R. Millham","doi":"10.1109/ICTAS56421.2023.10082727","DOIUrl":null,"url":null,"abstract":"Chronic diabetes results from the body's inability to produce adequate insulin. It is an incurable but treatable disease. The experiment conducted in this study aims to analyze different machine learning methods like Stochastic Gradient Descent, Support Vector Machine, Logistic Regression, and CN2 Rule using the Orange data mining software and use them for diabetes prediction based on the PIMA Indian Diabetes Dataset. Utilizing different performance criteria like Accuracy, Precision, Recall, and F1-Score, these approaches were examined and evaluated. The best outcome was obtained by CN2 Rule Induction, achieving an accuracy score of 80.7% which shows that this method is the most suitable for diabetes prediction compared to the other three models.","PeriodicalId":158720,"journal":{"name":"2023 Conference on Information Communications Technology and Society (ICTAS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Conference on Information Communications Technology and Society (ICTAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAS56421.2023.10082727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Chronic diabetes results from the body's inability to produce adequate insulin. It is an incurable but treatable disease. The experiment conducted in this study aims to analyze different machine learning methods like Stochastic Gradient Descent, Support Vector Machine, Logistic Regression, and CN2 Rule using the Orange data mining software and use them for diabetes prediction based on the PIMA Indian Diabetes Dataset. Utilizing different performance criteria like Accuracy, Precision, Recall, and F1-Score, these approaches were examined and evaluated. The best outcome was obtained by CN2 Rule Induction, achieving an accuracy score of 80.7% which shows that this method is the most suitable for diabetes prediction compared to the other three models.