A comparative approach to alleviating the prevalence of diabetes mellitus using machine learning
Md. Rifatul Islam, Semonti Banik, Kazi Naimur Rahman, Mohammad Mizanur Rahman
Computer Methods and Programs in Biomedicine Update, Volume 4, Article 100113 (2023)
DOI: 10.1016/j.cmpbup.2023.100113
https://www.sciencedirect.com/science/article/pii/S2666990023000228
Abstract
Diabetes mellitus, a metabolic disease characterized by elevated blood sugar levels, is a significant global public health concern. Identifying diabetes at a very early stage can reduce its prevalence. This work develops a machine learning-based system intended to improve the identification of diabetic patients. To build the system, we used a dataset collected through direct questionnaires administered to patients at Sylhet Diabetic Hospital. The dataset records the signs and symptoms of patients who are newly diabetic or likely to develop diabetes. We applied 14 machine learning techniques, among which the Gradient Boosting Machine (GBM) performed best, achieving the highest F1 and ROC scores of 99.37% and 99.92%, respectively. We also evaluated several ensemble-based approaches, whose performance is competitive with that of the individual models.
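For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how an F1 score and a ROC score can be computed for a binary classifier. It is not the authors' pipeline: the labels and scores are made-up toy values, the function names `f1_score` and `roc_auc` are illustrative, and "ROC score" is assumed here to mean the area under the ROC curve.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the fraction of (positive, negative)
    pairs in which the positive case gets the higher score (ties count half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (assumed values, not taken from the Sylhet dataset).
y_true = [1, 1, 1, 0, 0, 0]                       # 1 = diabetic, 0 = non-diabetic
scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]           # hypothetical model probabilities
y_pred = [1 if s >= 0.5 else 0 for s in scores]   # threshold at 0.5

print(f"F1  = {f1_score(y_true, y_pred):.4f}")    # F1  = 0.6667
print(f"AUC = {roc_auc(y_true, scores):.4f}")     # AUC = 0.8889
```

F1 depends on the chosen decision threshold, whereas the area under the ROC curve summarizes ranking quality across all thresholds, which is one reason the two are often reported together.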