{"title":"基于特征选择的机器学习算法预测2型糖尿病","authors":"A. Yılmaz","doi":"10.5114/hpc.2022.114541","DOIUrl":null,"url":null,"abstract":"The aim of this study is to develop and evaluate a machine learning model for the early diagnosis of type 2 diabetes to allow for treatments to be applied in the early stages of the disease. Material and methods. A proposed hybrid machine learning model was developed and applied to the Early-stage diabetes risk prediction dataset from the UCI database. The prediction success of the proposed model was compared with other machine learning models. Pearson’s correlation and SelectKBest feature selection methods were employed to examine the relationships between the dataset input parameters and the results. Results. Of the 520 patients included in the dataset, 320 were diagnosed with diabetes and 328 (63.08%) were males. The most commonly observed diabetes diagnosis criterion was obesity (n=482, 83.08%). While the strongest feature detected with Pearson’s correlation was polyuria, the strongest feature detected with SelectKBest was polydipsia. With Pearson’s feature extraction, the most successful machine learning method was the proposed hybrid method, with an accuracy of 97.28%. Using SelectKBest feature selection, the same model was able to predict type 2 diabetes with accuracy of 95.16%. Conclusions. Early detection of type 2 diabetes will allow for a prompter and more effective treatment of the patient. Thus, use of the proposed model may help to improve the quality of patient care and lower the number of deaths caused by this disease.","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS\",\"authors\":\"A. Yılmaz\",\"doi\":\"10.5114/hpc.2022.114541\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this study is to develop and evaluate a machine learning model for the early diagnosis of type 2 diabetes to allow for treatments to be applied in the early stages of the disease. Material and methods. A proposed hybrid machine learning model was developed and applied to the Early-stage diabetes risk prediction dataset from the UCI database. The prediction success of the proposed model was compared with other machine learning models. Pearson’s correlation and SelectKBest feature selection methods were employed to examine the relationships between the dataset input parameters and the results. Results. Of the 520 patients included in the dataset, 320 were diagnosed with diabetes and 328 (63.08%) were males. The most commonly observed diabetes diagnosis criterion was obesity (n=482, 83.08%). While the strongest feature detected with Pearson’s correlation was polyuria, the strongest feature detected with SelectKBest was polydipsia. With Pearson’s feature extraction, the most successful machine learning method was the proposed hybrid method, with an accuracy of 97.28%. Using SelectKBest feature selection, the same model was able to predict type 2 diabetes with accuracy of 95.16%. Conclusions. Early detection of type 2 diabetes will allow for a prompter and more effective treatment of the patient. Thus, use of the proposed model may help to improve the quality of patient care and lower the number of deaths caused by this disease.\",\"PeriodicalId\":0,\"journal\":{\"name\":\"\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5114/hpc.2022.114541\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5114/hpc.2022.114541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS
The aim of this study is to develop and evaluate a machine learning model for the early diagnosis of type 2 diabetes to allow for treatments to be applied in the early stages of the disease. Material and methods. A proposed hybrid machine learning model was developed and applied to the Early-stage diabetes risk prediction dataset from the UCI database. The prediction success of the proposed model was compared with other machine learning models. Pearson’s correlation and SelectKBest feature selection methods were employed to examine the relationships between the dataset input parameters and the results. Results. Of the 520 patients included in the dataset, 320 were diagnosed with diabetes and 328 (63.08%) were males. The most commonly observed diabetes diagnosis criterion was obesity (n=482, 83.08%). While the strongest feature detected with Pearson’s correlation was polyuria, the strongest feature detected with SelectKBest was polydipsia. With Pearson’s feature extraction, the most successful machine learning method was the proposed hybrid method, with an accuracy of 97.28%. Using SelectKBest feature selection, the same model was able to predict type 2 diabetes with accuracy of 95.16%. Conclusions. Early detection of type 2 diabetes will allow for a prompter and more effective treatment of the patient. Thus, use of the proposed model may help to improve the quality of patient care and lower the number of deaths caused by this disease.