A hybrid prediction model for type 2 diabetes using K-means and decision tree
Wenqian Chen, Shuyu Chen, Hancui Zhang, Tianshu Wu
2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS), 2017-11-01
DOI: 10.1109/ICSESS.2017.8342938 (https://doi.org/10.1109/ICSESS.2017.8342938)
Citations: 70
Abstract
Type 2 diabetes has a high incidence worldwide, and early detection is essential for its prevention and treatment. Data mining techniques are becoming increasingly important in medical diagnosis because of their classification capability. This paper proposes a hybrid prediction model to assist in the diagnosis of Type 2 diabetes. The model uses K-means for data reduction and a J48 decision tree as the classifier. Experiments were conducted on the Pima Indians Diabetes Dataset from the UCI Machine Learning Repository. The results show that the proposed model achieves better accuracy than previous studies reported in the literature. These results suggest that the proposed model would be helpful in Type 2 diabetes diagnosis.
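The abstract does not spell out how K-means performs the data reduction, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that "data reduction" means clustering the samples and discarding those whose class label disagrees with the majority label of their cluster. It also swaps in a small Gini-based binary tree in place of J48 (Weka's C4.5), and runs on synthetic two-feature data rather than the Pima Indians Diabetes Dataset. All function names (`kmeans`, `reduce_by_cluster`, `build_tree`, `predict`, `make_toy_data`) are hypothetical.

```python
import random
from collections import Counter


def sq_dist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm; returns one cluster index per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster goes empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return assign


def reduce_by_cluster(X, y, k=2, seed=0):
    """ASSUMPTION: keep only samples whose label matches their cluster's majority label."""
    assign = kmeans(X, k, seed=seed)
    majority = {
        c: Counter(l for l, a in zip(y, assign) if a == c).most_common(1)[0][0]
        for c in set(assign)
    }
    kept = [(x, l) for x, l, a in zip(X, y, assign) if majority[a] == l]
    return [x for x, _ in kept], [l for _, l in kept]


def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(X, y, depth=0, max_depth=3):
    """Gini-based binary tree (a stand-in for J48, not C4.5 itself)."""
    if depth == max_depth or len(set(y)) == 1:
        return Counter(y).most_common(1)[0][0]  # leaf: majority label
    best = None
    for f in range(len(X[0])):
        # Skip the largest value so both sides of the split are non-empty.
        for t in sorted(set(x[f] for x in X))[:-1]:
            left = [l for x, l in zip(X, y) if x[f] <= t]
            right = [l for x, l in zip(X, y) if x[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:  # no feature can split these samples
        return Counter(y).most_common(1)[0][0]
    _, f, t = best
    li = [i for i, x in enumerate(X) if x[f] <= t]
    ri = [i for i, x in enumerate(X) if x[f] > t]
    return (f, t,
            build_tree([X[i] for i in li], [y[i] for i in li], depth + 1, max_depth),
            build_tree([X[i] for i in ri], [y[i] for i in ri], depth + 1, max_depth))


def predict(tree, x):
    while isinstance(tree, tuple):  # internal nodes are (feature, threshold, left, right)
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree


def make_toy_data(n=100, seed=1):
    """Synthetic stand-in data: two overlapping Gaussian blobs with labels 0 and 1."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        center = 0.0 if label == 0 else 3.0
        X.append((rng.gauss(center, 1.0), rng.gauss(center, 1.0)))
        y.append(label)
    return X, y


X, y = make_toy_data()
Xr, yr = reduce_by_cluster(X, y)
tree = build_tree(Xr, yr)
accuracy = sum(predict(tree, x) == l for x, l in zip(Xr, yr)) / len(yr)
print(f"kept {len(Xr)} of {len(X)} samples; training accuracy on kept data: {accuracy:.2f}")
```

Accuracy measured on the reduced training data, as printed here, is optimistic by construction, because the filtering step removes exactly the samples that are hardest to classify. A fair comparison would evaluate on held-out data that has not been filtered, or use cross-validation.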