Author: B. Sumathy
Journal: International Journal of Reliable and Quality E-Healthcare (JCR Q2, Nursing)
Published: 2022-04-01
DOI: 10.4018/ijrqeh.299959 (https://doi.org/10.4018/ijrqeh.299959)
Prediction of Diabetic Retinopathy Using Health Records with Machine Learning Classifiers and Data Science
Diabetes is a rapidly spreading disease that arises when the pancreas produces insufficient insulin or the body cannot utilise it effectively. Diabetic Retinopathy (DR) and blindness are two major complications for diabetics. As the number of diabetes patients grows, so does the amount of data collected about DR. Data mining (DM) techniques are required to extract important information and undiscovered knowledge from this data, and applying DM to DR can help improve public health. Our study focuses on the early detection of Diabetic Retinopathy using patient information. DM approaches are used to extract information from these numeric records. DR was predicted from the dataset using logistic regression, KNN, SVM, bagged tree, and boosted tree classifiers. Two cross-validations are used to find the best features and avoid overfitting. Our dataset includes 900 diabetes patients. The boosted tree produced the best classification accuracy (90.1%) with 10% hold-out validation. KNN achieved a comparable accuracy of 88.9%. As a result, our research suggests that bagged trees and KNN are good classifiers for DR.
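To make the evaluation setup concrete, the following is a minimal sketch of KNN classification scored with a 10% hold-out split on 900 records. It is not the authors' pipeline: the records are synthetic, and the two numeric features, the choice of k = 5, the random seeds, and all function names are assumptions made purely for illustration. The accuracy it prints says nothing about the figures reported above.

```python
import math
import random
from collections import Counter


def make_synthetic_records(n, seed=0):
    """Hypothetical stand-in for numeric patient records: (features, label)."""
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        label = rng.randint(0, 1)  # 1 = DR, 0 = no DR (made-up labels)
        # Two made-up numeric features whose means shift with the label.
        features = [rng.gauss(2.0 * label, 1.0), rng.gauss(1.5 * label, 1.0)]
        records.append((features, label))
    return records


def holdout_split(records, test_fraction=0.10, seed=0):
    """Shuffle a copy of the records and hold out a fraction for testing."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def knn_predict(train, query, k=5):
    """Majority vote among the k training records nearest to the query."""
    nearest = sorted(train, key=lambda rec: math.dist(rec[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def holdout_accuracy(train, test, k=5):
    """Fraction of held-out records whose label is predicted correctly."""
    correct = sum(knn_predict(train, x, k) == y for x, y in test)
    return correct / len(test)


records = make_synthetic_records(900)
train, test = holdout_split(records)
acc = holdout_accuracy(train, test)
print(f"train={len(train)} test={len(test)} hold-out accuracy={acc:.3f}")
```

A single hold-out split gives a noisy estimate when only 90 records are held out, which is one reason to pair it with cross-validation as the abstract describes.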