妊娠期糖尿病预测的超参数调谐集成方法

S. Prasanth, Kuhaneswaran Banujan, Kumara Btgs
{"title":"妊娠期糖尿病预测的超参数调谐集成方法","authors":"S. Prasanth, Kuhaneswaran Banujan, Kumara Btgs","doi":"10.1109/3ICT53449.2021.9581926","DOIUrl":null,"url":null,"abstract":"Diabetes Mellitus is commonly found in human beings around the world and this is one of the serious diseases which causes boundless suffering among patients. There are numerous reasons for the prevalence of this disease. It would be better to consider the predictions carried out earlier in this respect, since diabetes is a non - communicable disease and makes a great impact on the health condition of people nowadays. This is the reason why the existing medicinal practices in most hospitals are collecting patients' life history or the record of the disease. This is done for diagnosing diabetes using various medical tests followed by proper treatment for the disease. Machine learning provides an immense contribution to the sector of healthcare. For this research, Pima Indians Diabetes Dataset, obtained from the University of California, Irvine (UCI) machine learning source that included 768 patients' details along with nine attributes had been chosen for a comprehensive investigation of this grave and widespread problem in the health sector. Eventually, an adequate perfect outcome could be achieved and some effective and transparent conclusions were made. Among 768 diabetics, 500 were recognized as positive for the disease while 268 were recognized as negative. Besides, the recorded facts were put into particular supervised machine learning techniques such as Support Vector Machine (SVM), Naïve Bayes (NB), Decision Tree (DT), Artificial Neural Networks (ANN), Linear Discriminant Analysis (LDA), Logistic Regression (LR) and k-nearest neighbors (k-NN). Along with this, bagging and boosting techniques like Random Forest (RF), Extreme Gradient Boosting (XGBoost), LightGBM, and CatBoost too were taken into consideration. In addition, by considering classifiers with the highest accuracies, the final ensemble model was developed with the adaption of SVM, CatBoost and RF to predict the diabetes mellitus. Thus, the model resulted in an accuracy of 86.15%.","PeriodicalId":133021,"journal":{"name":"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Hyper Parameter Tuned Ensemble Approach for Gestational Diabetes Prediction\",\"authors\":\"S. Prasanth, Kuhaneswaran Banujan, Kumara Btgs\",\"doi\":\"10.1109/3ICT53449.2021.9581926\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes Mellitus is commonly found in human beings around the world and this is one of the serious diseases which causes boundless suffering among patients. There are numerous reasons for the prevalence of this disease. It would be better to consider the predictions carried out earlier in this respect, since diabetes is a non - communicable disease and makes a great impact on the health condition of people nowadays. This is the reason why the existing medicinal practices in most hospitals are collecting patients' life history or the record of the disease. This is done for diagnosing diabetes using various medical tests followed by proper treatment for the disease. Machine learning provides an immense contribution to the sector of healthcare. For this research, Pima Indians Diabetes Dataset, obtained from the University of California, Irvine (UCI) machine learning source that included 768 patients' details along with nine attributes had been chosen for a comprehensive investigation of this grave and widespread problem in the health sector. Eventually, an adequate perfect outcome could be achieved and some effective and transparent conclusions were made. Among 768 diabetics, 500 were recognized as positive for the disease while 268 were recognized as negative. Besides, the recorded facts were put into particular supervised machine learning techniques such as Support Vector Machine (SVM), Naïve Bayes (NB), Decision Tree (DT), Artificial Neural Networks (ANN), Linear Discriminant Analysis (LDA), Logistic Regression (LR) and k-nearest neighbors (k-NN). Along with this, bagging and boosting techniques like Random Forest (RF), Extreme Gradient Boosting (XGBoost), LightGBM, and CatBoost too were taken into consideration. In addition, by considering classifiers with the highest accuracies, the final ensemble model was developed with the adaption of SVM, CatBoost and RF to predict the diabetes mellitus. Thus, the model resulted in an accuracy of 86.15%.\",\"PeriodicalId\":133021,\"journal\":{\"name\":\"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/3ICT53449.2021.9581926\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/3ICT53449.2021.9581926","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

糖尿病是世界范围内人类普遍存在的一种严重疾病,给患者带来无尽的痛苦。这种疾病的流行有很多原因。由于糖尿病是一种非传染性疾病,对当今人们的健康状况有很大的影响,因此在这方面最好考虑先前进行的预测。这就是为什么大多数医院现有的医疗实践都在收集患者的生活史或疾病记录。这是为了通过各种医学检查诊断糖尿病,然后对疾病进行适当的治疗。机器学习为医疗保健领域提供了巨大的贡献。在这项研究中,从加州大学欧文分校(UCI)的机器学习资源中获得的皮马印第安人糖尿病数据集,包括768名患者的详细信息以及9个属性,被选中用于对卫生部门这一严重而普遍的问题进行全面调查。最终取得了一个充分完善的结果,并得出了一些有效和透明的结论。在768名糖尿病患者中,500人被诊断为阳性,268人被诊断为阴性。此外,将记录的事实放入特定的监督机器学习技术中,如支持向量机(SVM)、Naïve贝叶斯(NB)、决策树(DT)、人工神经网络(ANN)、线性判别分析(LDA)、逻辑回归(LR)和k-近邻(k-NN)。与此同时,装袋和增强技术,如随机森林(RF),极端梯度增强(XGBoost), LightGBM和CatBoost也被考虑在内。此外,考虑准确率最高的分类器,将SVM、CatBoost和RF自适应构建最终的集成模型来预测糖尿病。因此,该模型的准确率为86.15%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Hyper Parameter Tuned Ensemble Approach for Gestational Diabetes Prediction
Diabetes Mellitus is commonly found in human beings around the world and this is one of the serious diseases which causes boundless suffering among patients. There are numerous reasons for the prevalence of this disease. It would be better to consider the predictions carried out earlier in this respect, since diabetes is a non - communicable disease and makes a great impact on the health condition of people nowadays. This is the reason why the existing medicinal practices in most hospitals are collecting patients' life history or the record of the disease. This is done for diagnosing diabetes using various medical tests followed by proper treatment for the disease. Machine learning provides an immense contribution to the sector of healthcare. For this research, Pima Indians Diabetes Dataset, obtained from the University of California, Irvine (UCI) machine learning source that included 768 patients' details along with nine attributes had been chosen for a comprehensive investigation of this grave and widespread problem in the health sector. Eventually, an adequate perfect outcome could be achieved and some effective and transparent conclusions were made. Among 768 diabetics, 500 were recognized as positive for the disease while 268 were recognized as negative. Besides, the recorded facts were put into particular supervised machine learning techniques such as Support Vector Machine (SVM), Naïve Bayes (NB), Decision Tree (DT), Artificial Neural Networks (ANN), Linear Discriminant Analysis (LDA), Logistic Regression (LR) and k-nearest neighbors (k-NN). Along with this, bagging and boosting techniques like Random Forest (RF), Extreme Gradient Boosting (XGBoost), LightGBM, and CatBoost too were taken into consideration. In addition, by considering classifiers with the highest accuracies, the final ensemble model was developed with the adaption of SVM, CatBoost and RF to predict the diabetes mellitus. Thus, the model resulted in an accuracy of 86.15%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Securing SCADA Systems against Cyber-Attacks using Artificial Intelligence Quality of Life Integrated Framework: Perspective of Cloud Computing Usage Reference Points Generated on Unit Hypersurfaces for MaOEAs Eye-Tracking Analysis with Deep Learning Method An Implementation and Evaluation of Basic Data Storage Topic for Content Provider Stage in Android Programming Learning Assistance System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1