Early Prediction of Coronary Heart Disease using Boosting-based Voting Ensemble Learning

Subhash Mondal, Ranjan Maity, Yash Raj Singh, Soumadip Ghosh, A. Nag
{"title":"Early Prediction of Coronary Heart Disease using Boosting-based Voting Ensemble Learning","authors":"Subhash Mondal, Ranjan Maity, Yash Raj Singh, Soumadip Ghosh, A. Nag","doi":"10.1109/IBSSC56953.2022.10037445","DOIUrl":null,"url":null,"abstract":"Coronary-Heart-Disease (CHD) risk increases daily due to the uncontrolled lifestyle of today's adult age group. The early detection of the disease can prevent unfortunate death due to heart-related complications. The Machine Learning (ML) technique is essential for the early diagnosis of CHD and for identifying its many contributing factor variables. To build the prediction model, we have used the dataset consisting of 4240 instances and 15 related features to predict the possibility of future risk of CHD in the next ten years. Initially, thirteen ML models were deployed with 10-fold cross-validation, reflecting the highest test accuracy of 91.28% for the Random Forest (RF) classifier. The models were turned further, and the boosting algorithms showed the highest accuracy of 91 % and above; the Gradient Boost (GB) classifier performed better with an accuracy of 92.11 %. The voting ensemble approaches using the best-performing boosting models, namely GB, HGB, XGB, CB, and LGBM, have been considered for the final prediction. The prediction results reflected an accuracy of 92.26%, an F1 score of 91.25%, a ROC-AUC score of 0.917, and the number of False Negatives (FN) values is about 6.25% of the total test dataset.","PeriodicalId":426897,"journal":{"name":"2022 IEEE Bombay Section Signature Conference (IBSSC)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC56953.2022.10037445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Coronary-Heart-Disease (CHD) risk increases daily due to the uncontrolled lifestyle of today's adult age group. The early detection of the disease can prevent unfortunate death due to heart-related complications. The Machine Learning (ML) technique is essential for the early diagnosis of CHD and for identifying its many contributing factor variables. To build the prediction model, we have used the dataset consisting of 4240 instances and 15 related features to predict the possibility of future risk of CHD in the next ten years. Initially, thirteen ML models were deployed with 10-fold cross-validation, reflecting the highest test accuracy of 91.28% for the Random Forest (RF) classifier. The models were turned further, and the boosting algorithms showed the highest accuracy of 91 % and above; the Gradient Boost (GB) classifier performed better with an accuracy of 92.11 %. The voting ensemble approaches using the best-performing boosting models, namely GB, HGB, XGB, CB, and LGBM, have been considered for the final prediction. The prediction results reflected an accuracy of 92.26%, an F1 score of 91.25%, a ROC-AUC score of 0.917, and the number of False Negatives (FN) values is about 6.25% of the total test dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于boosting的投票集合学习的冠心病早期预测
由于当今成年人不受控制的生活方式,冠心病(CHD)的风险日益增加。这种疾病的早期发现可以防止因心脏相关并发症而不幸死亡。机器学习(ML)技术对于冠心病的早期诊断和识别其许多促成因素变量至关重要。为了建立预测模型,我们使用了由4240个实例和15个相关特征组成的数据集来预测未来十年冠心病风险的可能性。最初,部署了13个ML模型并进行了10倍交叉验证,反映了随机森林(RF)分类器的最高测试准确率为91.28%。对模型进行进一步优化,增强算法的准确率达到91%以上;梯度增强(GB)分类器表现较好,准确率为92.11%。使用性能最好的增强模型(即GB、HGB、XGB、CB和LGBM)的投票集成方法已被考虑用于最终预测。预测结果准确率为92.26%,F1得分为91.25%,ROC-AUC得分为0.917,假阴性(False Negatives, FN)值约占整个测试数据集的6.25%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Decentralized Ride Hailing System using Blockchain and IPFS Implementation of RFID-based Lab Inventory System Monkeypox Skin Lesion Classification Using Transfer Learning Approach A Solution to the Techno-Economic Generation Expansion Planning using Enhanced Dwarf Mongoose Optimization Algorithm Citation Count Prediction Using Different Time Series Analysis Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1