评估心血管疾病预测的二元分类器:增强早期诊断能力。

IF 2.4 4区 医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS Journal of Cardiovascular Development and Disease Pub Date : 2024-12-09 DOI:10.3390/jcdd11120396
Paul Iacobescu, Virginia Marina, Catalin Anghel, Aurelian-Dumitrache Anghele
{"title":"评估心血管疾病预测的二元分类器:增强早期诊断能力。","authors":"Paul Iacobescu, Virginia Marina, Catalin Anghel, Aurelian-Dumitrache Anghele","doi":"10.3390/jcdd11120396","DOIUrl":null,"url":null,"abstract":"<p><p>Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict the risk of cardiovascular disease (CVD) by analyzing patient data. This study evaluates seven binary classification algorithms, including Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks, to understand their effectiveness in predicting CVD. Advanced preprocessing techniques, such as SMOTE-ENN for addressing class imbalance and hyperparameter optimization through Grid Search Cross-Validation, were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models like Logistic Regression and Gradient Boosting. The study examines the challenges encountered when working with datasets related to cardiovascular diseases, such as class imbalance and feature selection. It demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.</p>","PeriodicalId":15197,"journal":{"name":"Journal of Cardiovascular Development and Disease","volume":"11 12","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11678659/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluating Binary Classifiers for Cardiovascular Disease Prediction: Enhancing Early Diagnostic Capabilities.\",\"authors\":\"Paul Iacobescu, Virginia Marina, Catalin Anghel, Aurelian-Dumitrache Anghele\",\"doi\":\"10.3390/jcdd11120396\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict the risk of cardiovascular disease (CVD) by analyzing patient data. This study evaluates seven binary classification algorithms, including Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks, to understand their effectiveness in predicting CVD. Advanced preprocessing techniques, such as SMOTE-ENN for addressing class imbalance and hyperparameter optimization through Grid Search Cross-Validation, were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models like Logistic Regression and Gradient Boosting. The study examines the challenges encountered when working with datasets related to cardiovascular diseases, such as class imbalance and feature selection. It demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.</p>\",\"PeriodicalId\":15197,\"journal\":{\"name\":\"Journal of Cardiovascular Development and Disease\",\"volume\":\"11 12\",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11678659/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cardiovascular Development and Disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3390/jcdd11120396\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cardiovascular Development and Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/jcdd11120396","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

心血管疾病是一个重大的全球健康问题,也是许多国家的主要死亡原因。心血管疾病的早期发现和诊断可以显著降低并发症和死亡率的风险。机器学习方法,特别是分类算法,已经证明了它们通过分析患者数据准确预测心血管疾病(CVD)风险的潜力。本研究评估了七种二分类算法,包括随机森林、逻辑回归、朴素贝叶斯、k近邻(kNN)、支持向量机、梯度增强和人工神经网络,以了解它们在预测心血管疾病方面的有效性。采用先进的预处理技术,如用于解决类不平衡的SMOTE-ENN和通过网格搜索交叉验证进行超参数优化,以提高这些模型的可靠性和性能。标准评估指标,包括准确度、精密度、召回率、f1评分和受试者工作特征曲线下面积(ROC-AUC),用于评估预测能力。结果表明,kNN的准确率最高(99%),AUC最高(0.99),超过了Logistic回归和Gradient Boosting等传统模型。该研究考察了在处理与心血管疾病相关的数据集时遇到的挑战,例如类别不平衡和特征选择。它演示了如何解决这些问题,以提高预测模型的可靠性和适用性。这些发现强调了kNN作为早期CVD预测的可靠工具的潜力,比以前的研究提供了重大改进。这项研究强调了先进的机器学习技术在医疗保健中的价值,解决了关键挑战,并为未来旨在改进心血管疾病预防预测模型的研究奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Evaluating Binary Classifiers for Cardiovascular Disease Prediction: Enhancing Early Diagnostic Capabilities.

Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict the risk of cardiovascular disease (CVD) by analyzing patient data. This study evaluates seven binary classification algorithms, including Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks, to understand their effectiveness in predicting CVD. Advanced preprocessing techniques, such as SMOTE-ENN for addressing class imbalance and hyperparameter optimization through Grid Search Cross-Validation, were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models like Logistic Regression and Gradient Boosting. The study examines the challenges encountered when working with datasets related to cardiovascular diseases, such as class imbalance and feature selection. It demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Cardiovascular Development and Disease
Journal of Cardiovascular Development and Disease CARDIAC & CARDIOVASCULAR SYSTEMS-
CiteScore
2.60
自引率
12.50%
发文量
381
期刊最新文献
Spinal Cord Stimulation for Refractory Angina Pectoris: Current Status and Future Perspectives, a Narrative Review. Longitudinal Outcomes of Patients with Aortic Stenosis Stratified by Sex: An Asian Perspective. Direct Axillary Artery Cannulation as Standard Perfusion Strategy in Minimally Invasive Coronary Artery Bypass Grafting. Interindividual Variability Response to Resistance and High-Intensity Interval Training on Blood Pressure Reduction in Hypertensive Older Adults. Rapid Improvement in Cardiac Damage Predicts Better Prognosis After Transcatheter Aortic Valve Replacement.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1