Optimized machine learning framework for cardiovascular disease diagnosis: a novel ethical perspective.

IF 2.3 3区医学 Q3 CARDIAC & CARDIOVASCULAR SYSTEMS BMC Cardiovascular Disorders Pub Date : 2025-02-20 DOI:10.1186/s12872-025-04550-w

Ghadah Alwakid, Farman Ul Haq, Noshina Tariq, Mamoona Humayun, Momina Shaheen, Marwa Alsadun

{"title":"Optimized machine learning framework for cardiovascular disease diagnosis: a novel ethical perspective.","authors":"Ghadah Alwakid, Farman Ul Haq, Noshina Tariq, Mamoona Humayun, Momina Shaheen, Marwa Alsadun","doi":"10.1186/s12872-025-04550-w","DOIUrl":null,"url":null,"abstract":"<p><p>Alignment of advanced cutting-edge technologies such as Artificial Intelligence (AI) has emerged as a significant driving force to achieve greater precision and timeliness in identifying cardiovascular diseases (CVDs). However, it is difficult to achieve high accuracy and reliability in CVD diagnostics due to complex clinical data and the selection and modeling process of useful features. Therefore, this paper studies advanced AI-based feature selection techniques and the application of AI technologies in the CVD classification. It uses methodologies such as Chi-square, Info Gain, Forward Selection, and Backward Elimination as an essence of cardiovascular health indicators into a refined eight-feature subset. This study emphasizes ethical considerations, including transparency, interpretability, and bias mitigation. This is achieved by employing unbiased datasets, fair feature selection techniques, and rigorous validation metrics to ensure fairness and trustworthiness in the AI-based diagnostic process. In addition, the integration of various Machine Learning (ML) models, encompassing Random Forest (RF), XGBoost, Decision Trees (DT), and Logistic Regression (LR), facilitates a comprehensive exploration of predictive performance. Among this diverse range of models, XGBoost stands out as the top performer, achieving exceptional scores with a 99% accuracy rate, 100% recall, 99% F1-measure, and 99% precision. Furthermore, we venture into dimensionality reduction, applying Principal Component Analysis (PCA) to the eight-feature subset, effectively refining it to a compact six-attribute feature subset. Once again, XGBoost shines as the model of choice, yielding outstanding results. It achieves accuracy, recall, F1-measure, and precision scores of 98%, 100%, 98%, and 97%, respectively, when applied to the feature subset derived from the combination of Chi-square and Forward Selection methods.</p>","PeriodicalId":9195,"journal":{"name":"BMC Cardiovascular Disorders","volume":"25 1","pages":"123"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844188/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Cardiovascular Disorders","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12872-025-04550-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Alignment of advanced cutting-edge technologies such as Artificial Intelligence (AI) has emerged as a significant driving force to achieve greater precision and timeliness in identifying cardiovascular diseases (CVDs). However, it is difficult to achieve high accuracy and reliability in CVD diagnostics due to complex clinical data and the selection and modeling process of useful features. Therefore, this paper studies advanced AI-based feature selection techniques and the application of AI technologies in the CVD classification. It uses methodologies such as Chi-square, Info Gain, Forward Selection, and Backward Elimination as an essence of cardiovascular health indicators into a refined eight-feature subset. This study emphasizes ethical considerations, including transparency, interpretability, and bias mitigation. This is achieved by employing unbiased datasets, fair feature selection techniques, and rigorous validation metrics to ensure fairness and trustworthiness in the AI-based diagnostic process. In addition, the integration of various Machine Learning (ML) models, encompassing Random Forest (RF), XGBoost, Decision Trees (DT), and Logistic Regression (LR), facilitates a comprehensive exploration of predictive performance. Among this diverse range of models, XGBoost stands out as the top performer, achieving exceptional scores with a 99% accuracy rate, 100% recall, 99% F1-measure, and 99% precision. Furthermore, we venture into dimensionality reduction, applying Principal Component Analysis (PCA) to the eight-feature subset, effectively refining it to a compact six-attribute feature subset. Once again, XGBoost shines as the model of choice, yielding outstanding results. It achieves accuracy, recall, F1-measure, and precision scores of 98%, 100%, 98%, and 97%, respectively, when applied to the feature subset derived from the combination of Chi-square and Forward Selection methods.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

心血管疾病诊断的优化机器学习框架：一个新的伦理视角。

人工智能（AI）等先进尖端技术的结合已成为提高心血管疾病（cvd）识别精度和及时性的重要推动力。然而，由于临床数据的复杂性和有用特征的选择和建模过程，在CVD诊断中很难达到较高的准确性和可靠性。因此，本文研究了先进的基于人工智能的特征选择技术以及人工智能技术在CVD分类中的应用。它使用卡方、信息增益、前向选择和后向消除等方法作为心血管健康指标的精华，并将其提炼为八个特征子集。本研究强调伦理考虑，包括透明度、可解释性和减少偏见。这是通过使用无偏数据集、公平的特征选择技术和严格的验证指标来实现的，以确保基于人工智能的诊断过程中的公平性和可信度。此外，各种机器学习（ML）模型的集成，包括随机森林（RF）， XGBoost，决策树（DT）和逻辑回归（LR），有助于全面探索预测性能。在这些不同的模型中，XGBoost脱颖而出，成为表现最好的，具有99%的准确率，100%的召回率，99%的f1测量和99%的精度。此外，我们冒险进行降维，将主成分分析（PCA）应用于八个特征子集，有效地将其精炼为紧凑的六个属性特征子集。再一次，XGBoost作为首选模型闪耀着光芒，产生了出色的结果。当应用于卡方和正向选择相结合的特征子集时，它的准确率、召回率、F1-measure和精度分别达到98%、100%、98%和97%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMC Cardiovascular Disorders CARDIAC & CARDIOVASCULAR SYSTEMS-

CiteScore

3.50

自引率

0.00%

发文量

480

审稿时长

1 months

期刊介绍： BMC Cardiovascular Disorders is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of disorders of the heart and circulatory system, as well as related molecular and cell biology, genetics, pathophysiology, epidemiology, and controlled trials.