Cost-Sensitive Models to Predict Risk of Cardiovascular Events in Patients with Chronic Heart Failure

IF 2.9 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Information (Switzerland) Pub Date : 2023-10-03 DOI:10.3390/info14100542

Maria Carmela Groccia, Rosita Guido, Domenico Conforti, Corrado Pelaia, Giuseppe Armentaro, Alfredo Francesco Toscani, Sofia Miceli, Elena Succurro, Marta Letizia Hribal, Angela Sciacqua

{"title":"Cost-Sensitive Models to Predict Risk of Cardiovascular Events in Patients with Chronic Heart Failure","authors":"Maria Carmela Groccia, Rosita Guido, Domenico Conforti, Corrado Pelaia, Giuseppe Armentaro, Alfredo Francesco Toscani, Sofia Miceli, Elena Succurro, Marta Letizia Hribal, Angela Sciacqua","doi":"10.3390/info14100542","DOIUrl":null,"url":null,"abstract":"Chronic heart failure (CHF) is a clinical syndrome characterised by symptoms and signs due to structural and/or functional abnormalities of the heart. CHF confers risk for cardiovascular deterioration events which cause recurrent hospitalisations and high mortality rates. The early prediction of these events is very important to limit serious consequences, improve the quality of care, and reduce its burden. CHF is a progressive condition in which patients may remain asymptomatic before the onset of symptoms, as observed in heart failure with a preserved ejection fraction. The early detection of underlying causes is critical for treatment optimisation and prognosis improvement. To develop models to predict cardiovascular deterioration events in patients with chronic heart failure, a real dataset was constructed and a knowledge discovery task was implemented in this study. The dataset is imbalanced, as it is common in real-world applications. It thus posed a challenge because imbalanced datasets tend to be overwhelmed by the abundance of majority-class instances during the learning process. To address the issue, a pipeline was developed specifically to handle imbalanced data. Different predictive models were developed and compared. To enhance sensitivity and other performance metrics, we employed multiple approaches, including data resampling, cost-sensitive methods, and a hybrid method that combines both techniques. These methods were utilised to assess the predictive capabilities of the models and their effectiveness in handling imbalanced data. By using these metrics, we aimed to identify the most effective strategies for achieving improved model performance in real scenarios with imbalanced datasets. The best model for predicting cardiovascular events achieved mean a sensitivity 65%, a mean specificity 55%, and a mean area under the curve of 0.71. The results show that cost-sensitive models combined with over/under sampling approaches are effective for the meaningful prediction of cardiovascular events in CHF patients.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"113 1","pages":"0"},"PeriodicalIF":2.9000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information (Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/info14100542","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Chronic heart failure (CHF) is a clinical syndrome characterised by symptoms and signs due to structural and/or functional abnormalities of the heart. CHF confers risk for cardiovascular deterioration events which cause recurrent hospitalisations and high mortality rates. The early prediction of these events is very important to limit serious consequences, improve the quality of care, and reduce its burden. CHF is a progressive condition in which patients may remain asymptomatic before the onset of symptoms, as observed in heart failure with a preserved ejection fraction. The early detection of underlying causes is critical for treatment optimisation and prognosis improvement. To develop models to predict cardiovascular deterioration events in patients with chronic heart failure, a real dataset was constructed and a knowledge discovery task was implemented in this study. The dataset is imbalanced, as it is common in real-world applications. It thus posed a challenge because imbalanced datasets tend to be overwhelmed by the abundance of majority-class instances during the learning process. To address the issue, a pipeline was developed specifically to handle imbalanced data. Different predictive models were developed and compared. To enhance sensitivity and other performance metrics, we employed multiple approaches, including data resampling, cost-sensitive methods, and a hybrid method that combines both techniques. These methods were utilised to assess the predictive capabilities of the models and their effectiveness in handling imbalanced data. By using these metrics, we aimed to identify the most effective strategies for achieving improved model performance in real scenarios with imbalanced datasets. The best model for predicting cardiovascular events achieved mean a sensitivity 65%, a mean specificity 55%, and a mean area under the curve of 0.71. The results show that cost-sensitive models combined with over/under sampling approaches are effective for the meaningful prediction of cardiovascular events in CHF patients.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预测慢性心力衰竭患者心血管事件风险的成本敏感模型

慢性心力衰竭(CHF)是一种以心脏结构和/或功能异常引起的症状和体征为特征的临床综合征。CHF具有心血管恶化事件的风险，导致反复住院和高死亡率。这些事件的早期预测对于限制严重后果、提高护理质量和减轻负担非常重要。CHF是一种进行性疾病，患者在出现症状前可能仍无症状，如在射血分数保持不变的心力衰竭中观察到的那样。早期发现潜在病因对优化治疗和改善预后至关重要。为了建立预测慢性心力衰竭患者心血管恶化事件的模型，本研究构建了一个真实数据集，并实施了一个知识发现任务。数据集是不平衡的，这在实际应用程序中很常见。因此，它提出了一个挑战，因为在学习过程中，不平衡的数据集往往被大量的多数类实例所淹没。为了解决这个问题，专门开发了一个管道来处理不平衡数据。建立了不同的预测模型并进行了比较。为了提高灵敏度和其他性能指标，我们采用了多种方法，包括数据重采样、成本敏感方法，以及结合这两种技术的混合方法。这些方法被用来评估模型的预测能力及其在处理不平衡数据方面的有效性。通过使用这些指标，我们旨在确定在不平衡数据集的真实场景中实现改进模型性能的最有效策略。预测心血管事件的最佳模型平均灵敏度为65%，平均特异性为55%，平均曲线下面积为0.71。结果表明，成本敏感模型结合过采样/欠采样方法对CHF患者心血管事件的有意义预测是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊