用机器学习预测心力衰竭的死亡率。与统计建模的比较。

IF 8.1 2区医学 Q1 MEDICINE, GENERAL & INTERNAL European Journal of Internal Medicine Pub Date : 2025-03-01 Epub Date: 2025-01-28 DOI:10.1016/j.ejim.2025.01.020

Domenico Scrutinio , Federica Amitrano , Pietro Guida , Armando Coccia , Gaetano Pagano , Gianni D'addio , Andrea Passantino

{"title":"用机器学习预测心力衰竭的死亡率。与统计建模的比较。","authors":"Domenico Scrutinio , Federica Amitrano , Pietro Guida , Armando Coccia , Gaetano Pagano , Gianni D'addio , Andrea Passantino","doi":"10.1016/j.ejim.2025.01.020","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Assessing the relative performance of machine learning (ML) methods and conventional statistical methods in predicting prognosis in heart failure (HF) still remains a challenging research field.</div></div><div><h3>Methods</h3><div>The primary outcome was three-year mortality. The following 5 machine learning approaches were used for modeling: Random Forest (RF), Gradient Boosting, Extreme Gradient Boosting (XGBoost), Support Vector Machine, and Multilayer perceptron. We compared the performance of the best performing ML models to the MAGGIC (Meta-analysis Global Group in Chronic Heart Failure) score and a novel logistic regression model (LRM) developed using the same set of variables used to develop the machine learning models. The performance was determined based on discrimination, calibration, and net benefit.</div></div><div><h3>Results</h3><div>The XGBoost and the RF were the best performing ML models. The XGBoost provided the highest discrimination (C-statistic: 0.793) and the lowest Brier score (0.178); the RF model had a C-statistic of 0.779 and provided the highest area under the precision‐recall curve (0.636). Both models were well calibrated. Both the XGboost and RF models outperformed MAGGIC score. The LRM had a C-statistic of 0.811 and a Brier score of 0.160 and was well calibrated. The XGBoost, RF, and LRM gave a higher net benefit than MAGGIC score; the XGBoost, RF, and logistic regression model gave similar net benefit.</div></div><div><h3>Conclusions</h3><div>RF and XGBoost models outperformed MAGGIC in predicting mortality. However, they did not offer any improvement over a logistic regression model built using the same set of covariates considered in the ML modeling.</div></div>","PeriodicalId":50485,"journal":{"name":"European Journal of Internal Medicine","volume":"133 ","pages":"Pages 106-112"},"PeriodicalIF":8.1000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of mortality in heart failure by machine learning. Comparison with statistical modeling\",\"authors\":\"Domenico Scrutinio , Federica Amitrano , Pietro Guida , Armando Coccia , Gaetano Pagano , Gianni D'addio , Andrea Passantino\",\"doi\":\"10.1016/j.ejim.2025.01.020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Assessing the relative performance of machine learning (ML) methods and conventional statistical methods in predicting prognosis in heart failure (HF) still remains a challenging research field.</div></div><div><h3>Methods</h3><div>The primary outcome was three-year mortality. The following 5 machine learning approaches were used for modeling: Random Forest (RF), Gradient Boosting, Extreme Gradient Boosting (XGBoost), Support Vector Machine, and Multilayer perceptron. We compared the performance of the best performing ML models to the MAGGIC (Meta-analysis Global Group in Chronic Heart Failure) score and a novel logistic regression model (LRM) developed using the same set of variables used to develop the machine learning models. The performance was determined based on discrimination, calibration, and net benefit.</div></div><div><h3>Results</h3><div>The XGBoost and the RF were the best performing ML models. The XGBoost provided the highest discrimination (C-statistic: 0.793) and the lowest Brier score (0.178); the RF model had a C-statistic of 0.779 and provided the highest area under the precision‐recall curve (0.636). Both models were well calibrated. Both the XGboost and RF models outperformed MAGGIC score. The LRM had a C-statistic of 0.811 and a Brier score of 0.160 and was well calibrated. The XGBoost, RF, and LRM gave a higher net benefit than MAGGIC score; the XGBoost, RF, and logistic regression model gave similar net benefit.</div></div><div><h3>Conclusions</h3><div>RF and XGBoost models outperformed MAGGIC in predicting mortality. However, they did not offer any improvement over a logistic regression model built using the same set of covariates considered in the ML modeling.</div></div>\",\"PeriodicalId\":50485,\"journal\":{\"name\":\"European Journal of Internal Medicine\",\"volume\":\"133 \",\"pages\":\"Pages 106-112\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Internal Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0953620525000305\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Internal Medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0953620525000305","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

摘要

背景：评估机器学习（ML）方法与传统统计方法在预测心力衰竭（HF）预后方面的相对性能仍然是一个具有挑战性的研究领域。方法：主要观察指标为三年死亡率。以下5种机器学习方法用于建模：随机森林（RF），梯度增强，极端梯度增强（XGBoost），支持向量机和多层感知机。我们将表现最佳的ML模型的性能与MAGGIC（全球慢性心力衰竭荟萃分析组）评分和使用用于开发机器学习模型的相同变量集开发的新型逻辑回归模型（LRM）进行了比较。性能是根据鉴别、校准和净效益来确定的。结果：XGBoost和RF是表现最好的ML模型。XGBoost的判别性最高（c统计量为0.793），Brier评分最低（0.178）；RF模型的c统计量为0.779，在精密度-召回率曲线下的面积最大（0.636）。两个模型都经过了很好的校准。XGboost和RF模型都优于magic得分。LRM的c统计量为0.811，Brier评分为0.160，校正效果良好。XGBoost、RF和LRM的净收益高于magic评分；XGBoost、RF和逻辑回归模型给出了类似的净收益。结论：RF和XGBoost模型在预测死亡率方面优于MAGGIC。然而，他们没有提供任何改进的逻辑回归模型，该模型使用的是ML建模中考虑的同一组协变量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Prediction of mortality in heart failure by machine learning. Comparison with statistical modeling

Background

Assessing the relative performance of machine learning (ML) methods and conventional statistical methods in predicting prognosis in heart failure (HF) still remains a challenging research field.

Methods

The primary outcome was three-year mortality. The following 5 machine learning approaches were used for modeling: Random Forest (RF), Gradient Boosting, Extreme Gradient Boosting (XGBoost), Support Vector Machine, and Multilayer perceptron. We compared the performance of the best performing ML models to the MAGGIC (Meta-analysis Global Group in Chronic Heart Failure) score and a novel logistic regression model (LRM) developed using the same set of variables used to develop the machine learning models. The performance was determined based on discrimination, calibration, and net benefit.

Results

The XGBoost and the RF were the best performing ML models. The XGBoost provided the highest discrimination (C-statistic: 0.793) and the lowest Brier score (0.178); the RF model had a C-statistic of 0.779 and provided the highest area under the precision‐recall curve (0.636). Both models were well calibrated. Both the XGboost and RF models outperformed MAGGIC score. The LRM had a C-statistic of 0.811 and a Brier score of 0.160 and was well calibrated. The XGBoost, RF, and LRM gave a higher net benefit than MAGGIC score; the XGBoost, RF, and logistic regression model gave similar net benefit.

Conclusions

RF and XGBoost models outperformed MAGGIC in predicting mortality. However, they did not offer any improvement over a logistic regression model built using the same set of covariates considered in the ML modeling.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

European Journal of Internal Medicine 医学-医学：内科

CiteScore

9.60

自引率

6.20%

发文量

364

审稿时长

20 days

期刊介绍： The European Journal of Internal Medicine serves as the official journal of the European Federation of Internal Medicine and is the primary scientific reference for European academic and non-academic internists. It is dedicated to advancing science and practice in internal medicine across Europe. The journal publishes original articles, editorials, reviews, internal medicine flashcards, and other relevant information in the field. Both translational medicine and clinical studies are emphasized. EJIM aspires to be a leading platform for excellent clinical studies, with a focus on enhancing the quality of healthcare in European hospitals.