{"title":"机器学习利用收缩压时间间隔和常规收集的临床数据预测急性心肌梗死后的长期死亡率","authors":"Bijan Roudini , Boshra Khajehpiri , Hamid Abrishami Moghaddam , Mohamad Forouzanfar","doi":"10.1016/j.imed.2024.01.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Precise estimation of current and future comorbidities of patients with cardiovascular disease is an important factor in prioritizing continuous physiological monitoring and new therapies. Machine learning (ML) models have shown satisfactory performance in short-term mortality prediction in patients with heart disease, whereas their utility in long-term predictions is limited. This study aimed to investigate the performance of tree-based ML models on long-term mortality prediction and effect of two recently introduced biomarkers on long-term mortality.</p></div><div><h3>Methods</h3><p>This study used publicly available data from the Collaboration Center of Health Information Application at the Ministry of Health and Welfare, Taiwan, China. The collected data were from patients admitted to the cardiac care unit for acute myocardial infarction (AMI) between November 2003 and September 2004. We collected and analyzed mortality data up to December 2018. Medical records were used to gather demographic and clinical data, including age, gender, body mass index, percutaneous coronary intervention status, and comorbidities such as hypertension, dyslipidemia, ST-segment elevation myocardial infarction, and non-ST-segment elevation myocardial infarction. Using the data, collected from 139 patients with AMI, from medical and demographic records as well as two recently introduced biomarkers, brachial pre-ejection period (bPEP) and brachial ejection time (bET), we investigated the performance of advanced ensemble tree-based ML algorithms (random forest, AdaBoost, and XGBoost) to predict all-cause mortality within 14 years. A nested cross-validation was performed to evaluate and compare the performance of our developed models precisely with that of the conventional logistic regression (LR) as the baseline method.</p></div><div><h3>Results</h3><p>The developed ML models achieved significantly better performance compared to the baseline LR (C-Statistic, 0.80 for random forest, 0.79 for AdaBoost, and 0.78 for XGBoost, <em>vs</em>. 0.77 for LR) (<em>P</em><sub>RF</sub> < 0.001, <em>P</em><sub>AdaBoost</sub> < 0.001, and <em>P</em><sub>XGBoost</sub> < 0.05). Adding bPEP and bET to our feature set significantly improved the performance of the algorithm, leading to an absolute increase in C-statistic of up to 0.03 (C-statistic, 0.83 for random forest, 0.82 for AdaBoost, and 0.80 for XGBoost, <em>vs</em>. 0.74 for LR) (<em>P</em><sub>RF</sub> < 0.001, <em>P</em><sub>AdaBoost</sub> < 0.001, <em>P</em><sub>XGBoost</sub> < 0.05).</p></div><div><h3>Conclusion</h3><p>The study indicates that incorporating new biomarkers into advanced ML models may significantly improve long-term mortality prediction in patients with cardiovascular diseases. This advancement may enable better treatment prioritization for high-risk individuals.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 170-176"},"PeriodicalIF":4.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102624000329/pdfft?md5=039b96bf56f33e4f8342d2c062d97570&pid=1-s2.0-S2667102624000329-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Machine learning predicts long-term mortality after acute myocardial infarction using systolic time intervals and routinely collected clinical data\",\"authors\":\"Bijan Roudini , Boshra Khajehpiri , Hamid Abrishami Moghaddam , Mohamad Forouzanfar\",\"doi\":\"10.1016/j.imed.2024.01.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Precise estimation of current and future comorbidities of patients with cardiovascular disease is an important factor in prioritizing continuous physiological monitoring and new therapies. Machine learning (ML) models have shown satisfactory performance in short-term mortality prediction in patients with heart disease, whereas their utility in long-term predictions is limited. This study aimed to investigate the performance of tree-based ML models on long-term mortality prediction and effect of two recently introduced biomarkers on long-term mortality.</p></div><div><h3>Methods</h3><p>This study used publicly available data from the Collaboration Center of Health Information Application at the Ministry of Health and Welfare, Taiwan, China. The collected data were from patients admitted to the cardiac care unit for acute myocardial infarction (AMI) between November 2003 and September 2004. We collected and analyzed mortality data up to December 2018. Medical records were used to gather demographic and clinical data, including age, gender, body mass index, percutaneous coronary intervention status, and comorbidities such as hypertension, dyslipidemia, ST-segment elevation myocardial infarction, and non-ST-segment elevation myocardial infarction. Using the data, collected from 139 patients with AMI, from medical and demographic records as well as two recently introduced biomarkers, brachial pre-ejection period (bPEP) and brachial ejection time (bET), we investigated the performance of advanced ensemble tree-based ML algorithms (random forest, AdaBoost, and XGBoost) to predict all-cause mortality within 14 years. A nested cross-validation was performed to evaluate and compare the performance of our developed models precisely with that of the conventional logistic regression (LR) as the baseline method.</p></div><div><h3>Results</h3><p>The developed ML models achieved significantly better performance compared to the baseline LR (C-Statistic, 0.80 for random forest, 0.79 for AdaBoost, and 0.78 for XGBoost, <em>vs</em>. 0.77 for LR) (<em>P</em><sub>RF</sub> < 0.001, <em>P</em><sub>AdaBoost</sub> < 0.001, and <em>P</em><sub>XGBoost</sub> < 0.05). Adding bPEP and bET to our feature set significantly improved the performance of the algorithm, leading to an absolute increase in C-statistic of up to 0.03 (C-statistic, 0.83 for random forest, 0.82 for AdaBoost, and 0.80 for XGBoost, <em>vs</em>. 0.74 for LR) (<em>P</em><sub>RF</sub> < 0.001, <em>P</em><sub>AdaBoost</sub> < 0.001, <em>P</em><sub>XGBoost</sub> < 0.05).</p></div><div><h3>Conclusion</h3><p>The study indicates that incorporating new biomarkers into advanced ML models may significantly improve long-term mortality prediction in patients with cardiovascular diseases. This advancement may enable better treatment prioritization for high-risk individuals.</p></div>\",\"PeriodicalId\":73400,\"journal\":{\"name\":\"Intelligent medicine\",\"volume\":\"4 3\",\"pages\":\"Pages 170-176\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2667102624000329/pdfft?md5=039b96bf56f33e4f8342d2c062d97570&pid=1-s2.0-S2667102624000329-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligent medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667102624000329\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667102624000329","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
摘要
背景精确估计心血管疾病患者当前和未来的合并症是优先考虑持续生理监测和新疗法的一个重要因素。机器学习(ML)模型在心脏病患者的短期死亡率预测中表现令人满意,但在长期预测中的作用有限。本研究旨在调查基于树的 ML 模型在长期死亡率预测中的表现,以及最近引入的两种生物标志物对长期死亡率的影响。所收集的数据来自 2003 年 11 月至 2004 年 9 月期间因急性心肌梗死(AMI)入住心脏监护室的患者。我们收集并分析了截至 2018 年 12 月的死亡率数据。病历用于收集人口统计学和临床数据,包括年龄、性别、体重指数、经皮冠状动脉介入治疗情况以及高血压、血脂异常、ST段抬高型心肌梗死和非ST段抬高型心肌梗死等合并症。我们利用从 139 名急性心肌梗死患者的医疗和人口学记录中收集的数据,以及最近推出的两个生物标志物--肱骨射血前时间(bPEP)和肱骨射血时间(bET),研究了基于高级集合树的 ML 算法(随机森林、AdaBoost 和 XGBoost)预测 14 年内全因死亡率的性能。通过嵌套交叉验证来评估和比较我们开发的模型与作为基线方法的传统逻辑回归(LR)的性能。结果与基线逻辑回归相比,所开发的 ML 模型取得了明显更好的性能(C-统计量,随机森林为 0.80,AdaBoost 为 0.79,XGBoost 为 0.78,LR 为 0.77)(PRF < 0.001,PAdaBoost < 0.001,PXGBoost < 0.05)。在特征集中添加 bPEP 和 bET 能显著提高算法的性能,使 C 统计量的绝对值提高了 0.03(随机森林的 C 统计量为 0.83,AdaBoost 为 0.82,XGBoost 为 0.80,而 LR 为 0.74)。74 for LR)(PRF <0.001,PAdaBoost <0.001,PXGBoost <0.05)。结论该研究表明,将新的生物标记物纳入高级 ML 模型可显著改善心血管疾病患者的长期死亡率预测。这种进步可以更好地确定高危人群的治疗优先次序。
Machine learning predicts long-term mortality after acute myocardial infarction using systolic time intervals and routinely collected clinical data
Background
Precise estimation of current and future comorbidities of patients with cardiovascular disease is an important factor in prioritizing continuous physiological monitoring and new therapies. Machine learning (ML) models have shown satisfactory performance in short-term mortality prediction in patients with heart disease, whereas their utility in long-term predictions is limited. This study aimed to investigate the performance of tree-based ML models on long-term mortality prediction and effect of two recently introduced biomarkers on long-term mortality.
Methods
This study used publicly available data from the Collaboration Center of Health Information Application at the Ministry of Health and Welfare, Taiwan, China. The collected data were from patients admitted to the cardiac care unit for acute myocardial infarction (AMI) between November 2003 and September 2004. We collected and analyzed mortality data up to December 2018. Medical records were used to gather demographic and clinical data, including age, gender, body mass index, percutaneous coronary intervention status, and comorbidities such as hypertension, dyslipidemia, ST-segment elevation myocardial infarction, and non-ST-segment elevation myocardial infarction. Using the data, collected from 139 patients with AMI, from medical and demographic records as well as two recently introduced biomarkers, brachial pre-ejection period (bPEP) and brachial ejection time (bET), we investigated the performance of advanced ensemble tree-based ML algorithms (random forest, AdaBoost, and XGBoost) to predict all-cause mortality within 14 years. A nested cross-validation was performed to evaluate and compare the performance of our developed models precisely with that of the conventional logistic regression (LR) as the baseline method.
Results
The developed ML models achieved significantly better performance compared to the baseline LR (C-Statistic, 0.80 for random forest, 0.79 for AdaBoost, and 0.78 for XGBoost, vs. 0.77 for LR) (PRF < 0.001, PAdaBoost < 0.001, and PXGBoost < 0.05). Adding bPEP and bET to our feature set significantly improved the performance of the algorithm, leading to an absolute increase in C-statistic of up to 0.03 (C-statistic, 0.83 for random forest, 0.82 for AdaBoost, and 0.80 for XGBoost, vs. 0.74 for LR) (PRF < 0.001, PAdaBoost < 0.001, PXGBoost < 0.05).
Conclusion
The study indicates that incorporating new biomarkers into advanced ML models may significantly improve long-term mortality prediction in patients with cardiovascular diseases. This advancement may enable better treatment prioritization for high-risk individuals.