Predicting major adverse cardiac events in diabetes and chronic kidney disease: a machine learning study from the Silesia Diabetes-Heart Project.

IF 10.6 1区 医学 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS Cardiovascular Diabetology Pub Date : 2025-02-15 DOI:10.1186/s12933-025-02615-w
Hanna Kwiendacz, Bi Huang, Yang Chen, Oliwia Janota, Krzysztof Irlik, Yang Liu, Marta Mantovani, Yalin Zheng, Mirela Hendel, Julia Piaśnik, Wiktoria Wójcik, Uazman Alam, Janusz Gumprecht, Gregory Y H Lip, Katarzyna Nabrdalik
{"title":"Predicting major adverse cardiac events in diabetes and chronic kidney disease: a machine learning study from the Silesia Diabetes-Heart Project.","authors":"Hanna Kwiendacz, Bi Huang, Yang Chen, Oliwia Janota, Krzysztof Irlik, Yang Liu, Marta Mantovani, Yalin Zheng, Mirela Hendel, Julia Piaśnik, Wiktoria Wójcik, Uazman Alam, Janusz Gumprecht, Gregory Y H Lip, Katarzyna Nabrdalik","doi":"10.1186/s12933-025-02615-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>People living with diabetes mellitus (DM) and chronic kidney disease (CKD) are at significantly high risk of cardiovascular events (CVEs), however the predictive performance of traditional risk prediction methods are limited.</p><p><strong>Methods: </strong>We utilised machine learning (ML) model to predict CVEs in persons with DM and CKD from the Silesia Diabetes-Heart Project, a routine standard of care dataset. CVEs were defined as composite of nonfatal myocardial infarction, new onset heart failure, nonfatal stroke, incident atrial fibrillation, undergoing percutaneous coronary intervention or coronary artery bypass grafting, hospitalisation or death due to cardiovascular disease. Five ML models (Logistic regression [LR], Random forest [RF], Support vector classification [SVC], Light gradient boosting machine [LGBM], and eXtreme gradient boosting machine [XGBM]) were constructed. The predictive performance of the five ML models was compared and the model interpretability were evaluated by Shapley Additive exPlanations (SHAP).</p><p><strong>Results: </strong>A total of 1,116 people with DM and CKD out of 3,056 with DM were included (median age 67 [IQR 57-76] years; 57% men). The incidence of CVEs was 14.1% (157/1,116) during a median of 3.1 years follow-up period. Ten important features were identified through univariate Logistic regression, Boruta, and Least Absolute Shrinkage and Selection Operator [LASSO] regression. Among the five ML models based on these features, LGBM had the highest area under curve [AUC] (AUC = 0.740, 95% Confidence Interval [CI] 0.738-0.743), followed by LR (AUC = 0.621, 95% CI 0.618-0.623), RF (AUC = 0.707, 95% CI 0.704-0.709), SVC (AUC = 0.707, 95% CI 0.704-0.710), and XGBM (AUC = 0.710, 95% CI 0.707-0.713). Meanwhile, LGBM had relatively higher Recall (0.739), F1-score (0.820), and G-mean (0.826). The SHAP plot of LGBM revealed that estimated glomerular filtration rate (eGFR), age, and triglyceride glucose index were the three most important features for predicting CVEs.</p><p><strong>Conclusion: </strong>Ten features-based ML models, especially the LGBM model, had acceptable performance in predicting CVEs in persons with DM and CKD. A decrease in eGFR, aging, and elevated inflammatory markers significantly enhanced the predictive capability of the model. Future external validation of our model is required prior to implementation in a clinical environment.</p>","PeriodicalId":9374,"journal":{"name":"Cardiovascular Diabetology","volume":"24 1","pages":"76"},"PeriodicalIF":10.6000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11829423/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiovascular Diabetology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12933-025-02615-w","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: People living with diabetes mellitus (DM) and chronic kidney disease (CKD) are at significantly high risk of cardiovascular events (CVEs), however the predictive performance of traditional risk prediction methods are limited.

Methods: We utilised machine learning (ML) model to predict CVEs in persons with DM and CKD from the Silesia Diabetes-Heart Project, a routine standard of care dataset. CVEs were defined as composite of nonfatal myocardial infarction, new onset heart failure, nonfatal stroke, incident atrial fibrillation, undergoing percutaneous coronary intervention or coronary artery bypass grafting, hospitalisation or death due to cardiovascular disease. Five ML models (Logistic regression [LR], Random forest [RF], Support vector classification [SVC], Light gradient boosting machine [LGBM], and eXtreme gradient boosting machine [XGBM]) were constructed. The predictive performance of the five ML models was compared and the model interpretability were evaluated by Shapley Additive exPlanations (SHAP).

Results: A total of 1,116 people with DM and CKD out of 3,056 with DM were included (median age 67 [IQR 57-76] years; 57% men). The incidence of CVEs was 14.1% (157/1,116) during a median of 3.1 years follow-up period. Ten important features were identified through univariate Logistic regression, Boruta, and Least Absolute Shrinkage and Selection Operator [LASSO] regression. Among the five ML models based on these features, LGBM had the highest area under curve [AUC] (AUC = 0.740, 95% Confidence Interval [CI] 0.738-0.743), followed by LR (AUC = 0.621, 95% CI 0.618-0.623), RF (AUC = 0.707, 95% CI 0.704-0.709), SVC (AUC = 0.707, 95% CI 0.704-0.710), and XGBM (AUC = 0.710, 95% CI 0.707-0.713). Meanwhile, LGBM had relatively higher Recall (0.739), F1-score (0.820), and G-mean (0.826). The SHAP plot of LGBM revealed that estimated glomerular filtration rate (eGFR), age, and triglyceride glucose index were the three most important features for predicting CVEs.

Conclusion: Ten features-based ML models, especially the LGBM model, had acceptable performance in predicting CVEs in persons with DM and CKD. A decrease in eGFR, aging, and elevated inflammatory markers significantly enhanced the predictive capability of the model. Future external validation of our model is required prior to implementation in a clinical environment.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
预测糖尿病和慢性肾脏疾病的主要不良心脏事件:来自西里西亚糖尿病心脏项目的机器学习研究。
背景:糖尿病(DM)和慢性肾脏疾病(CKD)患者是心血管事件(cve)的高危人群,但传统的风险预测方法预测效果有限。方法:我们利用机器学习(ML)模型预测来自Silesia糖尿病-心脏项目(常规标准护理数据集)的DM和CKD患者的cve。cve被定义为非致死性心肌梗死、新发心力衰竭、非致死性中风、房颤、经皮冠状动脉介入治疗或冠状动脉旁路移植术、住院或因心血管疾病死亡的组合。构建了逻辑回归(Logistic regression, LR)、随机森林(Random forest, RF)、支持向量分类(Support vector classification, SVC)、轻梯度增强机(Light gradient boosting machine, LGBM)和极限梯度增强机(eXtreme gradient boosting machine, XGBM) 5种机器学习模型。比较5种ML模型的预测性能,并采用Shapley加性解释(SHAP)评价模型的可解释性。结果:在3056名糖尿病患者中,共有1116名糖尿病合并CKD患者被纳入研究(中位年龄67岁;57%的男性)。在中位3.1年的随访期间,cve的发生率为14.1%(157/1,116)。通过单变量Logistic回归、Boruta和最小绝对收缩和选择算子[LASSO]回归确定了10个重要特征。在基于这些特征的5个ML模型中,LGBM的曲线下面积(AUC)最高(AUC = 0.740, 95%可信区间[CI] 0.738 ~ 0.743),其次是LR (AUC = 0.621, 95% CI 0.618 ~ 0.623)、RF (AUC = 0.707, 95% CI 0.704 ~ 0.709)、SVC (AUC = 0.707, 95% CI 0.704 ~ 0.710)和XGBM (AUC = 0.710, 95% CI 0.707 ~ 0.713)。同时,LGBM具有较高的召回率(0.739)、f1评分(0.820)和g均值(0.826)。LGBM的SHAP图显示,估计的肾小球滤过率(eGFR)、年龄和甘油三酯葡萄糖指数是预测cve的三个最重要的特征。结论:10种基于特征的ML模型,尤其是LGBM模型,在预测DM和CKD患者cve方面具有良好的性能。eGFR的降低、衰老和炎症标志物的升高显著增强了模型的预测能力。在临床环境中实施之前,需要对我们的模型进行未来的外部验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Cardiovascular Diabetology
Cardiovascular Diabetology 医学-内分泌学与代谢
CiteScore
12.30
自引率
15.10%
发文量
240
审稿时长
1 months
期刊介绍: Cardiovascular Diabetology is a journal that welcomes manuscripts exploring various aspects of the relationship between diabetes, cardiovascular health, and the metabolic syndrome. We invite submissions related to clinical studies, genetic investigations, experimental research, pharmacological studies, epidemiological analyses, and molecular biology research in this field.
期刊最新文献
Drug targets for lipid modification and risk of type 2 diabetes: a cis-Mendelian randomization study. Correction: The mineralocorticoid receptor antagonist finerenone enhances cardiovascular recovery upon food intake normalization in obese mice. Beyond glycemic control: rethinking the diabetic heart : Comment on: "Dapagliflozin-induced integrated improvements in left ventricular diastole, endothelial function, and arterial load: a randomized clinical trial". Evaluation of the lipid accumulation product for MASLD risk stratification in type 2 diabetes: establishing sex-specific thresholds and clinical utility. Association of cumulative exposure and dynamic trajectories of the C-reactive protein-triglyceride-glucose and its modified indices with cardiovascular disease in individuals with cardiovascular-kidney-metabolic syndrome stages 0-3: a longitudinal analysis based on CHARLS.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1