Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.

IF 5 Q1 GERIATRICS & GERONTOLOGY JMIR Aging Pub Date : 2024-07-26 DOI:10.2196/54872
Chang Liu, Kai Zhang, Xiaodong Yang, Bingbing Meng, Jingsheng Lou, Yanhong Liu, Jiangbei Cao, Kexuan Liu, Weidong Mi, Hao Li
{"title":"Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.","authors":"Chang Liu, Kai Zhang, Xiaodong Yang, Bingbing Meng, Jingsheng Lou, Yanhong Liu, Jiangbei Cao, Kexuan Liu, Weidong Mi, Hao Li","doi":"10.2196/54872","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Myocardial injury after noncardiac surgery (MINS) is an easily overlooked complication but closely related to postoperative cardiovascular adverse outcomes; therefore, the early diagnosis and prediction are particularly important.</p><p><strong>Objective: </strong>We aimed to develop and validate an explainable machine learning (ML) model for predicting MINS among older patients undergoing noncardiac surgery.</p><p><strong>Methods: </strong>The retrospective cohort study included older patients who had noncardiac surgery from 1 northern center and 1 southern center in China. The data sets from center 1 were divided into a training set and an internal validation set. The data set from center 2 was used as an external validation set. Before modeling, the least absolute shrinkage and selection operator and recursive feature elimination methods were used to reduce dimensions of data and select key features from all variables. Prediction models were developed based on the extracted features using several ML algorithms, including category boosting, random forest, logistic regression, naïve Bayes, light gradient boosting machine, extreme gradient boosting, support vector machine, and decision tree. Prediction performance was assessed by the area under the receiver operating characteristic (AUROC) curve as the main evaluation metric to select the best algorithms. The model performance was verified by internal and external validation data sets with the best algorithm and compared to the Revised Cardiac Risk Index. The Shapley Additive Explanations (SHAP) method was applied to calculate values for each feature, representing the contribution to the predicted risk of complication, and generate personalized explanations.</p><p><strong>Results: </strong>A total of 19,463 eligible patients were included; among those, 12,464 patients in center 1 were included as the training set; 4754 patients in center 1 were included as the internal validation set; and 2245 in center 2 were included as the external validation set. The best-performing model for prediction was the CatBoost algorithm, achieving the highest AUROC of 0.805 (95% CI 0.778-0.831) in the training set, validating with an AUROC of 0.780 in the internal validation set and 0.70 in external validation set. Additionally, CatBoost demonstrated superior performance compared to the Revised Cardiac Risk Index (AUROC 0.636; P<.001). The SHAP values indicated the ranking of the level of importance of each variable, with preoperative serum creatinine concentration, red blood cell distribution width, and age accounting for the top three. The results from the SHAP method can predict events with positive values or nonevents with negative values, providing an explicit explanation of individualized risk predictions.</p><p><strong>Conclusions: </strong>The ML models can provide a personalized and fairly accurate risk prediction of MINS, and the explainable perspective can help identify potentially modifiable sources of risk at the patient level.</p>","PeriodicalId":36245,"journal":{"name":"JMIR Aging","volume":"7 ","pages":"e54872"},"PeriodicalIF":5.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11294761/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Aging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/54872","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Myocardial injury after noncardiac surgery (MINS) is an easily overlooked complication but closely related to postoperative cardiovascular adverse outcomes; therefore, the early diagnosis and prediction are particularly important.

Objective: We aimed to develop and validate an explainable machine learning (ML) model for predicting MINS among older patients undergoing noncardiac surgery.

Methods: The retrospective cohort study included older patients who had noncardiac surgery from 1 northern center and 1 southern center in China. The data sets from center 1 were divided into a training set and an internal validation set. The data set from center 2 was used as an external validation set. Before modeling, the least absolute shrinkage and selection operator and recursive feature elimination methods were used to reduce dimensions of data and select key features from all variables. Prediction models were developed based on the extracted features using several ML algorithms, including category boosting, random forest, logistic regression, naïve Bayes, light gradient boosting machine, extreme gradient boosting, support vector machine, and decision tree. Prediction performance was assessed by the area under the receiver operating characteristic (AUROC) curve as the main evaluation metric to select the best algorithms. The model performance was verified by internal and external validation data sets with the best algorithm and compared to the Revised Cardiac Risk Index. The Shapley Additive Explanations (SHAP) method was applied to calculate values for each feature, representing the contribution to the predicted risk of complication, and generate personalized explanations.

Results: A total of 19,463 eligible patients were included; among those, 12,464 patients in center 1 were included as the training set; 4754 patients in center 1 were included as the internal validation set; and 2245 in center 2 were included as the external validation set. The best-performing model for prediction was the CatBoost algorithm, achieving the highest AUROC of 0.805 (95% CI 0.778-0.831) in the training set, validating with an AUROC of 0.780 in the internal validation set and 0.70 in external validation set. Additionally, CatBoost demonstrated superior performance compared to the Revised Cardiac Risk Index (AUROC 0.636; P<.001). The SHAP values indicated the ranking of the level of importance of each variable, with preoperative serum creatinine concentration, red blood cell distribution width, and age accounting for the top three. The results from the SHAP method can predict events with positive values or nonevents with negative values, providing an explicit explanation of individualized risk predictions.

Conclusions: The ML models can provide a personalized and fairly accurate risk prediction of MINS, and the explainable perspective can help identify potentially modifiable sources of risk at the patient level.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在中国两个中心开发和验证用于预测非心脏手术后心肌损伤的可解释机器学习模型:回顾性研究
背景:非心脏手术后心肌损伤(MINS)是一种容易被忽视的并发症,但与术后心血管不良结局密切相关;因此,早期诊断和预测尤为重要:我们旨在开发并验证一种可解释的机器学习(ML)模型,用于预测接受非心脏手术的老年患者的 MINS:这项回顾性队列研究纳入了来自中国北方和南方各一个中心的非心脏手术老年患者。第一中心的数据集分为训练集和内部验证集。第二中心的数据集作为外部验证集。建模前,使用最小绝对收缩和选择算子以及递归特征消除方法来减少数据维数,并从所有变量中选择关键特征。根据提取的特征,使用多种 ML 算法建立预测模型,包括类别提升、随机森林、逻辑回归、奈夫贝叶斯、轻梯度提升机、极梯度提升、支持向量机和决策树。预测性能以接收者操作特征曲线下面积(AUROC)作为主要评估指标,以选出最佳算法。使用最佳算法的内部和外部验证数据集验证了模型的性能,并与修订版心脏风险指数进行了比较。应用沙普利加法解释(SHAP)方法计算每个特征的值,代表对预测并发症风险的贡献,并生成个性化解释:共纳入 19463 名符合条件的患者;其中,中心 1 的 12464 名患者被纳入训练集;中心 1 的 4754 名患者被纳入内部验证集;中心 2 的 2245 名患者被纳入外部验证集。预测效果最好的模型是 CatBoost 算法,其训练集的 AUROC 为 0.805(95% CI 0.778-0.831),内部验证集的 AUROC 为 0.780,外部验证集的 AUROC 为 0.70。此外,CatBoost 与修订版心脏风险指数(AUROC 0.636;PConclusions:ML模型可以提供个性化的、相当准确的MINS风险预测,可解释的视角有助于在患者层面识别潜在的可改变的风险来源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
JMIR Aging
JMIR Aging Social Sciences-Health (social science)
CiteScore
6.50
自引率
4.10%
发文量
71
审稿时长
12 weeks
期刊最新文献
Subjective Cognitive Concerns are Associated with Worse Performance on Mobile-App Based Cognitive Assessment: An Observational Study in Cognitively Normal Older Adults. Examining Whether Patient Portal and Video Visit Use Differs by Race and Ethnicity Among Older Adults in a US Integrated Health Care Delivery System: Cross-Sectional Electronic Health Record and Survey-Based Study. Exploring the Landscape of Standards and Guidelines in AgeTech Design and Development: Scoping Review and Thematic Analysis. Decoding the Influence of eHealth on Autonomy, Competence, and Relatedness in Older Adults: Qualitative Analysis of Self-Determination Through the Motivational Technology Model. Interrelationships Among Individual Factors, Family Factors, and Quality of Life in Older Chinese Adults: Cross-Sectional Study Using Structural Equation Modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1