Xiao Luo , Binghan Li , Ronghui Zhu , Yaoyong Tai , Zongyu Wang , Qian He , Yanfang Zhao , Xiaoying Bi , Cheng Wu
{"title":"Development and validation of an interpretable machine learning model for predicting in-hospital mortality for ischemic stroke patients in ICU","authors":"Xiao Luo , Binghan Li , Ronghui Zhu , Yaoyong Tai , Zongyu Wang , Qian He , Yanfang Zhao , Xiaoying Bi , Cheng Wu","doi":"10.1016/j.ijmedinf.2025.105874","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Timely and accurate outcome prediction is essential for clinical decision-making for ischemic stroke patients in the intensive care unit (ICU). However, the interpretation and translation of predictive models into clinical applications are equally crucial. This study aims to develop an interpretable machine learning (IML) model that effectively predicts in-hospital mortality for ischemic stroke patients.</div></div><div><h3>Methods</h3><div>In this study, an IML model was developed and validated using multicenter cohorts of 3225 ischemic stroke patients admitted to the ICU. Nine machine learning (ML) models, including logistic regression (LR), K-nearest neighbors (KNN), naive Bayes (NB), decision tree (DT), support vector machine (SVM), random forest (RF), XGBoost, LightGBM, and artificial neural network (ANN), were developed to predict in-hospital mortality using data from the MIMIC-IV and externally validated in Shanghai Changhai Hospital. Feature selection was conducted using three algorithms. Model’s performance was assessed using area under the receiver operating characteristic (AUROC), accuracy, sensitivity, specificity and F1 score. Calibration curve and Brier score were used to evaluate the degree of calibration of the model, and decision curve analysis were generated to assess the net clinical benefit. Additionally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of in-hospital mortality among ischemic stroke patients admitted to the ICU.</div></div><div><h3>Results</h3><div>Mechanical ventilation, age, statins, white blood cell, blood urea nitrogen, hematocrit, warfarin, bicarbonate and systolic blood pressure were selected as the nine most influential variables. The RF model demonstrated the most robust predictive performance, achieving AUROC values of 0.908 and 0.858 in the testing set and external validation set, respectively. Calibration curves also revealed a high consistency between observations and predictions. Decision curve analysis showed that the model had the greatest net benefit rate when the prediction probability threshold is 0.10 ∼ 0.80. SHAP was employed to interpret the RF model. In addition, we have developed an online prediction calculator for ischemic stroke patients.</div></div><div><h3>Conclusion</h3><div>This study develops a machine learning–based calculator to predict the probability of in-hospital mortality among patients with ischemic stroke in ICU. The calculator has the potential to guide clinical decision-making and improve the care of patients with ischemic stroke by identifying patients at a higher risk of in-hospital mortality.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"198 ","pages":"Article 105874"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625000917","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Timely and accurate outcome prediction is essential for clinical decision-making for ischemic stroke patients in the intensive care unit (ICU). However, the interpretation and translation of predictive models into clinical applications are equally crucial. This study aims to develop an interpretable machine learning (IML) model that effectively predicts in-hospital mortality for ischemic stroke patients.
Methods
In this study, an IML model was developed and validated using multicenter cohorts of 3225 ischemic stroke patients admitted to the ICU. Nine machine learning (ML) models, including logistic regression (LR), K-nearest neighbors (KNN), naive Bayes (NB), decision tree (DT), support vector machine (SVM), random forest (RF), XGBoost, LightGBM, and artificial neural network (ANN), were developed to predict in-hospital mortality using data from the MIMIC-IV and externally validated in Shanghai Changhai Hospital. Feature selection was conducted using three algorithms. Model’s performance was assessed using area under the receiver operating characteristic (AUROC), accuracy, sensitivity, specificity and F1 score. Calibration curve and Brier score were used to evaluate the degree of calibration of the model, and decision curve analysis were generated to assess the net clinical benefit. Additionally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of in-hospital mortality among ischemic stroke patients admitted to the ICU.
Results
Mechanical ventilation, age, statins, white blood cell, blood urea nitrogen, hematocrit, warfarin, bicarbonate and systolic blood pressure were selected as the nine most influential variables. The RF model demonstrated the most robust predictive performance, achieving AUROC values of 0.908 and 0.858 in the testing set and external validation set, respectively. Calibration curves also revealed a high consistency between observations and predictions. Decision curve analysis showed that the model had the greatest net benefit rate when the prediction probability threshold is 0.10 ∼ 0.80. SHAP was employed to interpret the RF model. In addition, we have developed an online prediction calculator for ischemic stroke patients.
Conclusion
This study develops a machine learning–based calculator to predict the probability of in-hospital mortality among patients with ischemic stroke in ICU. The calculator has the potential to guide clinical decision-making and improve the care of patients with ischemic stroke by identifying patients at a higher risk of in-hospital mortality.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.