Ruimin Tan, Chen Ge, Jingmei Wang, Zinan Yang, He Guo, Yating Yan, Quansheng Du
{"title":"Interpretable machine learning model for early morbidity risk prediction in patients with sepsis-induced coagulopathy: a multi-center study.","authors":"Ruimin Tan, Chen Ge, Jingmei Wang, Zinan Yang, He Guo, Yating Yan, Quansheng Du","doi":"10.3389/fimmu.2025.1552265","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sepsis-induced coagulopathy (SIC) is a complex condition characterized by systemic inflammation and coagulopathy. This study aimed to develop and validate a machine learning (ML) model to predict SIC risk in patients with sepsis.</p><p><strong>Methods: </strong>Patients with sepsis admitted to the intensive care unit (ICU) between March 1, 2021, and March 1, 2024, at Hebei General Hospital and Handan Central Hospital (East District) were retrospectively included. Patients were categorized into SIC and non-SIC groups. Data were split into training (70%) and testing (30%) sets. Additionally, for temporal validation, patients with sepsis admitted between March 1, 2024, and October 31, 2024, at Hebei General Hospital were included. Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression and multivariate logistic regression. Nine ML algorithms were tested, and model performance was assessed using receiver operating characteristic curve (ROC) analysis, including area under the curve (AUC), calibration curves, and decision curve analysis (DCA). The SHaply Additive Explanations (SHAP) algorithm was used to interpret the best-performing model and visualize key predictors.</p><p><strong>Results: </strong>Among 847 patients with sepsis, 480 (56.7%) developed SIC. The random forest (RF) model with eight variables performed best, achieving AUCs of 0.782 [95% confidence interval (CI): 0.745, 0.818] in the training set, 0.750 (95% CI: 0.690, 0.809) in the testing set, and 0.784 (95% CI: 0.711, 0.857) in the validation set. Key predictors included activated partial thromboplastin time, lactate, oxygenation index, and total protein.</p><p><strong>Conclusions: </strong>This ML model reliably predicts SIC risk. SHAP enhances interpretability, supporting early, individualized interventions to improve outcomes in patients with sepsis.</p>","PeriodicalId":12622,"journal":{"name":"Frontiers in Immunology","volume":"16 ","pages":"1552265"},"PeriodicalIF":5.9000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11911172/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Immunology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fimmu.2025.1552265","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Sepsis-induced coagulopathy (SIC) is a complex condition characterized by systemic inflammation and coagulopathy. This study aimed to develop and validate a machine learning (ML) model to predict SIC risk in patients with sepsis.
Methods: Patients with sepsis admitted to the intensive care unit (ICU) between March 1, 2021, and March 1, 2024, at Hebei General Hospital and Handan Central Hospital (East District) were retrospectively included. Patients were categorized into SIC and non-SIC groups. Data were split into training (70%) and testing (30%) sets. Additionally, for temporal validation, patients with sepsis admitted between March 1, 2024, and October 31, 2024, at Hebei General Hospital were included. Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression and multivariate logistic regression. Nine ML algorithms were tested, and model performance was assessed using receiver operating characteristic curve (ROC) analysis, including area under the curve (AUC), calibration curves, and decision curve analysis (DCA). The SHaply Additive Explanations (SHAP) algorithm was used to interpret the best-performing model and visualize key predictors.
Results: Among 847 patients with sepsis, 480 (56.7%) developed SIC. The random forest (RF) model with eight variables performed best, achieving AUCs of 0.782 [95% confidence interval (CI): 0.745, 0.818] in the training set, 0.750 (95% CI: 0.690, 0.809) in the testing set, and 0.784 (95% CI: 0.711, 0.857) in the validation set. Key predictors included activated partial thromboplastin time, lactate, oxygenation index, and total protein.
Conclusions: This ML model reliably predicts SIC risk. SHAP enhances interpretability, supporting early, individualized interventions to improve outcomes in patients with sepsis.
期刊介绍:
Frontiers in Immunology is a leading journal in its field, publishing rigorously peer-reviewed research across basic, translational and clinical immunology. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.
Frontiers in Immunology is the official Journal of the International Union of Immunological Societies (IUIS). Encompassing the entire field of Immunology, this journal welcomes papers that investigate basic mechanisms of immune system development and function, with a particular emphasis given to the description of the clinical and immunological phenotype of human immune disorders, and on the definition of their molecular basis.