{"title":"利用机器学习模型预测艰难梭菌感染的心衰患者 28 天内的全因死亡率:来自 MIMIC-IV 数据库的证据。","authors":"Caiping Shi, Qiong Jie, Hongsong Zhang, Xinying Zhang, Weijuan Chu, Chen Chen, Qian Zhang, Zhen Hu","doi":"10.1159/000540994","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Heart failure (HF) may induce bowel hypoperfusion, leading to hypoxia of the villa of the bowel wall and the occurrence of Clostridioides difficile infection (CDI). However, the risk factors for the development of CDI in HF patients have yet to be fully illustrated, especially because of a lack of evidence from real-world data.</p><p><strong>Methods: </strong>Clinical data and survival situations of HF patients with CDI admitted to ICU were extracted from the Medical Information Mart for Intensive Care (MIMIC)-IV database. For developing a model that can predict 28-day all-cause mortality in HF patients with CDI, the Recursive Feature Elimination with Cross-Validation (RFE-CV) method was used for feature selection. And nine machine learning (ML) algorithms, including logistic regression (LR), decision tree, Bayesian, adaptive boosting, random forest (RF), gradient boosting decision tree, XGBoost, light gradient boosting machine, and categorical boosting, were applied for model construction. After training and hyperparameter optimization of the models through grid search 5-fold cross-validation, the performance of models was evaluated by the area under curve (AUC), accuracy, sensitivity, specificity, precision, negative predictive value, and F1 score. Furthermore, the SHapley Additive exPlanations (SHAP) method was used to interpret the optimal model.</p><p><strong>Results: </strong>A total of 526 HF patients with CDI were included in the study, of whom 99 cases (18.8%) experienced death within 28 days. Eighteen of the 57 variables were selected for the model construction algorithm for model construction. Among the ML models considered, the RF model emerged as the optimal model achieving the accuracy, F1-score, and AUC values of 0.821, 0.596, and 0.864, respectively. The net benefit of the model surpassed other models at 16%-22% threshold probabilities based on decision curve analysis. According to the importance of features in the RF model, red blood cell distribution width, blood urea nitrogen, Simplified Acute Physiology Score II, Sequential Organ Failure Assessment, and white blood cell count were highlighted as the five most influential variables.</p><p><strong>Conclusions: </strong>We developed ML models to predict 28-day all-cause mortality in HF patients associated with CDI in the ICU, which are more effective than the conventional LR model. The RF model has the best performance among all the ML models employed. It may be useful to help clinicians identify high-risk HF patients with CDI.</p>","PeriodicalId":9391,"journal":{"name":"Cardiology","volume":" ","pages":"1"},"PeriodicalIF":1.9000,"publicationDate":"2024-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of 28-Day All-Cause Mortality in Heart Failure Patients with Clostridioides difficile Infection Using Machine Learning Models: Evidence from the MIMIC-IV Database.\",\"authors\":\"Caiping Shi, Qiong Jie, Hongsong Zhang, Xinying Zhang, Weijuan Chu, Chen Chen, Qian Zhang, Zhen Hu\",\"doi\":\"10.1159/000540994\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Heart failure (HF) may induce bowel hypoperfusion, leading to hypoxia of the villa of the bowel wall and the occurrence of Clostridioides difficile infection (CDI). However, the risk factors for the development of CDI in HF patients have yet to be fully illustrated, especially because of a lack of evidence from real-world data.</p><p><strong>Methods: </strong>Clinical data and survival situations of HF patients with CDI admitted to ICU were extracted from the Medical Information Mart for Intensive Care (MIMIC)-IV database. For developing a model that can predict 28-day all-cause mortality in HF patients with CDI, the Recursive Feature Elimination with Cross-Validation (RFE-CV) method was used for feature selection. And nine machine learning (ML) algorithms, including logistic regression (LR), decision tree, Bayesian, adaptive boosting, random forest (RF), gradient boosting decision tree, XGBoost, light gradient boosting machine, and categorical boosting, were applied for model construction. After training and hyperparameter optimization of the models through grid search 5-fold cross-validation, the performance of models was evaluated by the area under curve (AUC), accuracy, sensitivity, specificity, precision, negative predictive value, and F1 score. Furthermore, the SHapley Additive exPlanations (SHAP) method was used to interpret the optimal model.</p><p><strong>Results: </strong>A total of 526 HF patients with CDI were included in the study, of whom 99 cases (18.8%) experienced death within 28 days. Eighteen of the 57 variables were selected for the model construction algorithm for model construction. Among the ML models considered, the RF model emerged as the optimal model achieving the accuracy, F1-score, and AUC values of 0.821, 0.596, and 0.864, respectively. The net benefit of the model surpassed other models at 16%-22% threshold probabilities based on decision curve analysis. According to the importance of features in the RF model, red blood cell distribution width, blood urea nitrogen, Simplified Acute Physiology Score II, Sequential Organ Failure Assessment, and white blood cell count were highlighted as the five most influential variables.</p><p><strong>Conclusions: </strong>We developed ML models to predict 28-day all-cause mortality in HF patients associated with CDI in the ICU, which are more effective than the conventional LR model. The RF model has the best performance among all the ML models employed. It may be useful to help clinicians identify high-risk HF patients with CDI.</p>\",\"PeriodicalId\":9391,\"journal\":{\"name\":\"Cardiology\",\"volume\":\" \",\"pages\":\"1\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cardiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1159/000540994\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000540994","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
Prediction of 28-Day All-Cause Mortality in Heart Failure Patients with Clostridioides difficile Infection Using Machine Learning Models: Evidence from the MIMIC-IV Database.
Introduction: Heart failure (HF) may induce bowel hypoperfusion, leading to hypoxia of the villa of the bowel wall and the occurrence of Clostridioides difficile infection (CDI). However, the risk factors for the development of CDI in HF patients have yet to be fully illustrated, especially because of a lack of evidence from real-world data.
Methods: Clinical data and survival situations of HF patients with CDI admitted to ICU were extracted from the Medical Information Mart for Intensive Care (MIMIC)-IV database. For developing a model that can predict 28-day all-cause mortality in HF patients with CDI, the Recursive Feature Elimination with Cross-Validation (RFE-CV) method was used for feature selection. And nine machine learning (ML) algorithms, including logistic regression (LR), decision tree, Bayesian, adaptive boosting, random forest (RF), gradient boosting decision tree, XGBoost, light gradient boosting machine, and categorical boosting, were applied for model construction. After training and hyperparameter optimization of the models through grid search 5-fold cross-validation, the performance of models was evaluated by the area under curve (AUC), accuracy, sensitivity, specificity, precision, negative predictive value, and F1 score. Furthermore, the SHapley Additive exPlanations (SHAP) method was used to interpret the optimal model.
Results: A total of 526 HF patients with CDI were included in the study, of whom 99 cases (18.8%) experienced death within 28 days. Eighteen of the 57 variables were selected for the model construction algorithm for model construction. Among the ML models considered, the RF model emerged as the optimal model achieving the accuracy, F1-score, and AUC values of 0.821, 0.596, and 0.864, respectively. The net benefit of the model surpassed other models at 16%-22% threshold probabilities based on decision curve analysis. According to the importance of features in the RF model, red blood cell distribution width, blood urea nitrogen, Simplified Acute Physiology Score II, Sequential Organ Failure Assessment, and white blood cell count were highlighted as the five most influential variables.
Conclusions: We developed ML models to predict 28-day all-cause mortality in HF patients associated with CDI in the ICU, which are more effective than the conventional LR model. The RF model has the best performance among all the ML models employed. It may be useful to help clinicians identify high-risk HF patients with CDI.
期刊介绍:
''Cardiology'' features first reports on original clinical, preclinical and fundamental research as well as ''Novel Insights from Clinical Experience'' and topical comprehensive reviews in selected areas of cardiovascular disease. ''Editorial Comments'' provide a critical but positive evaluation of a recent article. Papers not only describe but offer critical appraisals of new developments in non-invasive and invasive diagnostic methods and in pharmacologic, nutritional and mechanical/surgical therapies. Readers are thus kept informed of current strategies in the prevention, recognition and treatment of heart disease. Special sections in a variety of subspecialty areas reinforce the journal''s value as a complete record of recent progress for all cardiologists, internists, cardiac surgeons, clinical physiologists, pharmacologists and professionals in other areas of medicine interested in current activity in cardiovascular diseases.