Early prediction of sepsis-induced respiratory tract infection using a biomarker-based machine-learning algorithm.

IF 1.4 4区医学 Q4 MEDICINE, RESEARCH & EXPERIMENTAL Scandinavian Journal of Clinical & Laboratory Investigation Pub Date : 2024-05-01 Epub Date: 2024-04-29 DOI:10.1080/00365513.2024.2346914

Mingkuan Su, Haiying Wu, Hongbin Chen, Jianfeng Guo, Zongyun Chen, Jie Qiu, Jiancheng Huang

{"title":"Early prediction of sepsis-induced respiratory tract infection using a biomarker-based machine-learning algorithm.","authors":"Mingkuan Su, Haiying Wu, Hongbin Chen, Jianfeng Guo, Zongyun Chen, Jie Qiu, Jiancheng Huang","doi":"10.1080/00365513.2024.2346914","DOIUrl":null,"url":null,"abstract":"<p><p>Early and differential diagnosis of sepsis is essential to avoid unnecessary antibiotic use and further reduce patient morbidity and mortality. Here, we aimed to identify predictors of sepsis and advance a machine-learning strategy to predict sepsis-induced respiratory tract infection (RTI). Patients with sepsis and RTI were selected via retrospective analysis, and essential population characteristics and laboratory parameters were recorded. To improve the performance of the primary model and avoid over-fitting, a recursive feature elimination with cross-validation (RFECV) strategy was used to screen the optimal subset of biomarkers and construct nine machine-learning models based on this subset; the average accuracy, precision, recall, and F1-score were used for evaluation of the models. We identified 430 patients with sepsis and 686 patients with RTI. A total of 39 features were collected, with 23 features identified for initial model construction. Using the RFECV algorithm, we found that the XGBoost classifier, which only needed to include seven biomarkers, demonstrated the best performance among all prediction models, with an average accuracy of 89.24 ± 2.28, while the Ridge classifier, which included 11 biomarkers, had an average accuracy of only 83.87 ± 4.69. The remaining models had prediction accuracies greater than 88%. We developed nine models for predicting sepsis using a strategy that combined RFECV with machine learning. Among these models, the XGBoost classifier, which included seven biomarkers, showed the best performance and highest accuracy for predicting sepsis and may be a promising tool for the timely identification of sepsis.</p>","PeriodicalId":21474,"journal":{"name":"Scandinavian Journal of Clinical & Laboratory Investigation","volume":" ","pages":"202-210"},"PeriodicalIF":1.4000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Clinical & Laboratory Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/00365513.2024.2346914","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/4/29 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Early and differential diagnosis of sepsis is essential to avoid unnecessary antibiotic use and further reduce patient morbidity and mortality. Here, we aimed to identify predictors of sepsis and advance a machine-learning strategy to predict sepsis-induced respiratory tract infection (RTI). Patients with sepsis and RTI were selected via retrospective analysis, and essential population characteristics and laboratory parameters were recorded. To improve the performance of the primary model and avoid over-fitting, a recursive feature elimination with cross-validation (RFECV) strategy was used to screen the optimal subset of biomarkers and construct nine machine-learning models based on this subset; the average accuracy, precision, recall, and F1-score were used for evaluation of the models. We identified 430 patients with sepsis and 686 patients with RTI. A total of 39 features were collected, with 23 features identified for initial model construction. Using the RFECV algorithm, we found that the XGBoost classifier, which only needed to include seven biomarkers, demonstrated the best performance among all prediction models, with an average accuracy of 89.24 ± 2.28, while the Ridge classifier, which included 11 biomarkers, had an average accuracy of only 83.87 ± 4.69. The remaining models had prediction accuracies greater than 88%. We developed nine models for predicting sepsis using a strategy that combined RFECV with machine learning. Among these models, the XGBoost classifier, which included seven biomarkers, showed the best performance and highest accuracy for predicting sepsis and may be a promising tool for the timely identification of sepsis.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用基于生物标志物的机器学习算法早期预测败血症诱发的呼吸道感染。

脓毒症的早期鉴别诊断对于避免不必要的抗生素使用和进一步降低患者发病率和死亡率至关重要。在此，我们旨在确定败血症的预测因素，并推进一种机器学习策略，以预测败血症诱发的呼吸道感染（RTI）。我们通过回顾性分析筛选出脓毒症和 RTI 患者，并记录了基本人群特征和实验室参数。为了提高主要模型的性能并避免过度拟合，我们采用了递归特征消除与交叉验证（RFECV）策略来筛选最佳生物标志物子集，并基于该子集构建了九个机器学习模型；模型的评估采用了平均准确度、精确度、召回率和 F1 分数。我们确定了 430 名败血症患者和 686 名 RTI 患者。共收集了 39 个特征，其中 23 个特征被确定用于构建初始模型。使用 RFECV 算法，我们发现在所有预测模型中，只需包含 7 个生物标记物的 XGBoost 分类器表现最佳，平均准确率为 89.24 ± 2.28，而包含 11 个生物标记物的 Ridge 分类器的平均准确率仅为 83.87 ± 4.69。其余模型的预测准确率均超过 88%。我们采用 RFECV 与机器学习相结合的策略开发了九种预测败血症的模型。在这些模型中，包含 7 个生物标记物的 XGBoost 分类器在预测败血症方面表现最佳，准确率最高，可能是及时识别败血症的一种有前途的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Scandinavian Journal of Clinical & Laboratory Investigation 医学-医学：研究与实验

CiteScore

3.50

自引率

4.80%

发文量

审稿时长

4-8 weeks

期刊介绍： The Scandinavian Journal of Clinical and Laboratory Investigation is an international scientific journal covering clinically oriented biochemical and physiological research. Since the launch of the journal in 1949, it has been a forum for international laboratory medicine, closely related to, and edited by, The Scandinavian Society for Clinical Chemistry. The journal contains peer-reviewed articles, editorials, invited reviews, and short technical notes, as well as several supplements each year. Supplements consist of monographs, and symposium and congress reports covering subjects within clinical chemistry and clinical physiology.