Prediction of 30-day mortality for ICU patients with Sepsis-3.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS BMC Medical Informatics and Decision Making Pub Date : 2024-08-08 DOI:10.1186/s12911-024-02629-6
Zhijiang Yu, Negin Ashrafi, Hexin Li, Kamiar Alaei, Maryam Pishgar
{"title":"Prediction of 30-day mortality for ICU patients with Sepsis-3.","authors":"Zhijiang Yu, Negin Ashrafi, Hexin Li, Kamiar Alaei, Maryam Pishgar","doi":"10.1186/s12911-024-02629-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There is a growing demand for advanced methods to improve the understanding and prediction of illnesses. This study focuses on Sepsis, a critical response to infection, aiming to enhance early detection and mortality prediction for Sepsis-3 patients to improve hospital resource allocation.</p><p><strong>Methods: </strong>In this study, we developed a Machine Learning (ML) framework to predict the 30-day mortality rate of ICU patients with Sepsis-3 using the MIMIC-III database. Advanced big data extraction tools like Snowflake were used to identify eligible patients. Decision tree models and Entropy Analyses helped refine feature selection, resulting in 30 relevant features curated with clinical experts. We employed the Light Gradient Boosting Machine (LightGBM) model for its efficiency and predictive power.</p><p><strong>Results: </strong>The study comprised a cohort of 9118 Sepsis-3 patients. Our preprocessing techniques significantly improved both the AUC and accuracy metrics. The LightGBM model achieved an impressive AUC of 0.983 (95% CI: [0.980-0.990]), an accuracy of 0.966, and an F1-score of 0.910. Notably, LightGBM showed a substantial 6% improvement over our best baseline model and a 14% enhancement over the best existing literature. These advancements are attributed to (I) the inclusion of the novel and pivotal feature Hospital Length of Stay (HOSP_LOS), absent in previous studies, and (II) LightGBM's gradient boosting architecture, enabling robust predictions with high-dimensional data while maintaining computational efficiency, as demonstrated by its learning curve.</p><p><strong>Conclusions: </strong>Our preprocessing methodology reduced the number of relevant features and identified a crucial feature overlooked in previous studies. The proposed model demonstrated high predictive power and generalization capability, highlighting the potential of ML in ICU settings. This model can streamline ICU resource allocation and provide tailored interventions for Sepsis-3 patients.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308624/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02629-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: There is a growing demand for advanced methods to improve the understanding and prediction of illnesses. This study focuses on Sepsis, a critical response to infection, aiming to enhance early detection and mortality prediction for Sepsis-3 patients to improve hospital resource allocation.

Methods: In this study, we developed a Machine Learning (ML) framework to predict the 30-day mortality rate of ICU patients with Sepsis-3 using the MIMIC-III database. Advanced big data extraction tools like Snowflake were used to identify eligible patients. Decision tree models and Entropy Analyses helped refine feature selection, resulting in 30 relevant features curated with clinical experts. We employed the Light Gradient Boosting Machine (LightGBM) model for its efficiency and predictive power.

Results: The study comprised a cohort of 9118 Sepsis-3 patients. Our preprocessing techniques significantly improved both the AUC and accuracy metrics. The LightGBM model achieved an impressive AUC of 0.983 (95% CI: [0.980-0.990]), an accuracy of 0.966, and an F1-score of 0.910. Notably, LightGBM showed a substantial 6% improvement over our best baseline model and a 14% enhancement over the best existing literature. These advancements are attributed to (I) the inclusion of the novel and pivotal feature Hospital Length of Stay (HOSP_LOS), absent in previous studies, and (II) LightGBM's gradient boosting architecture, enabling robust predictions with high-dimensional data while maintaining computational efficiency, as demonstrated by its learning curve.

Conclusions: Our preprocessing methodology reduced the number of relevant features and identified a crucial feature overlooked in previous studies. The proposed model demonstrated high predictive power and generalization capability, highlighting the potential of ML in ICU settings. This model can streamline ICU resource allocation and provide tailored interventions for Sepsis-3 patients.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ICU 败血症患者 30 天死亡率预测-3。
背景:人们越来越需要先进的方法来提高对疾病的理解和预测。本研究的重点是败血症,这是一种对感染的关键反应,旨在加强对败血症-3 患者的早期检测和死亡率预测,以改善医院的资源分配:在这项研究中,我们开发了一个机器学习(ML)框架,利用 MIMIC-III 数据库预测 ICU 败血症-3 患者的 30 天死亡率。我们使用雪花(Snowflake)等先进的大数据提取工具来识别符合条件的患者。决策树模型和熵分析帮助完善了特征选择,最终得出了由临床专家策划的 30 个相关特征。我们采用了光梯度提升机(LightGBM)模型,以提高其效率和预测能力:研究包括 9118 例败血症-3 患者。我们的预处理技术大大提高了AUC和准确率指标。LightGBM 模型的 AUC 为 0.983(95% CI:[0.980-0.990]),准确率为 0.966,F1 分数为 0.910。值得注意的是,LightGBM 比我们的最佳基线模型提高了 6%,比现有的最佳文献提高了 14%。这些进步归功于:(I)加入了新颖且关键的特征--住院时间(HOSP_LOS),而这在之前的研究中是没有的;(II)LightGBM 的梯度提升架构,在保持计算效率的同时,还能对高维数据进行稳健预测,这一点从其学习曲线中就能看出:我们的预处理方法减少了相关特征的数量,并发现了以往研究中忽略的一个关键特征。所提出的模型具有很高的预测能力和泛化能力,凸显了 ML 在 ICU 环境中的潜力。该模型可简化重症监护室的资源分配,并为败血症-3 患者提供量身定制的干预措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
期刊最新文献
Real-world data to support post-market safety and performance of embolization coils: evidence generation from a medical device manufacturer and data institute partnership. Development of message passing-based graph convolutional networks for classifying cancer pathology reports Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study A cross domain access control model for medical consortium based on DBSCAN and penalty function RCC-Supporter: supporting renal cell carcinoma treatment decision-making using machine learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1