利用机器学习预测患者个人和医院层面的出院情况。

IF 5.4 Q1 MEDICINE, RESEARCH & EXPERIMENTAL Communications medicine Pub Date : 2024-11-18 DOI:10.1038/s43856-024-00673-x

Jia Wei, Jiandong Zhou, Zizheng Zhang, Kevin Yuan, Qingze Gu, Augustine Luk, Andrew J Brent, David A Clifton, A Sarah Walker, David W Eyre

{"title":"利用机器学习预测患者个人和医院层面的出院情况。","authors":"Jia Wei, Jiandong Zhou, Zizheng Zhang, Kevin Yuan, Qingze Gu, Augustine Luk, Andrew J Brent, David A Clifton, A Sarah Walker, David W Eyre","doi":"10.1038/s43856-024-00673-x","DOIUrl":null,"url":null,"abstract":"Background: Accurately predicting hospital discharge events could help improve patient flow and the efficiency of healthcare delivery. However, using machine learning and diverse electronic health record (EHR) data for this task remains incompletely explored.Methods: We used EHR data from February-2017 to January-2020 from Oxfordshire, UK to predict hospital discharges in the next 24 h. We fitted separate extreme gradient boosting models for elective and emergency admissions, trained on the first two years of data and tested on the final year of data. We examined individual-level and hospital-level model performance and evaluated the impact of training data size and recency, prediction time, and performance in subgroups.Results: Our models achieve AUROCs of 0.87 and 0.86, AUPRCs of 0.66 and 0.64, and F1 scores of 0.61 and 0.59 for elective and emergency admissions, respectively. These models outperform a logistic regression model using the same features and are substantially better than a baseline logistic regression model with more limited features. Notably, the relative performance increase from adding additional features is greater than the increase from using a sophisticated model. Aggregating individual probabilities, daily total discharge estimates are accurate with mean absolute errors of 8.9% (elective) and 4.9% (emergency). The most informative predictors include antibiotic prescriptions, medications, and hospital capacity factors. Performance remains robust across patient subgroups and different training strategies, but is lower in patients with longer admissions and those who died in hospital.Conclusions: Our findings highlight the potential of machine learning in optimising hospital patient flow and facilitating patient care and recovery.","PeriodicalId":72646,"journal":{"name":"Communications medicine","volume":"4 1","pages":"236"},"PeriodicalIF":5.4000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574281/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting individual patient and hospital-level discharge using machine learning.\",\"authors\":\"Jia Wei, Jiandong Zhou, Zizheng Zhang, Kevin Yuan, Qingze Gu, Augustine Luk, Andrew J Brent, David A Clifton, A Sarah Walker, David W Eyre\",\"doi\":\"10.1038/s43856-024-00673-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Accurately predicting hospital discharge events could help improve patient flow and the efficiency of healthcare delivery. However, using machine learning and diverse electronic health record (EHR) data for this task remains incompletely explored.Methods: We used EHR data from February-2017 to January-2020 from Oxfordshire, UK to predict hospital discharges in the next 24 h. We fitted separate extreme gradient boosting models for elective and emergency admissions, trained on the first two years of data and tested on the final year of data. We examined individual-level and hospital-level model performance and evaluated the impact of training data size and recency, prediction time, and performance in subgroups.Results: Our models achieve AUROCs of 0.87 and 0.86, AUPRCs of 0.66 and 0.64, and F1 scores of 0.61 and 0.59 for elective and emergency admissions, respectively. These models outperform a logistic regression model using the same features and are substantially better than a baseline logistic regression model with more limited features. Notably, the relative performance increase from adding additional features is greater than the increase from using a sophisticated model. Aggregating individual probabilities, daily total discharge estimates are accurate with mean absolute errors of 8.9% (elective) and 4.9% (emergency). The most informative predictors include antibiotic prescriptions, medications, and hospital capacity factors. Performance remains robust across patient subgroups and different training strategies, but is lower in patients with longer admissions and those who died in hospital.Conclusions: Our findings highlight the potential of machine learning in optimising hospital patient flow and facilitating patient care and recovery.\",\"PeriodicalId\":72646,\"journal\":{\"name\":\"Communications medicine\",\"volume\":\"4 1\",\"pages\":\"236\"},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574281/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1038/s43856-024-00673-x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43856-024-00673-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

背景：准确预测出院事件有助于改善患者流程和提高医疗服务效率。然而，利用机器学习和多样化的电子健康记录（EHR）数据来完成这项任务的探索仍未完成：我们使用了英国牛津郡 2017 年 2 月至 2020 年 1 月的电子病历数据来预测未来 24 小时内的出院情况。我们为择期入院和急诊入院分别拟合了极端梯度提升模型，在前两年的数据上进行了训练，并在最后一年的数据上进行了测试。我们检查了个人层面和医院层面的模型性能，并评估了训练数据大小和重复性、预测时间以及分组性能的影响：我们的模型在择期入院和急诊入院方面的 AUROC 分别为 0.87 和 0.86，AUPRC 分别为 0.66 和 0.64，F1 分别为 0.61 和 0.59。这些模型优于使用相同特征的逻辑回归模型，也大大优于使用更有限特征的基线逻辑回归模型。值得注意的是，增加额外特征所带来的相对性能提升要大于使用复杂模型所带来的提升。汇总单个概率后，每日总出院率估计值准确，平均绝对误差为 8.9%（择期）和 4.9%（急诊）。最有参考价值的预测因素包括抗生素处方、药物和医院容量因素。在不同的患者亚群和不同的训练策略下，预测结果仍然保持稳定，但在入院时间较长的患者和在医院死亡的患者中，预测结果较低：我们的研究结果凸显了机器学习在优化医院患者流程、促进患者护理和康复方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Predicting individual patient and hospital-level discharge using machine learning.

Background: Accurately predicting hospital discharge events could help improve patient flow and the efficiency of healthcare delivery. However, using machine learning and diverse electronic health record (EHR) data for this task remains incompletely explored.

Methods: We used EHR data from February-2017 to January-2020 from Oxfordshire, UK to predict hospital discharges in the next 24 h. We fitted separate extreme gradient boosting models for elective and emergency admissions, trained on the first two years of data and tested on the final year of data. We examined individual-level and hospital-level model performance and evaluated the impact of training data size and recency, prediction time, and performance in subgroups.

Results: Our models achieve AUROCs of 0.87 and 0.86, AUPRCs of 0.66 and 0.64, and F1 scores of 0.61 and 0.59 for elective and emergency admissions, respectively. These models outperform a logistic regression model using the same features and are substantially better than a baseline logistic regression model with more limited features. Notably, the relative performance increase from adding additional features is greater than the increase from using a sophisticated model. Aggregating individual probabilities, daily total discharge estimates are accurate with mean absolute errors of 8.9% (elective) and 4.9% (emergency). The most informative predictors include antibiotic prescriptions, medications, and hospital capacity factors. Performance remains robust across patient subgroups and different training strategies, but is lower in patients with longer admissions and those who died in hospital.

Conclusions: Our findings highlight the potential of machine learning in optimising hospital patient flow and facilitating patient care and recovery.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Communications medicine

自引率

0.00%

发文量