用于再入院预测的生存模型和纵向医疗事件。

IF 2.7 3区医学 Q2 HEALTH CARE SCIENCES & SERVICES BMC Health Services Research Pub Date : 2024-11-13 DOI:10.1186/s12913-024-11771-w

Sacha Davis, Russell Greiner

{"title":"用于再入院预测的生存模型和纵向医疗事件。","authors":"Sacha Davis, Russell Greiner","doi":"10.1186/s12913-024-11771-w","DOIUrl":null,"url":null,"abstract":"Background: The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.Methods: We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.Results: All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.Conclusion: Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.","PeriodicalId":9012,"journal":{"name":"BMC Health Services Research","volume":"24 1","pages":"1394"},"PeriodicalIF":2.7000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559171/pdf/","citationCount":"0","resultStr":"{\"title\":\"Survival models and longitudinal medical events for hospital readmission forecasting.\",\"authors\":\"Sacha Davis, Russell Greiner\",\"doi\":\"10.1186/s12913-024-11771-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.Methods: We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.Results: All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.Conclusion: Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.\",\"PeriodicalId\":9012,\"journal\":{\"name\":\"BMC Health Services Research\",\"volume\":\"24 1\",\"pages\":\"1394\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559171/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12913-024-11771-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12913-024-11771-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

背景：30 天全因再入院率会影响医院获得的资金。一个准确可靠的再入院预测模型可以节省资金并提高医疗质量。很少有项目探讨过将这一任务表述为生存预测问题，即模型可以利用真实值的再入院时间目标。本文展示了受生存启发的再入院模型的有效性，尤其是在与纵向患者代表搭配时，这种模型与疾病队列和预测任务无关：我们预测了加拿大艾伯塔省医院 2015 年和 2016 年出院的 421,088 名患者的再入院率。临床特征和历史医疗代码序列（出院前至少整整四年的计算结果）来自相关的行政来源，作为模型输入。我们训练了二元 30 天再入院模型（XGBoost 和深度神经网络）和时间到事件再入院模型（CoxPH 和 N-MTLR），在初始化时使用和不使用机器学习的医疗知识，然后使用 30 天的 AUROC 分数（AUROC@30）与流行的基于 LACE 的模型进行比较。此外，还使用一致性、综合布赖尔和 L1 损失分数对生存模型进行了评估：结果：所有使用序列特征的模型都明显优于仅使用临床特征训练的最佳模型。此外，在相同的模型输入和架构下，时间到事件目标可提高 30 天的预测性能。N-MTLR 仅使用序列输入，并以预先学习的医学知识进行初始化，在五次折叠中平均 AUROC@30 为 0.8460，标准偏差为 0.003。所有经过训练的模型都达到或超过了 LACE 基线（0.6587±0.003）：行政医疗代码序列包含丰富的预测信息，可用于预测再入院情况，利用机器学习嵌入先验医疗知识可为再入院模型的训练提供有利的基础。当模型与能够利用时间到事件目标的模型相结合时，仅使用行政数据就能在 30 天全因再入院任务中取得优异成绩。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Survival models and longitudinal medical events for hospital readmission forecasting.

Background: The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.

Methods: We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.

Results: All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.

Conclusion: Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Health Services Research 医学-卫生保健

CiteScore

4.40

自引率

7.10%

发文量

1372

审稿时长

6 months

期刊介绍： BMC Health Services Research is an open access, peer-reviewed journal that considers articles on all aspects of health services research, including delivery of care, management of health services, assessment of healthcare needs, measurement of outcomes, allocation of healthcare resources, evaluation of different health markets and health services organizations, international comparative analysis of health systems, health economics and the impact of health policies and regulations.