{"title":"用于再入院预测的生存模型和纵向医疗事件。","authors":"Sacha Davis, Russell Greiner","doi":"10.1186/s12913-024-11771-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.</p><p><strong>Methods: </strong>We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.</p><p><strong>Results: </strong>All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.</p><p><strong>Conclusion: </strong>Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.</p>","PeriodicalId":9012,"journal":{"name":"BMC Health Services Research","volume":"24 1","pages":"1394"},"PeriodicalIF":2.7000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559171/pdf/","citationCount":"0","resultStr":"{\"title\":\"Survival models and longitudinal medical events for hospital readmission forecasting.\",\"authors\":\"Sacha Davis, Russell Greiner\",\"doi\":\"10.1186/s12913-024-11771-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.</p><p><strong>Methods: </strong>We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.</p><p><strong>Results: </strong>All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.</p><p><strong>Conclusion: </strong>Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.</p>\",\"PeriodicalId\":9012,\"journal\":{\"name\":\"BMC Health Services Research\",\"volume\":\"24 1\",\"pages\":\"1394\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559171/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12913-024-11771-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12913-024-11771-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Survival models and longitudinal medical events for hospital readmission forecasting.
Background: The rate of 30-day all-cause hospital readmissions can affect the funding a hospital receives. An accurate and reliable readmission prediction model could save money and increase quality-of-care. Few projects have explored formulating this task as a survival prediction problem, where models can exploit a real-valued time-to-readmission target. This paper demonstrates the effectiveness of a survival-inspired readmission model, especially when paired with a longitudinal patient representation that is agnostic to disease-cohort and predictive task.
Methods: We forecast readmissions for a population-level cohort of 421,088 patients discharged in 2015 and 2016 from hospitals in Alberta, Canada. Clinical features and sequences of historical medical codes (calculated from at least four full years prior to discharge) from linked administrative sources serve as model inputs. We trained binary 30-day readmission models (XGBoost and a Deep Neural Network) and time-to-event readmission models (CoxPH and N-MTLR) with and without machine-learned medical knowledge at initialization, then compared against the popular LACE-based model using the AUROC score at 30 days (AUROC@30). Survival models are additionally evaluated using concordance, Integrated Brier, and L1-loss scores.
Results: All models that utilize sequence features markedly out-perform even the best models trained on only clinical features. Further, a time-to-event target improves predictive performance at 30 days, given the same model inputs and architecture. N-MTLR, using solely sequence inputs and initialized with pre-learned medical knowledge, achieves an average AUROC@30 of 0.8460 over five folds with a standard deviation of 0.003. All trained models match or out-perform the LACE baseline of 0.6587±0.003.
Conclusion: Sequences of administrative medical codes contain rich predictive information for forecasting readmissions, and embedding medical knowledge a priori using machine learning provides readmission models an advantageous foundation for training. When combined with a model that can leverage a time-to-event target, excellent performance is possible on the 30-day all-cause readmission task using only administrative data.
期刊介绍:
BMC Health Services Research is an open access, peer-reviewed journal that considers articles on all aspects of health services research, including delivery of care, management of health services, assessment of healthcare needs, measurement of outcomes, allocation of healthcare resources, evaluation of different health markets and health services organizations, international comparative analysis of health systems, health economics and the impact of health policies and regulations.