Maxim Topaz, Anahita Davoudi, Lauren Evans, Sridevi Sridharan, Jiyoun Song, Sena Chae, Yolanda Barrón, Mollie Hobensack, Danielle Scharp, Kenrick Cato, Sarah Collins Rossetti, Piotr Kapela, Zidu Xu, Pallavi Gupta, Zhihong Zhang, Margaret V Mcdonald, Kathryn H Bowles
{"title":"Building a Time-Series Model to Predict Hospitalization Risks in Home Health Care: Insights Into Development, Accuracy, and Fairness.","authors":"Maxim Topaz, Anahita Davoudi, Lauren Evans, Sridevi Sridharan, Jiyoun Song, Sena Chae, Yolanda Barrón, Mollie Hobensack, Danielle Scharp, Kenrick Cato, Sarah Collins Rossetti, Piotr Kapela, Zidu Xu, Pallavi Gupta, Zhihong Zhang, Margaret V Mcdonald, Kathryn H Bowles","doi":"10.1016/j.jamda.2024.105417","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Home health care (HHC) serves more than 5 million older adults annually in the United States, aiming to prevent unnecessary hospitalizations and emergency department (ED) visits. Despite efforts, up to 25% of patients in HHC experience these adverse events. The underutilization of clinical notes, aggregated data approaches, and potential demographic biases have limited previous HHC risk prediction models. This study aimed to develop a time-series risk model to predict hospitalizations and ED visits in patients in HHC, examine model performance over various prediction windows, identify top predictive variables and map them to data standards, and assess model fairness across demographic subgroups.</p><p><strong>Setting and participants: </strong>A total of 27,222 HHC episodes between 2015 and 2017.</p><p><strong>Methods: </strong>The study used health care process modeling of electronic health records, including clinical notes processed with natural language processing techniques and Medicare claims data. A Light Gradient Boosting Machine algorithm was used to develop the risk prediction model, with performance evaluated using 5-fold cross-validation. Model fairness was assessed across gender, race/ethnicity, and socioeconomic subgroups.</p><p><strong>Results: </strong>The model achieved high predictive performance, with an F1 score of 0.84 for a 5-day prediction window. Twenty top predictive variables were identified, including novel indicators such as the length of nurse-patient visits and visit frequency. Eighty-five percent of these variables mapped completely to the US Core Data for Interoperability standard. Fairness assessment revealed performance disparities across demographic and socioeconomic groups, with lower model effectiveness for more historically underserved populations.</p><p><strong>Conclusions and implications: </strong>This study developed a robust time-series risk model for predicting adverse events in patients in HHC, incorporating diverse data types and demonstrating high predictive accuracy. The findings highlight the importance of considering established and novel risk factors in HHC. Importantly, the observed performance disparities across subgroups emphasize the need for fairness adjustments to ensure equitable risk prediction across all patient populations.</p>","PeriodicalId":17180,"journal":{"name":"Journal of the American Medical Directors Association","volume":" ","pages":"105417"},"PeriodicalIF":4.2000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Directors Association","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jamda.2024.105417","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Home health care (HHC) serves more than 5 million older adults annually in the United States, aiming to prevent unnecessary hospitalizations and emergency department (ED) visits. Despite efforts, up to 25% of patients in HHC experience these adverse events. The underutilization of clinical notes, aggregated data approaches, and potential demographic biases have limited previous HHC risk prediction models. This study aimed to develop a time-series risk model to predict hospitalizations and ED visits in patients in HHC, examine model performance over various prediction windows, identify top predictive variables and map them to data standards, and assess model fairness across demographic subgroups.
Setting and participants: A total of 27,222 HHC episodes between 2015 and 2017.
Methods: The study used health care process modeling of electronic health records, including clinical notes processed with natural language processing techniques and Medicare claims data. A Light Gradient Boosting Machine algorithm was used to develop the risk prediction model, with performance evaluated using 5-fold cross-validation. Model fairness was assessed across gender, race/ethnicity, and socioeconomic subgroups.
Results: The model achieved high predictive performance, with an F1 score of 0.84 for a 5-day prediction window. Twenty top predictive variables were identified, including novel indicators such as the length of nurse-patient visits and visit frequency. Eighty-five percent of these variables mapped completely to the US Core Data for Interoperability standard. Fairness assessment revealed performance disparities across demographic and socioeconomic groups, with lower model effectiveness for more historically underserved populations.
Conclusions and implications: This study developed a robust time-series risk model for predicting adverse events in patients in HHC, incorporating diverse data types and demonstrating high predictive accuracy. The findings highlight the importance of considering established and novel risk factors in HHC. Importantly, the observed performance disparities across subgroups emphasize the need for fairness adjustments to ensure equitable risk prediction across all patient populations.
期刊介绍:
JAMDA, the official journal of AMDA - The Society for Post-Acute and Long-Term Care Medicine, is a leading peer-reviewed publication that offers practical information and research geared towards healthcare professionals in the post-acute and long-term care fields. It is also a valuable resource for policy-makers, organizational leaders, educators, and advocates.
The journal provides essential information for various healthcare professionals such as medical directors, attending physicians, nurses, consultant pharmacists, geriatric psychiatrists, nurse practitioners, physician assistants, physical and occupational therapists, social workers, and others involved in providing, overseeing, and promoting quality