Angela T. Chen MA, Richard S. Kuzma MPP, Ari B. Friedman MD, PhD
{"title":"Identifying low acuity Emergency Department visits with a machine learning approach: The low acuity visit algorithms (LAVA)","authors":"Angela T. Chen MA, Richard S. Kuzma MPP, Ari B. Friedman MD, PhD","doi":"10.1111/1475-6773.14305","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objective</h3>\n \n <p>To improve the performance of International Classification of Disease (ICD) code rule-based algorithms for identifying low acuity Emergency Department (ED) visits by using machine learning methods and additional covariates.</p>\n </section>\n \n <section>\n \n <h3> Data Sources</h3>\n \n <p>We used secondary data on ED visits from the National Hospital Ambulatory Medical Survey (NHAMCS), from 2016 to 2020.</p>\n </section>\n \n <section>\n \n <h3> Study Design</h3>\n \n <p>We established baseline performance metrics with seven published algorithms consisting of International Classification of Disease, Tenth Revision codes used to identify low acuity ED visits. We then trained logistic regression, random forest, and gradient boosting (XGBoost) models to predict low acuity ED visits. Each model was trained on five different covariate sets of demographic and clinical data. Model performance was compared using a separate validation dataset. The primary performance metric was the probability that a visit identified by an algorithm as low acuity did not experience significant testing, treatment, or disposition (positive predictive value, PPV). Subgroup analyses assessed model performance across age, sex, and race/ethnicity.</p>\n </section>\n \n <section>\n \n <h3> Data Collection</h3>\n \n <p>We used 2016–2019 NHAMCS data as the training set and 2020 NHAMCS data for validation.</p>\n </section>\n \n <section>\n \n <h3> Principal Findings</h3>\n \n <p>The training and validation data consisted of 53,074 and 9542 observations, respectively. Among seven rule-based algorithms, the highest-performing had a PPV of 0.35 (95% CI [0.33, 0.36]). All model-based algorithms outperformed existing algorithms, with the least effective—random forest using only age and sex—improving PPV by 26% (up to 0.44; 95% CI [0.40, 0.48]). Logistic regression and XGBoost trained on all variables improved PPV by 83% (to 0.64; 95% CI [0.62, 0.66]). Multivariable models also demonstrated higher PPV across all three demographic subgroups.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>Machine learning models substantially outperform existing algorithms based on ICD codes in predicting low acuity ED visits. Variations in model performance across demographic groups highlight the need for further research to ensure their applicability and fairness across diverse populations.</p>\n </section>\n </div>","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":"59 4","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1475-6773.14305","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1475-6773.14305","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To improve the performance of International Classification of Disease (ICD) code rule-based algorithms for identifying low acuity Emergency Department (ED) visits by using machine learning methods and additional covariates.
Data Sources
We used secondary data on ED visits from the National Hospital Ambulatory Medical Survey (NHAMCS), from 2016 to 2020.
Study Design
We established baseline performance metrics with seven published algorithms consisting of International Classification of Disease, Tenth Revision codes used to identify low acuity ED visits. We then trained logistic regression, random forest, and gradient boosting (XGBoost) models to predict low acuity ED visits. Each model was trained on five different covariate sets of demographic and clinical data. Model performance was compared using a separate validation dataset. The primary performance metric was the probability that a visit identified by an algorithm as low acuity did not experience significant testing, treatment, or disposition (positive predictive value, PPV). Subgroup analyses assessed model performance across age, sex, and race/ethnicity.
Data Collection
We used 2016–2019 NHAMCS data as the training set and 2020 NHAMCS data for validation.
Principal Findings
The training and validation data consisted of 53,074 and 9542 observations, respectively. Among seven rule-based algorithms, the highest-performing had a PPV of 0.35 (95% CI [0.33, 0.36]). All model-based algorithms outperformed existing algorithms, with the least effective—random forest using only age and sex—improving PPV by 26% (up to 0.44; 95% CI [0.40, 0.48]). Logistic regression and XGBoost trained on all variables improved PPV by 83% (to 0.64; 95% CI [0.62, 0.66]). Multivariable models also demonstrated higher PPV across all three demographic subgroups.
Conclusions
Machine learning models substantially outperform existing algorithms based on ICD codes in predicting low acuity ED visits. Variations in model performance across demographic groups highlight the need for further research to ensure their applicability and fairness across diverse populations.
期刊介绍:
Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.