Ralph Ward, Jihad S Obeid, Lindsey Jennings, Elizabeth Szwast, William Garrett Hayes, Royal Pipaliya, Cameron Bailey, Skylar Faul, Brianna Polyak, George Hamilton Baker, Jenna L McCauley, Leslie A Lenert
{"title":"Enhanced phenotypes for identifying opioid overdose in emergency department visit electronic health record data","authors":"Ralph Ward, Jihad S Obeid, Lindsey Jennings, Elizabeth Szwast, William Garrett Hayes, Royal Pipaliya, Cameron Bailey, Skylar Faul, Brianna Polyak, George Hamilton Baker, Jenna L McCauley, Leslie A Lenert","doi":"10.1093/jamiaopen/ooad081","DOIUrl":null,"url":null,"abstract":"Abstract Background Accurate identification of opioid overdose (OOD) cases in electronic healthcare record (EHR) data is an important element in surveillance, empirical research, and clinical intervention. We sought to improve existing OOD electronic phenotypes by incorporating new data types beyond diagnostic codes and by applying several statistical and machine learning methods. Materials and Methods We developed an EHR dataset of emergency department visits involving OOD cases or patients considered at risk for an OOD and ascertained true OOD status through manual chart reviews. We developed and validated prediction models using Random Forest, Extreme Gradient Boost, and Elastic Net models that incorporated 717 features involving primary and second diagnoses, chief complaints, medications prescribed, vital signs, laboratory results, and procedural codes. We also developed models limited to single data types. Results A total of 1718 records involving 1485 patients were manually reviewed; 541 (36.4%) patients had one or more OOD. Prediction performance was similar for all models; sensitivity varied from 94% to 97%; and area under the receiver operating characteristic curve (AUC) was 98% for all methods. The primary diagnosis and chief complaint were the most important contributors to AUC performance; primary diagnoses and medication class contributed most to sensitivity; chief complaint, primary diagnosis, and vital signs were most important for specificity. Models limited to decision support data types available in real time demonstrated robust prediction performance. Conclusions Substantial prediction performance improvements were demonstrated for identifying OODs in EHR data. Our e-phenotypes could be applied in surveillance, retrospective empirical applications, or clinical decision support systems.","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2023-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooad081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Background Accurate identification of opioid overdose (OOD) cases in electronic healthcare record (EHR) data is an important element in surveillance, empirical research, and clinical intervention. We sought to improve existing OOD electronic phenotypes by incorporating new data types beyond diagnostic codes and by applying several statistical and machine learning methods. Materials and Methods We developed an EHR dataset of emergency department visits involving OOD cases or patients considered at risk for an OOD and ascertained true OOD status through manual chart reviews. We developed and validated prediction models using Random Forest, Extreme Gradient Boost, and Elastic Net models that incorporated 717 features involving primary and second diagnoses, chief complaints, medications prescribed, vital signs, laboratory results, and procedural codes. We also developed models limited to single data types. Results A total of 1718 records involving 1485 patients were manually reviewed; 541 (36.4%) patients had one or more OOD. Prediction performance was similar for all models; sensitivity varied from 94% to 97%; and area under the receiver operating characteristic curve (AUC) was 98% for all methods. The primary diagnosis and chief complaint were the most important contributors to AUC performance; primary diagnoses and medication class contributed most to sensitivity; chief complaint, primary diagnosis, and vital signs were most important for specificity. Models limited to decision support data types available in real time demonstrated robust prediction performance. Conclusions Substantial prediction performance improvements were demonstrated for identifying OODs in EHR data. Our e-phenotypes could be applied in surveillance, retrospective empirical applications, or clinical decision support systems.