Enhanced phenotypes for identifying opioid overdose in emergency department visit electronic health record data

IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES JAMIA Open Pub Date : 2023-09-11 DOI:10.1093/jamiaopen/ooad081

Ralph Ward, Jihad S Obeid, Lindsey Jennings, Elizabeth Szwast, William Garrett Hayes, Royal Pipaliya, Cameron Bailey, Skylar Faul, Brianna Polyak, George Hamilton Baker, Jenna L McCauley, Leslie A Lenert

{"title":"Enhanced phenotypes for identifying opioid overdose in emergency department visit electronic health record data","authors":"Ralph Ward, Jihad S Obeid, Lindsey Jennings, Elizabeth Szwast, William Garrett Hayes, Royal Pipaliya, Cameron Bailey, Skylar Faul, Brianna Polyak, George Hamilton Baker, Jenna L McCauley, Leslie A Lenert","doi":"10.1093/jamiaopen/ooad081","DOIUrl":null,"url":null,"abstract":"Abstract Background Accurate identification of opioid overdose (OOD) cases in electronic healthcare record (EHR) data is an important element in surveillance, empirical research, and clinical intervention. We sought to improve existing OOD electronic phenotypes by incorporating new data types beyond diagnostic codes and by applying several statistical and machine learning methods. Materials and Methods We developed an EHR dataset of emergency department visits involving OOD cases or patients considered at risk for an OOD and ascertained true OOD status through manual chart reviews. We developed and validated prediction models using Random Forest, Extreme Gradient Boost, and Elastic Net models that incorporated 717 features involving primary and second diagnoses, chief complaints, medications prescribed, vital signs, laboratory results, and procedural codes. We also developed models limited to single data types. Results A total of 1718 records involving 1485 patients were manually reviewed; 541 (36.4%) patients had one or more OOD. Prediction performance was similar for all models; sensitivity varied from 94% to 97%; and area under the receiver operating characteristic curve (AUC) was 98% for all methods. The primary diagnosis and chief complaint were the most important contributors to AUC performance; primary diagnoses and medication class contributed most to sensitivity; chief complaint, primary diagnosis, and vital signs were most important for specificity. Models limited to decision support data types available in real time demonstrated robust prediction performance. Conclusions Substantial prediction performance improvements were demonstrated for identifying OODs in EHR data. Our e-phenotypes could be applied in surveillance, retrospective empirical applications, or clinical decision support systems.","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"6 1","pages":"0"},"PeriodicalIF":3.4000,"publicationDate":"2023-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooad081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Background Accurate identification of opioid overdose (OOD) cases in electronic healthcare record (EHR) data is an important element in surveillance, empirical research, and clinical intervention. We sought to improve existing OOD electronic phenotypes by incorporating new data types beyond diagnostic codes and by applying several statistical and machine learning methods. Materials and Methods We developed an EHR dataset of emergency department visits involving OOD cases or patients considered at risk for an OOD and ascertained true OOD status through manual chart reviews. We developed and validated prediction models using Random Forest, Extreme Gradient Boost, and Elastic Net models that incorporated 717 features involving primary and second diagnoses, chief complaints, medications prescribed, vital signs, laboratory results, and procedural codes. We also developed models limited to single data types. Results A total of 1718 records involving 1485 patients were manually reviewed; 541 (36.4%) patients had one or more OOD. Prediction performance was similar for all models; sensitivity varied from 94% to 97%; and area under the receiver operating characteristic curve (AUC) was 98% for all methods. The primary diagnosis and chief complaint were the most important contributors to AUC performance; primary diagnoses and medication class contributed most to sensitivity; chief complaint, primary diagnosis, and vital signs were most important for specificity. Models limited to decision support data types available in real time demonstrated robust prediction performance. Conclusions Substantial prediction performance improvements were demonstrated for identifying OODs in EHR data. Our e-phenotypes could be applied in surveillance, retrospective empirical applications, or clinical decision support systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在急诊科访问电子健康记录数据中识别阿片类药物过量的增强表型

背景在电子医疗记录(EHR)数据中准确识别阿片类药物过量(OOD)病例是监测、实证研究和临床干预的重要组成部分。我们试图通过结合诊断代码之外的新数据类型以及应用几种统计和机器学习方法来改善现有的OOD电子表型。材料和方法我们建立了一个EHR数据集，包括急诊就诊的OOD病例或被认为有OOD风险的患者，并通过手工图表审查确定了真正的OOD状态。我们使用随机森林、极端梯度增强和弹性网模型开发并验证了预测模型，这些模型纳入了717个特征，包括初诊和二次诊断、主诉、处方药物、生命体征、实验室结果和程序代码。我们还开发了仅限于单一数据类型的模型。结果共手工审核病历1718份，患者1485例;541例(36.4%)患者有一种或多种OOD。所有模型的预测性能相似;灵敏度从94%到97%不等;所有方法的受试者工作特征曲线下面积(AUC)均为98%。初诊和主诉是影响AUC表现的最重要因素;初步诊断和用药类别对敏感性影响最大;主诉、初诊和生命体征对特异性最重要。仅限于实时可用的决策支持数据类型的模型显示了稳健的预测性能。结论在EHR数据中识别OODs的预测性能有了实质性的提高。我们的e表型可以应用于监测，回顾性经验应用，或临床决策支持系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊