Phenotyping Diabetes Mellitus on Aggregated Electronic Health Records from Disparate Health Systems

Hui Xing Tan, R. Lim, Pei San Ang, Belinda P. Q. Foo, Yen Ling Koon, Jing Wei Neo, Amelia Jing Jing Ng, S. Tan, D. Teo, Mun Yee Tham, Aaron Jun Yi Yap, Nicholas Kai Ming Ng, C. Loke, Li Fung Peck, Huilin Huang, S. Dorajoo
{"title":"Phenotyping Diabetes Mellitus on Aggregated Electronic Health Records from Disparate Health Systems","authors":"Hui Xing Tan, R. Lim, Pei San Ang, Belinda P. Q. Foo, Yen Ling Koon, Jing Wei Neo, Amelia Jing Jing Ng, S. Tan, D. Teo, Mun Yee Tham, Aaron Jun Yi Yap, Nicholas Kai Ming Ng, C. Loke, Li Fung Peck, Huilin Huang, S. Dorajoo","doi":"10.3390/pharma2030019","DOIUrl":null,"url":null,"abstract":"Background: Identifying patients with diabetes mellitus (DM) is often performed in epidemiological studies using electronic health records (EHR), but currently available algorithms have features that limit their generalizability. Methods: We developed a rule-based algorithm to determine DM status using the nationally aggregated EHR database. The algorithm was validated on two chart-reviewed samples (n = 2813) of (a) patients with atrial fibrillation (AF, n = 1194) and (b) randomly sampled hospitalized patients (n = 1619). Results: DM diagnosis codes alone resulted in a sensitivity of 77.0% and 83.4% in the AF and random hospitalized samples, respectively. The proposed algorithm combines blood glucose values and DM medication usage with diagnostic codes and exhibits sensitivities between 96.9% and 98.0%, while positive predictive values (PPV) ranged between 61.1% and 75.6%. Performances were comparable across sexes, but a lower specificity was observed in younger patients (below 65 versus 65 and above) in both validation samples (75.8% vs. 90.8% and 60.6% vs. 88.8%). The algorithm was robust for missing laboratory data but not for missing medication data. Conclusions: In this nationwide EHR database analysis, an algorithm for identifying patients with DM has been developed and validated. The algorithm supports quantitative bias analyses in future studies involving EHR-based DM studies.","PeriodicalId":74431,"journal":{"name":"Pharmacoepidemiology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmacoepidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/pharma2030019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Identifying patients with diabetes mellitus (DM) is often performed in epidemiological studies using electronic health records (EHR), but currently available algorithms have features that limit their generalizability. Methods: We developed a rule-based algorithm to determine DM status using the nationally aggregated EHR database. The algorithm was validated on two chart-reviewed samples (n = 2813) of (a) patients with atrial fibrillation (AF, n = 1194) and (b) randomly sampled hospitalized patients (n = 1619). Results: DM diagnosis codes alone resulted in a sensitivity of 77.0% and 83.4% in the AF and random hospitalized samples, respectively. The proposed algorithm combines blood glucose values and DM medication usage with diagnostic codes and exhibits sensitivities between 96.9% and 98.0%, while positive predictive values (PPV) ranged between 61.1% and 75.6%. Performances were comparable across sexes, but a lower specificity was observed in younger patients (below 65 versus 65 and above) in both validation samples (75.8% vs. 90.8% and 60.6% vs. 88.8%). The algorithm was robust for missing laboratory data but not for missing medication data. Conclusions: In this nationwide EHR database analysis, an algorithm for identifying patients with DM has been developed and validated. The algorithm supports quantitative bias analyses in future studies involving EHR-based DM studies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
不同医疗系统汇总电子健康记录的糖尿病表型
背景:在流行病学研究中,识别糖尿病患者通常使用电子健康记录(EHR),但目前可用的算法具有限制其可推广性的特点。方法:我们使用全国汇总的EHR数据库开发了一种基于规则的算法来确定DM状态。该算法在两个图表审查样本(n=2813)上进行了验证,其中(a)心房颤动(AF,n=1194)患者和(b)随机抽样住院患者(n=1619)。结果:仅DM诊断代码在AF和随机住院样本中的敏感性分别为77.0%和83.4%。所提出的算法将血糖值和糖尿病药物使用与诊断代码相结合,灵敏度在96.9%至98.0%之间,阳性预测值(PPV)在61.1%至75.6%之间,但在两个验证样本中,年轻患者的特异性较低(65岁以下对65岁及以上)(75.8%对90.8%和60.6%对88.8%)。该算法对缺失的实验室数据很稳健,但对缺失的药物数据不稳健。结论:在这项全国性的EHR数据库分析中,已经开发并验证了一种识别DM患者的算法。该算法支持在未来涉及基于EHR的DM研究的研究中进行定量偏差分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
EVALUATION OF SCREENING TOOL OF OLDER PEOPLE'S PRESCRIPTIONS (STOPP) CRITERIA IN AN URBAN COHORT OF OLDER PEOPLE WITH HIV. Associations between Suspected Adverse Drug Reactions of HMG-CoA Reductase Inhibitors and Polypharmacology Using a National Registry Approach A Retrospective Analysis of the Clinical Effectiveness of Tigecycline in the Treatment of Clostridioides difficile-Associated Diarrhea Association of Receipt of Opioid Prescription for Acute Post-Delivery Pain Management with Buprenorphine Discontinuation among Postpartum People with Opioid Use Disorder Pioneering Arterial Hypertension Phenotyping on Nationally Aggregated Electronic Health Records
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1