{"title":"Using Electronic Health Records To Generate Phenotypes For Research","authors":"Sarah A. Pendergrass, Dana C. Crawford","doi":"10.1002/cphg.80","DOIUrl":null,"url":null,"abstract":"<p>Electronic health records contain patient-level data collected during and for clinical care. Data within the electronic health record include diagnostic billing codes, procedure codes, vital signs, laboratory test results, clinical imaging, and physician notes. With repeated clinic visits, these data are longitudinal, providing important information on disease development, progression, and response to treatment or intervention strategies. The near universal adoption of electronic health records nationally has the potential to provide population-scale real-world clinical data accessible for biomedical research, including genetic association studies. For this research potential to be realized, high-quality research-grade variables must be extracted from these clinical data warehouses. We describe here common and emerging electronic phenotyping approaches applied to electronic health records, as well as current limitations of both the approaches and the biases associated with these clinically collected data that impact their use in research. © 2018 by John Wiley & Sons, Inc.</p>","PeriodicalId":40007,"journal":{"name":"Current Protocols in Human Genetics","volume":"100 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cphg.80","citationCount":"45","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Protocols in Human Genetics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cphg.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 45
Abstract
Electronic health records contain patient-level data collected during and for clinical care. Data within the electronic health record include diagnostic billing codes, procedure codes, vital signs, laboratory test results, clinical imaging, and physician notes. With repeated clinic visits, these data are longitudinal, providing important information on disease development, progression, and response to treatment or intervention strategies. The near universal adoption of electronic health records nationally has the potential to provide population-scale real-world clinical data accessible for biomedical research, including genetic association studies. For this research potential to be realized, high-quality research-grade variables must be extracted from these clinical data warehouses. We describe here common and emerging electronic phenotyping approaches applied to electronic health records, as well as current limitations of both the approaches and the biases associated with these clinically collected data that impact their use in research. © 2018 by John Wiley & Sons, Inc.
使用电子健康记录为研究生成表型
电子健康记录包含在临床护理期间和为临床护理收集的患者级别数据。电子健康记录中的数据包括诊断账单代码、程序代码、生命体征、实验室测试结果、临床成像和医生记录。通过反复的临床访问,这些数据是纵向的,提供了关于疾病发展、进展和对治疗或干预策略的反应的重要信息。电子健康记录在全国范围内的几乎普遍采用,有可能为生物医学研究提供人口规模的真实世界临床数据,包括遗传关联研究。为了实现这一研究潜力,必须从这些临床数据仓库中提取高质量的研究级变量。我们在这里描述了应用于电子健康记录的常见和新兴的电子表型方法,以及这两种方法的当前局限性和与这些临床收集的数据相关的影响其在研究中使用的偏差。©2018 by John Wiley &儿子,Inc。
本文章由计算机程序翻译,如有差异,请以英文原文为准。