Understanding data provenance when using electronic medical records for research: Lessons learned from the Deliver Primary Healthcare Information (DELPHI) database

Jason Edward Black, Amanda Terry, Sonny Cejic, Thomas Freeman, Daniel Lizotte, Scott McKay, Mark Speechley, Bridget Ryan
{"title":"Understanding data provenance when using electronic medical records for research: Lessons learned from the Deliver Primary Healthcare Information (DELPHI) database","authors":"Jason Edward Black, Amanda Terry, Sonny Cejic, Thomas Freeman, Daniel Lizotte, Scott McKay, Mark Speechley, Bridget Ryan","doi":"10.23889/ijpds.v8i5.2177","DOIUrl":null,"url":null,"abstract":"IntroductionWe set out to assess the impact of Choosing Wisely Canada recommendations (2014) on reducing unnecessary health investigations and interventions in primary care across Southwestern Ontario. MethodsWe used the Deliver Primary Healthcare Information (DELPHI) database, which stores deidentified electronic medical records (EMR) of nearly 65,000 primary care patients across Southwestern Ontario. When conducting research using EMR data, data provenance (i.e., how the data came to be) should first be established. We first considered DELPHI data provenance in relation to longitudinal analyses, flagging a change in EMR software that occurred during 2012 and 2013. We attempted to link records between EMR databases produced by different software using probabilistic linkage and inspected 10 years of data in the DELPHI database (2009 to 2019) for data quality issues, including comparability over time. ResultsWe encountered several issues resulting from this change in EMR software. These included limited linkage of records between software without a common identifier; data migration issues that distorted procedure dates; and unusual changes in laboratory test and medication prescription volumes. ConclusionThis study reinforces the necessity of assessing data provenance and quality for new research projects. By understanding data provenance, we can anticipate related data quality issues such as changes in EMR data over time-which represent a growing concern as longitudinal data analyses increase in feasibility and popularity.","PeriodicalId":132937,"journal":{"name":"International Journal for Population Data Science","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal for Population Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23889/ijpds.v8i5.2177","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

IntroductionWe set out to assess the impact of Choosing Wisely Canada recommendations (2014) on reducing unnecessary health investigations and interventions in primary care across Southwestern Ontario. MethodsWe used the Deliver Primary Healthcare Information (DELPHI) database, which stores deidentified electronic medical records (EMR) of nearly 65,000 primary care patients across Southwestern Ontario. When conducting research using EMR data, data provenance (i.e., how the data came to be) should first be established. We first considered DELPHI data provenance in relation to longitudinal analyses, flagging a change in EMR software that occurred during 2012 and 2013. We attempted to link records between EMR databases produced by different software using probabilistic linkage and inspected 10 years of data in the DELPHI database (2009 to 2019) for data quality issues, including comparability over time. ResultsWe encountered several issues resulting from this change in EMR software. These included limited linkage of records between software without a common identifier; data migration issues that distorted procedure dates; and unusual changes in laboratory test and medication prescription volumes. ConclusionThis study reinforces the necessity of assessing data provenance and quality for new research projects. By understanding data provenance, we can anticipate related data quality issues such as changes in EMR data over time-which represent a growing concern as longitudinal data analyses increase in feasibility and popularity.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在使用电子医疗记录进行研究时了解数据来源:从提供初级医疗保健信息(DELPHI)数据库获得的经验教训
我们开始评估明智选择加拿大建议(2014)对减少安大略省西南部初级保健中不必要的健康调查和干预的影响。方法我们使用提供初级医疗保健信息(DELPHI)数据库,该数据库存储了安大略省西南部近65,000名初级保健患者的未识别电子医疗记录(EMR)。在使用电子病历数据进行研究时,首先应该确定数据的来源(即数据是如何产生的)。我们首先考虑了与纵向分析相关的DELPHI数据来源,标记了2012年和2013年发生的EMR软件变化。我们尝试使用概率链接将不同软件生成的EMR数据库之间的记录链接起来,并检查了DELPHI数据库中10年(2009年至2019年)的数据质量问题,包括随时间的可比性。结果我们在EMR软件中遇到了由于这一更改而导致的几个问题。其中包括在没有通用标识符的软件之间有限的记录链接;扭曲程序日期的数据迁移问题;以及实验室检测和药物处方量的异常变化。结论本研究强调了对新研究项目进行数据来源和质量评估的必要性。通过了解数据来源,我们可以预测相关的数据质量问题,如EMR数据随时间的变化——随着纵向数据分析的可行性和普及程度的提高,这一问题越来越受到关注。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using novel data linkage of biobank data with administrative health data to inform genomic analysis for future precision medicine treatment of congenital heart disease Common governance model: a way to avoid data segregation between existing trusted research environment Federated learning for generating synthetic data: a scoping review Health Data Governance for Research Use in Alberta Establishment of a birth-to-education cohort of 1 million Palestinian refugees using electronic medical records and electronic education records
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1