Application of a Data Quality Framework to Ductal Carcinoma In Situ Using Electronic Health Record Data From the All of Us Research Program.

IF 3.3 Q2 ONCOLOGY JCO Clinical Cancer Informatics Pub Date : 2024-08-01 DOI:10.1200/CCI.24.00052
Lew Berman, Yechiam Ostchega, John Giannini, Lakshmi Priya Anandan, Emily Clark, Matthew Spotnitz, Lina Sulieman, Michael Volynski, Andrea Ramirez
{"title":"Application of a Data Quality Framework to Ductal Carcinoma In Situ Using Electronic Health Record Data From the <i>All of Us</i> Research Program.","authors":"Lew Berman, Yechiam Ostchega, John Giannini, Lakshmi Priya Anandan, Emily Clark, Matthew Spotnitz, Lina Sulieman, Michael Volynski, Andrea Ramirez","doi":"10.1200/CCI.24.00052","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The specific aims of this paper are to (1) develop and operationalize an electronic health record (EHR) data quality framework, (2) apply the dimensions of the framework to the phenotype and treatment pathways of ductal carcinoma in situ (DCIS) using <i>All of Us</i> Research Program data, and (3) propose and apply a checklist to evaluate the application of the framework.</p><p><strong>Methods: </strong>We developed a framework of five data quality dimensions (DQD; completeness, concordance, conformance, plausibility, and temporality). Participants signed a consent and Health Insurance Portability and Accountability Act authorization to share EHR data and responded to demographic questions in the Basics questionnaire. We evaluated the internal characteristics of the data and compared data with external benchmarks with descriptive and inferential statistics. We developed a DQD checklist to evaluate concept selection, internal verification, and external validity for each DQD. The Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) concept ID codes for DCIS were used to select a cohort of 2,209 females 18 years and older.</p><p><strong>Results: </strong>Using the proposed DQD checklist criteria, (1) concepts were selected and internally verified for conformance; (2) concepts were selected and internally verified for completeness; (3) concepts were selected, internally verified, and externally validated for concordance; (4) concepts were selected, internally verified, and externally validated for plausibility; and (5) concepts were selected, internally verified, and externally validated for temporality.</p><p><strong>Conclusion: </strong>This assessment and evaluation provided insights into data quality for the DCIS phenotype using EHR data from the <i>All of Us</i> Research Program. The review demonstrates that salient clinical measures can be selected, applied, and operationalized within a conceptual framework and evaluated for fitness for use by applying a proposed checklist.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.24.00052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: The specific aims of this paper are to (1) develop and operationalize an electronic health record (EHR) data quality framework, (2) apply the dimensions of the framework to the phenotype and treatment pathways of ductal carcinoma in situ (DCIS) using All of Us Research Program data, and (3) propose and apply a checklist to evaluate the application of the framework.

Methods: We developed a framework of five data quality dimensions (DQD; completeness, concordance, conformance, plausibility, and temporality). Participants signed a consent and Health Insurance Portability and Accountability Act authorization to share EHR data and responded to demographic questions in the Basics questionnaire. We evaluated the internal characteristics of the data and compared data with external benchmarks with descriptive and inferential statistics. We developed a DQD checklist to evaluate concept selection, internal verification, and external validity for each DQD. The Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) concept ID codes for DCIS were used to select a cohort of 2,209 females 18 years and older.

Results: Using the proposed DQD checklist criteria, (1) concepts were selected and internally verified for conformance; (2) concepts were selected and internally verified for completeness; (3) concepts were selected, internally verified, and externally validated for concordance; (4) concepts were selected, internally verified, and externally validated for plausibility; and (5) concepts were selected, internally verified, and externally validated for temporality.

Conclusion: This assessment and evaluation provided insights into data quality for the DCIS phenotype using EHR data from the All of Us Research Program. The review demonstrates that salient clinical measures can be selected, applied, and operationalized within a conceptual framework and evaluated for fitness for use by applying a proposed checklist.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 "我们所有人 "研究计划的电子健康记录数据,将数据质量框架应用于原位导管癌。
目的:本文的具体目的是:(1) 制定电子健康记录(EHR)数据质量框架并使之可操作化;(2) 利用 "我们所有人 "研究计划数据,将该框架的各个维度应用于乳腺导管原位癌(DCIS)的表型和治疗途径;(3) 提出并应用检查表来评估该框架的应用:我们开发了一个包含五个数据质量维度(DQD;完整性、一致性、连贯性、可信性和时间性)的框架。参与者签署了同意书和《健康保险可携性与责任法案》(Health Insurance Portability and Accountability Act)授权书以共享电子病历数据,并回答了基础知识问卷中的人口统计学问题。我们评估了数据的内部特征,并通过描述性和推论性统计将数据与外部基准进行了比较。我们制定了一份 DQD 核对表,以评估每个 DQD 的概念选择、内部验证和外部有效性。我们使用观察性医疗结果合作组织通用数据模型(OMOP CDM)的DCIS概念ID代码选择了2209名18岁及18岁以上的女性:使用提出的 DQD 核对表标准,(1) 选择了概念,并对其一致性进行了内部验证;(2) 选择了概念,并对其完整性进行了内部验证;(3) 选择了概念,并对其一致性进行了内部验证和外部验证;(4) 选择了概念,并对其合理性进行了内部验证和外部验证;(5) 选择了概念,并对其时间性进行了内部验证和外部验证:本次评估和评价利用 "我们所有人 "研究计划的电子病历数据,对 DCIS 表型的数据质量进行了深入了解。审查表明,可以在概念框架内选择、应用和操作突出的临床措施,并通过应用建议的核对表来评估是否适合使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.20
自引率
4.80%
发文量
190
期刊最新文献
Identifying Oncology Patients at High Risk for Potentially Preventable Emergency Department Visits Using a Novel Definition. Use of Patient-Reported Outcomes in Risk Prediction Model Development to Support Cancer Care Delivery: A Scoping Review. Optimizing End Points for Phase III Cancer Trials. Informatics and Artificial Intelligence-Guided Assessment of the Regulatory and Translational Research Landscape of First-in-Class Oncology Drugs in the United States, 2018-2022. Development and Portability of a Text Mining Algorithm for Capturing Disease Progression in Electronic Health Records of Patients With Stage IV Non-Small Cell Lung Cancer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1