Implementing Accuracy, Completeness, and Traceability for Data Reliability.

IF 9.7 1区 医学 Q1 MEDICINE, GENERAL & INTERNAL JAMA Network Open Pub Date : 2025-03-03 DOI:10.1001/jamanetworkopen.2025.0128
Daniel Jay Riskin, Keri L Monda, Joshua J Gagne, Robert Reynolds, A Reshad Garan, Nancy Dreyer, Paul Muntner, Brian D Bradbury
{"title":"Implementing Accuracy, Completeness, and Traceability for Data Reliability.","authors":"Daniel Jay Riskin, Keri L Monda, Joshua J Gagne, Robert Reynolds, A Reshad Garan, Nancy Dreyer, Paul Muntner, Brian D Bradbury","doi":"10.1001/jamanetworkopen.2025.0128","DOIUrl":null,"url":null,"abstract":"<p><strong>Importance: </strong>While it is well known that data quality underlies evidence validity, the measurement and impacts of data reliability are less well understood. The need has been highlighted in the 21st Century Cures Act of 2016 and US Food and Drug Administration (FDA) Real-World Evidence Program framework in 2018, draft guidance in 2021 and final guidance in 2024. Timely visibility into implementation may be provided by the Transforming Real-World Evidence With Unstructured and Structured Data to Advance Tailored Therapy (TRUST) study, a Verantos Inc-led FDA-funded demonstration project to explore data quality and inform regulatory decision-making.</p><p><strong>Objective: </strong>To report early learnings from the TRUST study on distilling data reliability to practice including developing a practical approach to quantify accuracy, completeness, and traceability of real-world data (routinely collected patient health data) and comparing traditional to advanced data and technologies on these dimensions.</p><p><strong>Design, setting, and participants: </strong>This quality improvement study was performed using data from 58 hospitals and more than 1180 associated outpatient clinics from academic and community settings in the US. Participants included patients with asthma treated between January 1, 2014, and December 31, 2022. Data were analyzed from January 1 to June 30, 2024.</p><p><strong>Exposures: </strong>The traditional approach used medical and pharmacy claims as source documentation. The advanced approach used medical and pharmacy claims, electronic health records with unstructured data extracted using artificial intelligence methods, and mortality registry data.</p><p><strong>Main outcomes and measures: </strong>Accuracy was assessed using the F1 score. Completeness was estimated as a weighted mean of available data sources during each calendar year under study for each patient. Traceability was estimated as the proportion of data elements identified in clinical source documentation.</p><p><strong>Results: </strong>In total, 120 616 patients met the minimum data requirements (mean [SD] age, 43.2 [18.5] years; 41 011 male [34.0%]). For accuracy, traditional approaches had F1 scores of 59.5% and advanced approaches had scores of 93.4%. For completeness, traditional approaches yielded mean scores of 46.1% (95% CI, 38.2%-54.0%); advanced approaches, 96.6% (95% CI, 85.8%-1.1%). For traceability, traditional approaches had 11.5% (95% CI, 11.4%-11.5%) and advanced approaches had 77.3% (95% CI, 77.3%-77.3%) of data elements traceable to clinical source data.</p><p><strong>Conclusions and relevance: </strong>In this study, practical implementation of data reliability measurement is described. Findings suggest the potential of using multiple data sources and applying advanced methods to increase real-world data reliability. The inclusion of data reliability standards when generating evidence from these sources has the potential to strengthen support for the use of real-world evidence in the prescription, reimbursement, and approval of medications.</p>","PeriodicalId":14694,"journal":{"name":"JAMA Network Open","volume":"8 3","pages":"e250128"},"PeriodicalIF":9.7000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11894483/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMA Network Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1001/jamanetworkopen.2025.0128","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Importance: While it is well known that data quality underlies evidence validity, the measurement and impacts of data reliability are less well understood. The need has been highlighted in the 21st Century Cures Act of 2016 and US Food and Drug Administration (FDA) Real-World Evidence Program framework in 2018, draft guidance in 2021 and final guidance in 2024. Timely visibility into implementation may be provided by the Transforming Real-World Evidence With Unstructured and Structured Data to Advance Tailored Therapy (TRUST) study, a Verantos Inc-led FDA-funded demonstration project to explore data quality and inform regulatory decision-making.

Objective: To report early learnings from the TRUST study on distilling data reliability to practice including developing a practical approach to quantify accuracy, completeness, and traceability of real-world data (routinely collected patient health data) and comparing traditional to advanced data and technologies on these dimensions.

Design, setting, and participants: This quality improvement study was performed using data from 58 hospitals and more than 1180 associated outpatient clinics from academic and community settings in the US. Participants included patients with asthma treated between January 1, 2014, and December 31, 2022. Data were analyzed from January 1 to June 30, 2024.

Exposures: The traditional approach used medical and pharmacy claims as source documentation. The advanced approach used medical and pharmacy claims, electronic health records with unstructured data extracted using artificial intelligence methods, and mortality registry data.

Main outcomes and measures: Accuracy was assessed using the F1 score. Completeness was estimated as a weighted mean of available data sources during each calendar year under study for each patient. Traceability was estimated as the proportion of data elements identified in clinical source documentation.

Results: In total, 120 616 patients met the minimum data requirements (mean [SD] age, 43.2 [18.5] years; 41 011 male [34.0%]). For accuracy, traditional approaches had F1 scores of 59.5% and advanced approaches had scores of 93.4%. For completeness, traditional approaches yielded mean scores of 46.1% (95% CI, 38.2%-54.0%); advanced approaches, 96.6% (95% CI, 85.8%-1.1%). For traceability, traditional approaches had 11.5% (95% CI, 11.4%-11.5%) and advanced approaches had 77.3% (95% CI, 77.3%-77.3%) of data elements traceable to clinical source data.

Conclusions and relevance: In this study, practical implementation of data reliability measurement is described. Findings suggest the potential of using multiple data sources and applying advanced methods to increase real-world data reliability. The inclusion of data reliability standards when generating evidence from these sources has the potential to strengthen support for the use of real-world evidence in the prescription, reimbursement, and approval of medications.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
实现数据可靠性的准确性、完整性和可追溯性。
重要性:虽然众所周知,数据质量是证据有效性的基础,但对数据可靠性的测量和影响却知之甚少。2016年的《21世纪治愈法案》和2018年的美国食品和药物管理局(FDA)真实世界证据计划框架、2021年的指南草案和2024年的最终指南都强调了这一需求。使用非结构化和结构化数据转化真实世界证据以推进量身定制治疗(TRUST)研究可以提供及时的实施可见性,该研究是Verantos公司领导的fda资助的示范项目,旨在探索数据质量并为监管决策提供信息。目的:报告TRUST研究关于将数据可靠性提炼到实践中的早期经验,包括开发一种实用方法来量化真实世界数据(常规收集的患者健康数据)的准确性、完整性和可追溯性,并在这些方面比较传统数据和先进数据和技术。设计、环境和参与者:这项质量改进研究使用了来自美国58家医院和1180多家相关门诊诊所的数据,这些门诊诊所来自学术和社区环境。参与者包括2014年1月1日至2022年12月31日期间接受治疗的哮喘患者。数据分析时间为2024年1月1日至6月30日。暴露:传统方法使用医疗和药房索赔作为源文件。先进的方法使用了医疗和药房索赔、使用人工智能方法提取的带有非结构化数据的电子健康记录以及死亡率登记数据。主要结果和测量方法:使用F1评分评估准确性。完整性以每个患者在研究的每个日历年中可用数据源的加权平均值来估计。可追溯性被估计为临床源文件中确定的数据元素的比例。结果:共有120例 616例患者符合最低数据要求(平均[SD]年龄43.2[18.5]岁;41 011男性[34.0%])。准确率方面,传统方法F1得分为59.5%,先进方法F1得分为93.4%。为了完整性,传统方法的平均得分为46.1% (95% CI, 38.2%-54.0%);晚期入路占96.6% (95% CI, 85.8%-1.1%)。在可追溯性方面,传统方法有11.5% (95% CI, 11.4%-11.5%),先进方法有77.3% (95% CI, 77.3%-77.3%)的数据元素可追溯至临床源数据。结论与相关性:本研究描述了数据可靠性测量的实际实施。研究结果表明,使用多个数据源和应用先进方法来提高真实世界数据可靠性的潜力。当从这些来源产生证据时,纳入数据可靠性标准有可能加强对在药物处方、报销和批准中使用真实证据的支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
JAMA Network Open
JAMA Network Open Medicine-General Medicine
CiteScore
16.00
自引率
2.90%
发文量
2126
审稿时长
16 weeks
期刊介绍: JAMA Network Open, a member of the esteemed JAMA Network, stands as an international, peer-reviewed, open-access general medical journal.The publication is dedicated to disseminating research across various health disciplines and countries, encompassing clinical care, innovation in health care, health policy, and global health. JAMA Network Open caters to clinicians, investigators, and policymakers, providing a platform for valuable insights and advancements in the medical field. As part of the JAMA Network, a consortium of peer-reviewed general medical and specialty publications, JAMA Network Open contributes to the collective knowledge and understanding within the medical community.
期刊最新文献
Media Reporting of the 2024 US Preventive Services Task Force Mammography Guideline Update. Early-Stage Lung Cancer Treatment Disparities by Race Among Medicare Beneficiaries. Measuring the Impacts of RNA Vaccine Research and the Consequences of Defunding. National Institutes of Health Funding for RNA Vaccine Research. Long COVID and Recovery Among US Adults.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1