Organizational Breast Cancer Data Mart: A Solution for Assessing Outcomes of Imaging and Treatment.

IF 3.3 Q2 ONCOLOGY JCO Clinical Cancer Informatics Pub Date : 2024-04-01 DOI:10.1200/CCI.23.00193
Margarita L Zuley, Jonathan Silverstein, Durwin Logue, Richard S Morgan, Rohit Bhargava, Priscilla F. McAuliffe, A. Brufsky, Andriy I Bandos, Robert M. Nishikawa
{"title":"Organizational Breast Cancer Data Mart: A Solution for Assessing Outcomes of Imaging and Treatment.","authors":"Margarita L Zuley, Jonathan Silverstein, Durwin Logue, Richard S Morgan, Rohit Bhargava, Priscilla F. McAuliffe, A. Brufsky, Andriy I Bandos, Robert M. Nishikawa","doi":"10.1200/CCI.23.00193","DOIUrl":null,"url":null,"abstract":"PURPOSE\nIn the United States, a comprehensive national breast cancer registry (CR) does not exist. Thus, care and coverage decisions are based on data from population subsets, other countries, or models. We report a prototype real-world research data mart to assess mortality, morbidity, and costs for breast cancer diagnosis and treatment.\n\n\nMETHODS\nWith institutional review board approval and Health Insurance Portability and Accountability Act (HIPPA) compliance, a multidisciplinary clinical and research data warehouse (RDW) expert group curated demographic, risk, imaging, pathology, treatment, and outcome data from the electronic health records (EHR), radiology (RIS), and CR for patients having breast imaging and/or a diagnosis of breast cancer in our institution from January 1, 2004, to December 31, 2020. Domains were defined by prebuilt views to extract data denormalized according to requirements from the existing RDW using an export, transform, load pattern. Data dictionaries were included. Structured query language was used for data cleaning.\n\n\nRESULTS\nFive-hundred eighty-nine elements (EHR 311, RIS 211, and CR 67) were mapped to 27 domains; all, except one containing CR elements, had cancer and noncancer cohort views, resulting in a total of 53 views (average 12 elements/view; range, 4-67). EHR and RIS queries returned 497,218 patients with 2,967,364 imaging examinations and associated visit details. Cancer biology, treatment, and outcome details for 15,619 breast cancer cases were imported from the CR of our primary breast care facility for this prototype mart.\n\n\nCONCLUSION\nInstitutional real-world data marts enable comprehensive understanding of care outcomes within an organization. As clinical data sources become increasingly structured, such marts may be an important source for future interinstitution analysis and potentially an opportunity to create robust real-world results that could be used to support evidence-based national policy and care decisions for breast cancer.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

PURPOSE In the United States, a comprehensive national breast cancer registry (CR) does not exist. Thus, care and coverage decisions are based on data from population subsets, other countries, or models. We report a prototype real-world research data mart to assess mortality, morbidity, and costs for breast cancer diagnosis and treatment. METHODS With institutional review board approval and Health Insurance Portability and Accountability Act (HIPPA) compliance, a multidisciplinary clinical and research data warehouse (RDW) expert group curated demographic, risk, imaging, pathology, treatment, and outcome data from the electronic health records (EHR), radiology (RIS), and CR for patients having breast imaging and/or a diagnosis of breast cancer in our institution from January 1, 2004, to December 31, 2020. Domains were defined by prebuilt views to extract data denormalized according to requirements from the existing RDW using an export, transform, load pattern. Data dictionaries were included. Structured query language was used for data cleaning. RESULTS Five-hundred eighty-nine elements (EHR 311, RIS 211, and CR 67) were mapped to 27 domains; all, except one containing CR elements, had cancer and noncancer cohort views, resulting in a total of 53 views (average 12 elements/view; range, 4-67). EHR and RIS queries returned 497,218 patients with 2,967,364 imaging examinations and associated visit details. Cancer biology, treatment, and outcome details for 15,619 breast cancer cases were imported from the CR of our primary breast care facility for this prototype mart. CONCLUSION Institutional real-world data marts enable comprehensive understanding of care outcomes within an organization. As clinical data sources become increasingly structured, such marts may be an important source for future interinstitution analysis and potentially an opportunity to create robust real-world results that could be used to support evidence-based national policy and care decisions for breast cancer.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
组织乳腺癌数据集市:评估成像和治疗结果的解决方案。
目的 在美国,还没有一个全面的全国性乳腺癌登记处(CR)。因此,护理和保险决策都是基于来自人口子集、其他国家或模型的数据。我们报告了一个真实世界研究数据集市原型,用于评估乳腺癌诊断和治疗的死亡率、发病率和成本。方法经机构审查委员会批准并遵守《健康保险可携性和责任法案》(HIPPA),一个多学科临床和研究数据仓库(RDW)专家组从电子健康记录(EHR)、放射学(RIS)和 CR 中收集了本机构 2004 年 1 月 1 日至 2020 年 12 月 31 日期间乳腺成像和/或乳腺癌诊断患者的人口统计学、风险、成像、病理学、治疗和结果数据。域由预建视图定义,以便使用导出、转换、加载模式从现有 RDW 中提取符合要求的去规范化数据。数据字典也包括在内。结果589个元素(EHR 311个、RIS 211个和CR 67个)被映射到27个域;除一个包含CR元素的域外,所有域都有癌症和非癌症队列视图,因此共有53个视图(平均12个元素/视图;范围4-67)。EHR 和 RIS 查询返回了 497,218 名患者的 2,967,364 次成像检查和相关就诊详情。该原型市场从我们主要乳腺医疗机构的 CR 中导入了 15619 个乳腺癌病例的癌症生物学、治疗和结果详情。随着临床数据源的结构化程度越来越高,此类数据集市可能会成为未来机构间分析的重要来源,并有可能成为创建强大的真实世界结果的机会,这些结果可用于支持以证据为基础的乳腺癌国家政策和护理决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.20
自引率
4.80%
发文量
190
期刊最新文献
Increasing Power in Phase III Oncology Trials With Multivariable Regression: An Empirical Assessment of 535 Primary End Point Analyses. Validation of Non-Small Cell Lung Cancer Clinical Insights Using a Generalized Oncology Natural Language Processing Model. Interinstitutional Approach to Advancing Geospatial Technologies for US Cancer Centers. Classification and Regression Trees to Predict for Survival for Patients With Hepatocellular Carcinoma Treated With Atezolizumab and Bevacizumab. Cureit: An End-to-End Pipeline for Implementing Mixture Cure Models With an Application to Liposarcoma Data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1