Converting Health Level 7 Clinical Document Architecture (CDA) documents to Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) by leveraging CDA Template definitions.

IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES JAMIA Open Pub Date : 2025-03-26 eCollection Date: 2025-04-01 DOI:10.1093/jamiaopen/ooaf022
Florian Katsch, Rada Hussein, Tanja Stamm, Georg Duftschmid
{"title":"Converting Health Level 7 Clinical Document Architecture (CDA) documents to Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) by leveraging CDA Template definitions.","authors":"Florian Katsch, Rada Hussein, Tanja Stamm, Georg Duftschmid","doi":"10.1093/jamiaopen/ooaf022","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This work aims to develop a methodology for transforming Health Level 7 (HL7) Clinical Document Architecture (CDA) documents into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). The described method seeks to improve the Extract, Transform, Load (ETL) design process by using HL7 CDA Template definitions and the CDA Refined Message Information Model (CDA R-MIM).</p><p><strong>Material and methods: </strong>Our approach utilizes HL7 CDA Templates to define structural and semantic mappings. Supported by the CDA R-MIM for semantic alignment with the OMOP CDM, we developed a tool named CDA Rabbit that enables the generation of Rabbit-In-a-Hat project files from HL7 CDA Template definitions and could be successfully integrated into the existing toolchain around OMOP.</p><p><strong>Results: </strong>We tested our approach using 13 CDA Templates from the Austrian national EHR System (ELGA) and 430 anonymized CDA test documents that were mapped to 10 OMOP CDM tables. The data quality assessment, using OMOP's DataQualityDashboard, showed a 99% pass rate, indicating a robust and accurate data transformation.</p><p><strong>Conclusion: </strong>This study presents a novel framework for transforming HL7 CDA documents into OMOP CDM using template definitions and CDA R-MIM. The methodology improves semantic interoperability, mapping reusability, and ETL design efficiency. Future work should focus on automating code generation, improving data profiling, and addressing cyclic dependencies within CDA templates. The presented approach supports improved secondary use of health data and research while adhering to standardized data models and semantics.</p><p><strong>Discussion: </strong>Using CDA Templates for ETL design addresses common ETL challenges, such as data accessibility during ETL design, by decoupling the process from the actual CDA instances. Future work could focus on extending this approach to automatically generate boilerplate code, address cyclic dependencies within CDA Templates, and adapt the method for the use with FHIR profiles.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf022"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11945294/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooaf022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: This work aims to develop a methodology for transforming Health Level 7 (HL7) Clinical Document Architecture (CDA) documents into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). The described method seeks to improve the Extract, Transform, Load (ETL) design process by using HL7 CDA Template definitions and the CDA Refined Message Information Model (CDA R-MIM).

Material and methods: Our approach utilizes HL7 CDA Templates to define structural and semantic mappings. Supported by the CDA R-MIM for semantic alignment with the OMOP CDM, we developed a tool named CDA Rabbit that enables the generation of Rabbit-In-a-Hat project files from HL7 CDA Template definitions and could be successfully integrated into the existing toolchain around OMOP.

Results: We tested our approach using 13 CDA Templates from the Austrian national EHR System (ELGA) and 430 anonymized CDA test documents that were mapped to 10 OMOP CDM tables. The data quality assessment, using OMOP's DataQualityDashboard, showed a 99% pass rate, indicating a robust and accurate data transformation.

Conclusion: This study presents a novel framework for transforming HL7 CDA documents into OMOP CDM using template definitions and CDA R-MIM. The methodology improves semantic interoperability, mapping reusability, and ETL design efficiency. Future work should focus on automating code generation, improving data profiling, and addressing cyclic dependencies within CDA templates. The presented approach supports improved secondary use of health data and research while adhering to standardized data models and semantics.

Discussion: Using CDA Templates for ETL design addresses common ETL challenges, such as data accessibility during ETL design, by decoupling the process from the actual CDA instances. Future work could focus on extending this approach to automatically generate boilerplate code, address cyclic dependencies within CDA Templates, and adapt the method for the use with FHIR profiles.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过利用CDA模板定义,将健康级别7临床文档架构(CDA)文档转换为观察性医疗结果合作伙伴关系公共数据模型(OMOP CDM)。
目的:本工作旨在开发一种将健康级别7 (HL7)临床文档架构(CDA)文档转化为观察性医疗结果合作伙伴关系(OMOP)公共数据模型(CDM)的方法。所描述的方法试图通过使用HL7 CDA模板定义和CDA精炼消息信息模型(CDA R-MIM)来改进提取、转换、加载(ETL)设计过程。材料和方法:我们的方法利用HL7 CDA模板来定义结构和语义映射。在CDA R-MIM与OMOP CDM语义一致的支持下,我们开发了一个名为CDA Rabbit的工具,它可以从HL7 CDA模板定义生成Rabbit- in -a- hat项目文件,并且可以成功地集成到OMOP周围的现有工具链中。结果:我们使用来自奥地利国家电子病历系统(ELGA)的13个CDA模板和430个匿名CDA测试文档来测试我们的方法,这些文档被映射到10个OMOP CDM表。使用OMOP的DataQualityDashboard进行的数据质量评估显示,通过率为99%,表明数据转换稳健且准确。结论:本研究提出了一种利用模板定义和CDA R-MIM将HL7 CDA文档转换为OMOP CDM的新框架。该方法提高了语义互操作性、映射可重用性和ETL设计效率。未来的工作应该集中在自动化代码生成、改进数据分析和处理CDA模板中的循环依赖。所提出的方法支持改进卫生数据和研究的二次使用,同时坚持标准化的数据模型和语义。讨论:在ETL设计中使用CDA模板解决了常见的ETL挑战,例如ETL设计期间的数据可访问性,通过将过程与实际的CDA实例解耦。未来的工作可以集中在扩展这种方法,以自动生成样板代码,在CDA模板中处理循环依赖,并使该方法适应FHIR配置文件的使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
JAMIA Open
JAMIA Open Medicine-Health Informatics
CiteScore
4.10
自引率
4.80%
发文量
102
审稿时长
16 weeks
期刊最新文献
Interpretability of an FDA-authorized AI/ML sepsis diagnostic tool improved by SHAP values. Electronic health record use factors linked to efficiency and productivity: an explainable machine learning analysis. Correction to: Biomedical data repositories require governance for artificial intelligence/machine learning applications at every step. SynNER: syntax-infused named entity recognition in the biomedical domain. RELEAP: reinforcement-enhanced label-efficient active phenotyping for electronic health records.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1