Evolution of a Graph Model for the OMOP Common Data Model.

IF 2.1 2区 医学 Q4 MEDICAL INFORMATICS Applied Clinical Informatics Pub Date : 2024-10-01 Epub Date: 2024-12-04 DOI:10.1055/s-0044-1791487
Mengjia Kang, Jose A Alvarado-Guzman, Luke V Rasmussen, Justin B Starren
{"title":"Evolution of a Graph Model for the OMOP Common Data Model.","authors":"Mengjia Kang, Jose A Alvarado-Guzman, Luke V Rasmussen, Justin B Starren","doi":"10.1055/s-0044-1791487","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong> Graph databases for electronic health record (EHR) data have become a useful tool for clinical research in recent years, but there is a lack of published methods to transform relational databases to a graph database schema. We developed a graph model for the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) that can be reused across research institutions.</p><p><strong>Methods: </strong> We created and evaluated four models, representing two different strategies, for converting the standardized clinical and vocabulary tables of OMOP into a property graph model within the Neo4j graph database. Taking the Successful Clinical Response in Pneumonia Therapy (SCRIPT) and Collaborative Resource for Intensive care Translational science, Informatics, Comprehensive Analytics, and Learning (CRITICAL) cohorts as test datasets with different sizes, we compared two of the resulting graph models with respect to database performance including database building time, query complexity, and runtime for both cohorts.</p><p><strong>Results: </strong> Utilizing a graph schema that was optimized for storing critical information as topology rather than attributes resulted in a significant improvement in both data creation and querying. The graph database for our larger cohort, CRITICAL, can be built within 1 hour for 134,145 patients, with a total of 749,011,396 nodes and 1,703,560,910 edges.</p><p><strong>Discussion: </strong> To our knowledge, this is the first generalized solution to convert the OMOP CDM to a graph-optimized schema. Despite being developed for studies at a single institution, the modeling method can be applied to other OMOP CDM v5.x databases. Our evaluation with the SCRIPT and CRITICAL cohorts and comparison between the current and previous versions show advantages in code simplicity, database building, and query speed.</p><p><strong>Conclusion: </strong> We developed a method for converting OMOP CDM databases into graph databases. Our experiments revealed that the final model outperformed the initial relational-to-graph transformation in both code simplicity and query efficiency, particularly for complex queries.</p>","PeriodicalId":48956,"journal":{"name":"Applied Clinical Informatics","volume":"15 5","pages":"1056-1065"},"PeriodicalIF":2.1000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11617070/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Clinical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/s-0044-1791487","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/4 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objective:  Graph databases for electronic health record (EHR) data have become a useful tool for clinical research in recent years, but there is a lack of published methods to transform relational databases to a graph database schema. We developed a graph model for the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) that can be reused across research institutions.

Methods:  We created and evaluated four models, representing two different strategies, for converting the standardized clinical and vocabulary tables of OMOP into a property graph model within the Neo4j graph database. Taking the Successful Clinical Response in Pneumonia Therapy (SCRIPT) and Collaborative Resource for Intensive care Translational science, Informatics, Comprehensive Analytics, and Learning (CRITICAL) cohorts as test datasets with different sizes, we compared two of the resulting graph models with respect to database performance including database building time, query complexity, and runtime for both cohorts.

Results:  Utilizing a graph schema that was optimized for storing critical information as topology rather than attributes resulted in a significant improvement in both data creation and querying. The graph database for our larger cohort, CRITICAL, can be built within 1 hour for 134,145 patients, with a total of 749,011,396 nodes and 1,703,560,910 edges.

Discussion:  To our knowledge, this is the first generalized solution to convert the OMOP CDM to a graph-optimized schema. Despite being developed for studies at a single institution, the modeling method can be applied to other OMOP CDM v5.x databases. Our evaluation with the SCRIPT and CRITICAL cohorts and comparison between the current and previous versions show advantages in code simplicity, database building, and query speed.

Conclusion:  We developed a method for converting OMOP CDM databases into graph databases. Our experiments revealed that the final model outperformed the initial relational-to-graph transformation in both code simplicity and query efficiency, particularly for complex queries.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向OMOP公共数据模型的图模型演化。
目的:电子健康记录(EHR)数据的图形数据库近年来已成为临床研究的有用工具,但缺乏将关系数据库转换为图形数据库模式的公开方法。我们为观察性医疗结果伙伴关系(OMOP)公共数据模型(CDM)开发了一个图形模型,该模型可以在研究机构之间重用。方法:我们创建并评估了四个模型,代表了两种不同的策略,用于将OMOP的标准化临床和词汇表转换为Neo4j图数据库中的属性图模型。以肺炎治疗的成功临床反应(SCRIPT)和重症监护转化科学、信息学、综合分析和学习的协作资源(CRITICAL)队列作为不同大小的测试数据集,我们比较了两种结果图模型在数据库性能方面的差异,包括数据库构建时间、查询复杂性和运行时间。结果:利用一个为将关键信息存储为拓扑而不是属性而优化的图模式,可以显著改善数据创建和查询。我们更大的队列的图形数据库CRITICAL可以在1小时内为134,145名患者建立,总共有749,011,396个节点和1,703,560,910条边。讨论:据我们所知,这是将OMOP CDM转换为图形优化模式的第一个通用解决方案。尽管该建模方法是为单个机构的研究而开发的,但它可以应用于其他OMOP CDM v5。x数据库。我们对SCRIPT和CRITICAL队列的评估以及当前和以前版本之间的比较显示出在代码简单性、数据库构建和查询速度方面的优势。结论:建立了一种将OMOP CDM数据库转换为图形数据库的方法。我们的实验表明,最终模型在代码简单性和查询效率方面优于初始的关系到图转换,特别是对于复杂查询。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Clinical Informatics
Applied Clinical Informatics MEDICAL INFORMATICS-
CiteScore
4.60
自引率
24.10%
发文量
132
期刊介绍: ACI is the third Schattauer journal dealing with biomedical and health informatics. It perfectly complements our other journals Öffnet internen Link im aktuellen FensterMethods of Information in Medicine and the Öffnet internen Link im aktuellen FensterYearbook of Medical Informatics. The Yearbook of Medical Informatics being the “Milestone” or state-of-the-art journal and Methods of Information in Medicine being the “Science and Research” journal of IMIA, ACI intends to be the “Practical” journal of IMIA.
期刊最新文献
Sharing a Hybrid EHR + FHIR CDS Tool Across Health Systems: Automating Smoking Cessation for Pediatric Caregivers. Application of an Externally Developed Algorithm to Identify Research Cases and Controls from Electronic Health Record Data: Failures and Successes. Association of an HIV-Prediction Model with Uptake of Pre-Exposure Prophylaxis (PrEP). Pediatric Predictive Artificial Intelligence Implemented in Clinical Practice from 2010-2021: A Systematic Review. Exploring Mixed Reality for Patient Education in Cerebral Angiograms: A Pilot Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1