Semantic enrichment of Pomeranian health study data using LOINC and WHO-FIC terminology mapping principles.

IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES JAMIA Open Pub Date : 2025-03-06 eCollection Date: 2025-04-01 DOI:10.1093/jamiaopen/ooaf010
Esther Thea Inau, Dörte Radke, Linda Bird, Susanne Westphal, Till Ittermann, Christian Schäfer, Matthias Nauck, Atinkut Alamirrew Zeleke, Carsten Oliver Schmidt, Dagmar Waltemath
{"title":"Semantic enrichment of Pomeranian health study data using LOINC and WHO-FIC terminology mapping principles.","authors":"Esther Thea Inau, Dörte Radke, Linda Bird, Susanne Westphal, Till Ittermann, Christian Schäfer, Matthias Nauck, Atinkut Alamirrew Zeleke, Carsten Oliver Schmidt, Dagmar Waltemath","doi":"10.1093/jamiaopen/ooaf010","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To semantically enrich the laboratory data dictionary of the Study of Health in Pomerania (SHIP), a population-based cohort study, with LOINC to achieve better compliance with the FAIR principles for data stewardship.</p><p><strong>Materials and methods: </strong>We employed a workflow that maps codes from the SHIP-START-4 laboratory data dictionary to LOINC codes following the terminology mapping principles and best practices recommended by the World Health Organization Family of International Classifications (WHO-FIC) Network.</p><p><strong>Results: </strong>We were able to annotate 71 out of 72 (98.6%) of the source codes in the SHIP-START-4 laboratory data dictionary with LOINC codes. 32 source codes were mapped to a single LOINC code (cardinality 1:1) and 39 resulted in a complex mapping. All of the successful mappings are equivalent (=) matches.</p><p><strong>Discussion: </strong>We increased the FAIRness of the SHIP laboratory data dictionary by semantically enriching laboratory items with links to an accessible, established, and machine-readable language for knowledge representation (LOINC). Our mapping improves semantic data retrieval and integration. However, not all clinically and significantly relevant data are included in the LOINC code. Therefore, these missing aspects have to be considered in data interpretation as well.</p><p><strong>Conclusion: </strong>Semantically enriching the SHIP-START-4 laboratory data dictionary has contributed to its improved data interoperability and reuse. We recommend that data owners and standardization experts collaboratively perform annotations before data collection starts instead of doing this retrospectively. These experiences may inform the development of standard operating procedures for annotating data dictionaries developed for other population-based cohort studies.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 2","pages":"ooaf010"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11884810/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMIA Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamiaopen/ooaf010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: To semantically enrich the laboratory data dictionary of the Study of Health in Pomerania (SHIP), a population-based cohort study, with LOINC to achieve better compliance with the FAIR principles for data stewardship.

Materials and methods: We employed a workflow that maps codes from the SHIP-START-4 laboratory data dictionary to LOINC codes following the terminology mapping principles and best practices recommended by the World Health Organization Family of International Classifications (WHO-FIC) Network.

Results: We were able to annotate 71 out of 72 (98.6%) of the source codes in the SHIP-START-4 laboratory data dictionary with LOINC codes. 32 source codes were mapped to a single LOINC code (cardinality 1:1) and 39 resulted in a complex mapping. All of the successful mappings are equivalent (=) matches.

Discussion: We increased the FAIRness of the SHIP laboratory data dictionary by semantically enriching laboratory items with links to an accessible, established, and machine-readable language for knowledge representation (LOINC). Our mapping improves semantic data retrieval and integration. However, not all clinically and significantly relevant data are included in the LOINC code. Therefore, these missing aspects have to be considered in data interpretation as well.

Conclusion: Semantically enriching the SHIP-START-4 laboratory data dictionary has contributed to its improved data interoperability and reuse. We recommend that data owners and standardization experts collaboratively perform annotations before data collection starts instead of doing this retrospectively. These experiences may inform the development of standard operating procedures for annotating data dictionaries developed for other population-based cohort studies.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用LOINC和WHO-FIC术语映射原则对波美拉尼亚健康研究数据进行语义丰富。
目的:用LOINC对波美拉尼亚健康研究(SHIP)这一基于人群的队列研究的实验室数据词典进行语义丰富,以更好地遵守FAIR数据管理原则。材料和方法:我们采用了一个工作流,按照世界卫生组织国际分类族(WHO-FIC)网络推荐的术语映射原则和最佳做法,将ship -起始-4实验室数据字典中的代码映射到LOINC代码。结果:在SHIP-START-4实验数据字典中,72个源代码中有71个(98.6%)用LOINC代码进行了注释。32个源代码被映射到一个LOINC代码(基数1:1),39个源代码被映射成一个复杂的映射。所有成功的映射都是等价的(=)匹配。讨论:我们增加了SHIP实验室数据字典的公平性,方法是在语义上丰富实验室条目,并链接到一种可访问的、已建立的、机器可读的知识表示语言(LOINC)。我们的映射改进了语义数据检索和集成。然而,并非所有临床和重要相关的数据都包含在LOINC代码中。因此,在数据解释中也必须考虑这些缺失的方面。结论:从语义上丰富SHIP-START-4实验室数据词典有助于提高其数据互操作性和重用性。我们建议数据所有者和标准化专家在数据收集开始之前协作执行注释,而不是回顾性地执行注释。这些经验可以为为其他基于人群的队列研究开发的注释数据字典的标准操作程序的开发提供信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
JAMIA Open
JAMIA Open Medicine-Health Informatics
CiteScore
4.10
自引率
4.80%
发文量
102
审稿时长
16 weeks
期刊最新文献
Establishing the Digital Health Equity & Literacy Program (D-HELP): a student-led initiative to address digital health literacy gaps among emergency department patients at rush. Assessment of the adequacy of the Fast Healthcare Interoperability Resources (FHIR) Genomics standard for the representation of somatic testing reports. Interpretability of an FDA-authorized AI/ML sepsis diagnostic tool improved by SHAP values. Electronic health record use factors linked to efficiency and productivity: an explainable machine learning analysis. Correction to: Biomedical data repositories require governance for artificial intelligence/machine learning applications at every step.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1