{"title":"Defining health data elements under the HL7 development framework for metadata management","authors":"Yang, Zhe, Jiang, Kun, Lou, Miaomiao, Gong, Yang, Zhang, Lili, Liu, Jing, Bao, Xinyu, Liu, Danhong, Yang, Peng","doi":"10.1186/s13326-022-00265-5","DOIUrl":null,"url":null,"abstract":"Health data from different specialties or domains generallly have diverse formats and meanings, which can cause semantic communication barriers when these data are exchanged among heterogeneous systems. As such, this study is intended to develop a national health concept data model (HCDM) and develop a corresponding system to facilitate healthcare data standardization and centralized metadata management. Based on 55 data sets (4640 data items) from 7 health business domains in China, a bottom-up approach was employed to build the structure and metadata for HCDM by referencing HL7 RIM. According to ISO/IEC 11179, a top-down approach was used to develop and standardize the data elements. HCDM adopted three-level architecture of class, attribute and data type, and consisted of 6 classes and 15 sub-classes. Each class had a set of descriptive attributes and every attribute was assigned a data type. 100 initial data elements (DEs) were extracted from HCDM and 144 general DEs were derived from corresponding initial DEs. Domain DEs were transformed by specializing general DEs using 12 controlled vocabularies which developed from HL7 vocabularies and actual health demands. A model-based system was successfully established to evaluate and manage the NHDD. HCDM provided a unified metadata reference for multi-source data standardization and management. This approach of defining health data elements was a feasible solution in healthcare information standardization to enable healthcare interoperability in China.","PeriodicalId":15055,"journal":{"name":"Journal of Biomedical Semantics","volume":"24 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Semantics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s13326-022-00265-5","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 2
Abstract
Health data from different specialties or domains generallly have diverse formats and meanings, which can cause semantic communication barriers when these data are exchanged among heterogeneous systems. As such, this study is intended to develop a national health concept data model (HCDM) and develop a corresponding system to facilitate healthcare data standardization and centralized metadata management. Based on 55 data sets (4640 data items) from 7 health business domains in China, a bottom-up approach was employed to build the structure and metadata for HCDM by referencing HL7 RIM. According to ISO/IEC 11179, a top-down approach was used to develop and standardize the data elements. HCDM adopted three-level architecture of class, attribute and data type, and consisted of 6 classes and 15 sub-classes. Each class had a set of descriptive attributes and every attribute was assigned a data type. 100 initial data elements (DEs) were extracted from HCDM and 144 general DEs were derived from corresponding initial DEs. Domain DEs were transformed by specializing general DEs using 12 controlled vocabularies which developed from HL7 vocabularies and actual health demands. A model-based system was successfully established to evaluate and manage the NHDD. HCDM provided a unified metadata reference for multi-source data standardization and management. This approach of defining health data elements was a feasible solution in healthcare information standardization to enable healthcare interoperability in China.
来自不同专业或领域的健康数据通常具有不同的格式和含义,当这些数据在异构系统之间交换时,可能会造成语义通信障碍。因此,本研究旨在建立国家健康概念数据模型(HCDM)并开发相应的系统,以促进医疗数据标准化和元数据的集中管理。基于来自中国7个健康业务领域的55个数据集(4640个数据项),采用自底向上的方法,参照HL7 RIM构建HCDM的结构和元数据。根据ISO/IEC 11179,采用了自顶向下的方法来开发和标准化数据元素。HCDM采用类、属性、数据类型三级架构,由6个类和15个子类组成。每个类都有一组描述性属性,每个属性都被分配了一个数据类型。从HCDM中提取了100个初始数据元素(initial data elements, DEs),并由相应的初始数据元素衍生出144个通用数据元素(general data elements),利用HL7词汇表和实际健康需求发展而来的12个受控词汇表,对通用数据元素进行专门化转换。成功建立了一个基于模型的NHDD评估与管理系统。HCDM为多源数据的标准化和管理提供了统一的元数据参考。这种定义健康数据元素的方法是实现中国医疗保健互操作性的医疗保健信息标准化的可行解决方案。
期刊介绍:
Journal of Biomedical Semantics addresses issues of semantic enrichment and semantic processing in the biomedical domain. The scope of the journal covers two main areas:
Infrastructure for biomedical semantics: focusing on semantic resources and repositories, meta-data management and resource description, knowledge representation and semantic frameworks, the Biomedical Semantic Web, and semantic interoperability.
Semantic mining, annotation, and analysis: focusing on approaches and applications of semantic resources; and tools for investigation, reasoning, prediction, and discoveries in biomedicine.