Josceli M Tenorio, Fabrício Landi de Moraes, Ivan Torres Pisa
{"title":"CHV.br: Exploratory study for the development of a consumer health vocabulary (CHV) supported by a network model for Brazilian Portuguese language","authors":"Josceli M Tenorio, Fabrício Landi de Moraes, Ivan Torres Pisa","doi":"10.1177/01655515231196391","DOIUrl":null,"url":null,"abstract":"Successful consumer health vocabulary (CHV) models have been engineered and updated by using automatic term extraction techniques from online content. However, the relationship between terms has yet to be mapped. This study aims to describe a CHV model for the Brazilian Portuguese language that is supported by a complex network. The method was split up into three distinct stages: (1) collect and automatically extract terms from structured data sources on the web, such as Unified Medical Language System (UMLS) vocabularies and DBpedia; (2) construct a complex network; and (3) select the terms supported by clustering techniques. A model called CHV.br was developed and supported by a complex network structure which makes connections between the controlled vocabulary and consumer vocabulary and maps semantic relationships as categories, synonyms and related terms. CHV.br contains 146,956 terms, of which 31,439 are UMLS preferred terms and 83,279 are synonyms. The CHV.br is available and powered by Simple Knowledge Organization System and Resource Description Framework standards. The method used in this study showed to be valid for the selection of the candidate terms by connecting the terms from different reliable resources, in addition to expanding the number of terms and their semantic relationships. The content and structure of CHV.br could play a vital role in enhancing the development of consumer-oriented health applications.","PeriodicalId":54796,"journal":{"name":"Journal of Information Science","volume":"55 1","pages":"0"},"PeriodicalIF":1.8000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/01655515231196391","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Successful consumer health vocabulary (CHV) models have been engineered and updated by using automatic term extraction techniques from online content. However, the relationship between terms has yet to be mapped. This study aims to describe a CHV model for the Brazilian Portuguese language that is supported by a complex network. The method was split up into three distinct stages: (1) collect and automatically extract terms from structured data sources on the web, such as Unified Medical Language System (UMLS) vocabularies and DBpedia; (2) construct a complex network; and (3) select the terms supported by clustering techniques. A model called CHV.br was developed and supported by a complex network structure which makes connections between the controlled vocabulary and consumer vocabulary and maps semantic relationships as categories, synonyms and related terms. CHV.br contains 146,956 terms, of which 31,439 are UMLS preferred terms and 83,279 are synonyms. The CHV.br is available and powered by Simple Knowledge Organization System and Resource Description Framework standards. The method used in this study showed to be valid for the selection of the candidate terms by connecting the terms from different reliable resources, in addition to expanding the number of terms and their semantic relationships. The content and structure of CHV.br could play a vital role in enhancing the development of consumer-oriented health applications.
期刊介绍:
The Journal of Information Science is a peer-reviewed international journal of high repute covering topics of interest to all those researching and working in the sciences of information and knowledge management. The Editors welcome material on any aspect of information science theory, policy, application or practice that will advance thinking in the field.