F. Buendía, Joaquín Gayoso-Cabada, J. A. J. Méndez, J. Sierra
{"title":"将非结构化的临床自由文本语料库转化为可重构的医学数字馆藏","authors":"F. Buendía, Joaquín Gayoso-Cabada, J. A. J. Méndez, J. Sierra","doi":"10.1109/CBMS.2019.00105","DOIUrl":null,"url":null,"abstract":"In this paper, we describe how to transform unstructured free-text clinical corpora, made from reports written in natural language and complementary assets (e.g., medical images, laboratory results, etc.), into collections of digital objects compatible with Clavy, a tool for managing reconfigurable digital collections. It will allow healthcare experts to subsequently reorganize the resulting collections to adapt them to their specific needs. The transformation will be achieved through the use of MetaMap, a robust tool for mapping clinical texts into the UMLS (Unified Medical Language System) thesaurus. Thus, by processing reports with MetaMap, we will be able to extract a significant set of corpus-specific UMLS terms, grouped according to relevant semantic types, which will be used to support a preliminary organization of the resources in the Clavy collection. We illustrate the viability of the approach with the generation of a reconfigurable Clavy collection from the Indiana Chest X-ray corpus of radiology reports and images. On the basis of this case study, we also discuss the strengths and weaknesses of the approach proposed.","PeriodicalId":311634,"journal":{"name":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","volume":"208 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Transforming Unstructured Clinical Free-Text Corpora into Reconfigurable Medical Digital Collections\",\"authors\":\"F. Buendía, Joaquín Gayoso-Cabada, J. A. J. Méndez, J. Sierra\",\"doi\":\"10.1109/CBMS.2019.00105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we describe how to transform unstructured free-text clinical corpora, made from reports written in natural language and complementary assets (e.g., medical images, laboratory results, etc.), into collections of digital objects compatible with Clavy, a tool for managing reconfigurable digital collections. It will allow healthcare experts to subsequently reorganize the resulting collections to adapt them to their specific needs. The transformation will be achieved through the use of MetaMap, a robust tool for mapping clinical texts into the UMLS (Unified Medical Language System) thesaurus. Thus, by processing reports with MetaMap, we will be able to extract a significant set of corpus-specific UMLS terms, grouped according to relevant semantic types, which will be used to support a preliminary organization of the resources in the Clavy collection. We illustrate the viability of the approach with the generation of a reconfigurable Clavy collection from the Indiana Chest X-ray corpus of radiology reports and images. On the basis of this case study, we also discuss the strengths and weaknesses of the approach proposed.\",\"PeriodicalId\":311634,\"journal\":{\"name\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"volume\":\"208 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2019.00105\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2019.00105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Transforming Unstructured Clinical Free-Text Corpora into Reconfigurable Medical Digital Collections
In this paper, we describe how to transform unstructured free-text clinical corpora, made from reports written in natural language and complementary assets (e.g., medical images, laboratory results, etc.), into collections of digital objects compatible with Clavy, a tool for managing reconfigurable digital collections. It will allow healthcare experts to subsequently reorganize the resulting collections to adapt them to their specific needs. The transformation will be achieved through the use of MetaMap, a robust tool for mapping clinical texts into the UMLS (Unified Medical Language System) thesaurus. Thus, by processing reports with MetaMap, we will be able to extract a significant set of corpus-specific UMLS terms, grouped according to relevant semantic types, which will be used to support a preliminary organization of the resources in the Clavy collection. We illustrate the viability of the approach with the generation of a reconfigurable Clavy collection from the Indiana Chest X-ray corpus of radiology reports and images. On the basis of this case study, we also discuss the strengths and weaknesses of the approach proposed.