Manuel Quesada-Martínez, J. Fernández-breis, R. Stevens
{"title":"生物医学本体中标签结构的提取与分析","authors":"Manuel Quesada-Martínez, J. Fernández-breis, R. Stevens","doi":"10.1145/2389672.2389675","DOIUrl":null,"url":null,"abstract":"The increasing interest in biomedical ontologies has provoked the development of a significant number of ontologies, and many more are expected to be produced in the near future. A significant proportion of such ontologies have not been created by computer scientists or ontology engineers, but by domain experts. Many such ontologies are rich in implicit knowledge, but are really just plain taxonomies and controlled vocabularies, with little axiomatization.\n Many of these ontologies have much information within the labels of the classes. There is a great deal of knowledge about the entities described within such labels and text definitions held on classes; these are useful for human users, but not for machine processing. In previous work we proposed a process for enriching ontologies, which included the analysis of such labels, the identification of lexical patterns and the design of corresponding knowledge patterns. However, this process relied on manual intervention.\n In this paper we present a method to analyze and extract unused information contained in the structure of the labels in biomedical ontologies. The aim of this method is to improve the source ontology. The first step is the identification of lexical patterns based on repetitions of sets of words. Second, such lexical patterns will be examined in existing biomedical ontologies to identify whether those patterns are referencing existing ontological entities. Finally, the results obtained with relevant biomedical ontologies are presented and discussed.","PeriodicalId":91363,"journal":{"name":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","volume":"13 1","pages":"7-16"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Extraction and analysis of the structure of labels in biomedical ontologies\",\"authors\":\"Manuel Quesada-Martínez, J. Fernández-breis, R. Stevens\",\"doi\":\"10.1145/2389672.2389675\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing interest in biomedical ontologies has provoked the development of a significant number of ontologies, and many more are expected to be produced in the near future. A significant proportion of such ontologies have not been created by computer scientists or ontology engineers, but by domain experts. Many such ontologies are rich in implicit knowledge, but are really just plain taxonomies and controlled vocabularies, with little axiomatization.\\n Many of these ontologies have much information within the labels of the classes. There is a great deal of knowledge about the entities described within such labels and text definitions held on classes; these are useful for human users, but not for machine processing. In previous work we proposed a process for enriching ontologies, which included the analysis of such labels, the identification of lexical patterns and the design of corresponding knowledge patterns. However, this process relied on manual intervention.\\n In this paper we present a method to analyze and extract unused information contained in the structure of the labels in biomedical ontologies. The aim of this method is to improve the source ontology. The first step is the identification of lexical patterns based on repetitions of sets of words. Second, such lexical patterns will be examined in existing biomedical ontologies to identify whether those patterns are referencing existing ontological entities. Finally, the results obtained with relevant biomedical ontologies are presented and discussed.\",\"PeriodicalId\":91363,\"journal\":{\"name\":\"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...\",\"volume\":\"13 1\",\"pages\":\"7-16\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2389672.2389675\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2389672.2389675","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extraction and analysis of the structure of labels in biomedical ontologies
The increasing interest in biomedical ontologies has provoked the development of a significant number of ontologies, and many more are expected to be produced in the near future. A significant proportion of such ontologies have not been created by computer scientists or ontology engineers, but by domain experts. Many such ontologies are rich in implicit knowledge, but are really just plain taxonomies and controlled vocabularies, with little axiomatization.
Many of these ontologies have much information within the labels of the classes. There is a great deal of knowledge about the entities described within such labels and text definitions held on classes; these are useful for human users, but not for machine processing. In previous work we proposed a process for enriching ontologies, which included the analysis of such labels, the identification of lexical patterns and the design of corresponding knowledge patterns. However, this process relied on manual intervention.
In this paper we present a method to analyze and extract unused information contained in the structure of the labels in biomedical ontologies. The aim of this method is to improve the source ontology. The first step is the identification of lexical patterns based on repetitions of sets of words. Second, such lexical patterns will be examined in existing biomedical ontologies to identify whether those patterns are referencing existing ontological entities. Finally, the results obtained with relevant biomedical ontologies are presented and discussed.