{"title":"Extraction and analysis of the structure of labels in biomedical ontologies","authors":"Manuel Quesada-Martínez, J. Fernández-breis, R. Stevens","doi":"10.1145/2389672.2389675","DOIUrl":null,"url":null,"abstract":"The increasing interest in biomedical ontologies has provoked the development of a significant number of ontologies, and many more are expected to be produced in the near future. A significant proportion of such ontologies have not been created by computer scientists or ontology engineers, but by domain experts. Many such ontologies are rich in implicit knowledge, but are really just plain taxonomies and controlled vocabularies, with little axiomatization.\n Many of these ontologies have much information within the labels of the classes. There is a great deal of knowledge about the entities described within such labels and text definitions held on classes; these are useful for human users, but not for machine processing. In previous work we proposed a process for enriching ontologies, which included the analysis of such labels, the identification of lexical patterns and the design of corresponding knowledge patterns. However, this process relied on manual intervention.\n In this paper we present a method to analyze and extract unused information contained in the structure of the labels in biomedical ontologies. The aim of this method is to improve the source ontology. The first step is the identification of lexical patterns based on repetitions of sets of words. Second, such lexical patterns will be examined in existing biomedical ontologies to identify whether those patterns are referencing existing ontological entities. Finally, the results obtained with relevant biomedical ontologies are presented and discussed.","PeriodicalId":91363,"journal":{"name":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","volume":"13 1","pages":"7-16"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2389672.2389675","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

The increasing interest in biomedical ontologies has provoked the development of a significant number of ontologies, and many more are expected to be produced in the near future. A significant proportion of such ontologies have not been created by computer scientists or ontology engineers, but by domain experts. Many such ontologies are rich in implicit knowledge, but are really just plain taxonomies and controlled vocabularies, with little axiomatization. Many of these ontologies have much information within the labels of the classes. There is a great deal of knowledge about the entities described within such labels and text definitions held on classes; these are useful for human users, but not for machine processing. In previous work we proposed a process for enriching ontologies, which included the analysis of such labels, the identification of lexical patterns and the design of corresponding knowledge patterns. However, this process relied on manual intervention. In this paper we present a method to analyze and extract unused information contained in the structure of the labels in biomedical ontologies. The aim of this method is to improve the source ontology. The first step is the identification of lexical patterns based on repetitions of sets of words. Second, such lexical patterns will be examined in existing biomedical ontologies to identify whether those patterns are referencing existing ontological entities. Finally, the results obtained with relevant biomedical ontologies are presented and discussed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
生物医学本体中标签结构的提取与分析
对生物医学本体日益增长的兴趣已经激发了大量本体的发展,并且预计在不久的将来会产生更多的本体。很大一部分这样的本体不是由计算机科学家或本体工程师创建的,而是由领域专家创建的。许多这样的本体具有丰富的隐含知识,但实际上只是简单的分类法和受控词汇表,几乎没有公理化。这些本体中的许多都在类的标签中包含很多信息。在类的标签和文本定义中描述的实体有大量的知识;这些对于人类用户是有用的,但对于机器处理则不是。在之前的工作中,我们提出了一个丰富本体的过程,包括对这些标签的分析、词汇模式的识别和相应知识模式的设计。然而,这个过程依赖于人工干预。本文提出了一种分析和提取生物医学本体中标签结构中未使用信息的方法。该方法的目的是改进源本体。第一步是根据单词组的重复来识别词汇模式。其次,这些词汇模式将在现有的生物医学本体中进行检查,以确定这些模式是否引用了现有的本体实体。最后,对相关生物医学本体的结果进行了介绍和讨论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
POSITIVE EDUCATION IN SECONDARY SCHOOLS AS A REQUIREMENT OF FORMATIVE EDUCATION CRAFTS CREATIVITY AND ITS DEVELOPMENT FROM THE PRIMARY SCHOOL TEACHERS` PERSPECTIVE COMPANIES´USAGE OF AI IN THE CZECH REPUBLIC TENDENCIES OF HUNGARIAN CHILDREN’S LITERATURE IN 2020 AND OTTÓ KISS’S CHILDREN’S MONOLOGUE TITLED A BÁTYÁM ÖCCSE [THE LITTLE BROTHER OF MY BIG BROTHER] IMPACT OF SHORT-TERM EDUCATIONAL ACTIVITIES ON PUPILS' ATTITUDES TOWARDS WASTE IN DISTANCE EDUCATION
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1