Semi-automatic extraction and validation of concepts in ontology learning from texts in Spanish

Manuela Gómez-Suta, J. Echeverry-Correa, José A. Soto Mejía
{"title":"Semi-automatic extraction and validation of concepts in ontology learning from texts in Spanish","authors":"Manuela Gómez-Suta, J. Echeverry-Correa, José A. Soto Mejía","doi":"10.1145/3405962.3405977","DOIUrl":null,"url":null,"abstract":"The construction of ontologies from texts in Spanish is a challenge since this language lacks conceptual databases to validate abstract ontology structures as concepts and relations between them. The preceding generates the necessity of using manual evaluation by human experts; carrying high expenses that limit the calibration of algorithm parameters and large-scale evaluations. This document presents a proposal to evaluate abstract ontology structures through the task of semantic clustering of documents, without the expensive necessity of using manual evaluation or conceptual databases. The proposal is not only affordable but also applicable to model data and domains that lack structured knowledge resources. The experiments lead to the extraction and validation of the ontology structures from texts in Spanish regarding the domain of the Colombian armed conflict.","PeriodicalId":247414,"journal":{"name":"Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3405962.3405977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The construction of ontologies from texts in Spanish is a challenge since this language lacks conceptual databases to validate abstract ontology structures as concepts and relations between them. The preceding generates the necessity of using manual evaluation by human experts; carrying high expenses that limit the calibration of algorithm parameters and large-scale evaluations. This document presents a proposal to evaluate abstract ontology structures through the task of semantic clustering of documents, without the expensive necessity of using manual evaluation or conceptual databases. The proposal is not only affordable but also applicable to model data and domains that lack structured knowledge resources. The experiments lead to the extraction and validation of the ontology structures from texts in Spanish regarding the domain of the Colombian armed conflict.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
西班牙语文本本体学习中概念的半自动提取与验证
从西班牙语文本构建本体是一个挑战,因为这种语言缺乏概念性数据库来验证抽象本体结构作为概念和它们之间的关系。这就产生了由人类专家进行人工评估的必要性;高昂的费用限制了算法参数的校准和大规模的评估。本文提出了一种通过文档的语义聚类任务来评估抽象本体结构的建议,而不需要使用昂贵的人工评估或概念数据库。该建议不仅经济实惠,而且适用于缺乏结构化知识资源的模型数据和领域。实验从哥伦比亚武装冲突领域的西班牙语文本中提取和验证了本体结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Splitting the Web Analytics Atom: From Page Metrics and KPIs to Sub-Page Metrics and KPIs SciPuRe BREXIT Election: Forecasting a Conservative Party Victory through the Pound using ARIMA and Facebook's Prophet Concept Drift Detection on Data Stream for Revising DBSCAN Cluster Adaptive Error Prediction for Production Lines with Unknown Dependencies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1