Enhancing Knowledge Graphs with Data Representatives

André Pomp, Lucian Poth, Vadim Kraus, Tobias Meisen
International Conference on Enterprise Information Systems, published 2019-05-03
DOI: 10.5220/0007677400490060 (https://doi.org/10.5220/0007677400490060)
Citations: 4

Abstract

Due to the digitalization of many company processes and the increasing networking of devices, the number of data sources and corresponding data sets keeps growing. To make these data sets accessible, searchable and understandable, recent approaches focus on having domain experts create semantic models, which annotate the available data attributes with meaningful semantic concepts from knowledge graphs. Recommendation engines based on the data attribute labels can simplify this annotation process. However, as soon as the labels are incomprehensible, cryptic or ambiguous, the domain expert receives no support. In this paper, we propose a semantic concept recommendation for data attributes that is based on the data values rather than on the labels. To this end, we extend knowledge graphs with data instances so that they can learn different dedicated data representations. Using different approaches, such as machine learning, rules or statistical methods, enables us to recommend semantic concepts based on the content of data points rather than on the labels. Our evaluation with publicly available data sets shows that accuracy improves when using our flexible and dedicated classification approach. Further, we present shortcomings and extension points that we identified through the analysis of our evaluation.
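
The idea of recommending concepts from data values rather than labels can be illustrated with a minimal sketch. It is not the authors' implementation: the names `rule_representative`, `statistical_representative`, `recommend` and `REPRESENTATIVES`, the example concepts, the regular expressions and the three-standard-deviation tolerance are all assumptions made for illustration. Each concept is given one hypothetical "representative" (a rule or a simple statistic learned from known instances) that scores how well a column of raw values fits the concept.

```python
import re
import statistics
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical type: a representative scores a column of raw string values
# against one concept and returns a value in [0, 1].
Representative = Callable[[Sequence[str]], float]


def rule_representative(pattern: str) -> Representative:
    """Rule-based: fraction of values that fully match a regular expression."""
    compiled = re.compile(pattern)

    def score(values: Sequence[str]) -> float:
        if not values:
            return 0.0
        return sum(1 for v in values if compiled.fullmatch(v)) / len(values)

    return score


def statistical_representative(
    examples: Sequence[float], tolerance: float = 3.0
) -> Representative:
    """Statistical: fraction of values lying within `tolerance` standard
    deviations of the mean of previously seen instances (an assumed heuristic)."""
    mean = statistics.fmean(examples)
    stdev = statistics.pstdev(examples) or 1.0  # avoid a zero-width range

    def score(values: Sequence[str]) -> float:
        if not values:
            return 0.0
        hits = 0
        for v in values:
            try:
                number = float(v)
            except ValueError:
                continue  # non-numeric values count as misses
            if abs(number - mean) <= tolerance * stdev:
                hits += 1
        return hits / len(values)

    return score


def recommend(
    values: Sequence[str],
    representatives: Dict[str, Representative],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Rank concepts by how well their representative matches the values."""
    scored = [(concept, rep(values)) for concept, rep in representatives.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Illustrative concepts only; a real knowledge graph would supply these.
REPRESENTATIVES: Dict[str, Representative] = {
    "postal_code": rule_representative(r"\d{5}"),
    "email": rule_representative(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "temperature_celsius": statistical_representative([18.5, 21.0, 22.3, 19.8, 20.4]),
}

# A column whose label (for example "col_17") carries no usable meaning.
column = ["21.4", "19.9", "20.7", "22.0"]
print(recommend(column, REPRESENTATIVES, top_k=1))
```

Keeping every representative behind the same scoring signature is one way to let rules, statistics and learned classifiers coexist per concept, which mirrors the "flexible and dedicated" framing of the abstract without claiming to reproduce it.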