The RDF2vec family of knowledge graph embedding methods

IF 4.7 3区 材料科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC ACS Applied Electronic Materials Pub Date : 2024-01-25 DOI:10.3233/sw-233514
Jan Portisch, Heiko Paulheim
{"title":"The RDF2vec family of knowledge graph embedding methods","authors":"Jan Portisch, Heiko Paulheim","doi":"10.3233/sw-233514","DOIUrl":null,"url":null,"abstract":"Knowledge graph embeddings represent a group of machine learning techniques which project entities and relations of a knowledge graph to continuous vector spaces. RDF2vec is a scalable embedding approach rooted in the combination of random walks with a language model. It has been successfully used in various applications. Recently, multiple variants to the RDF2vec approach have been proposed, introducing variations both on the walk generation and on the language modeling side. The combination of those different approaches has lead to an increasing family of RDF2vec variants. In this paper, we evaluate a total of twelve RDF2vec variants on a comprehensive set of benchmark models, and compare them to seven existing knowledge graph embedding methods from the family of link prediction approaches. Besides the established GEval benchmark introducing various downstream machine learning tasks on the DBpedia knowledge graph, we also use the new DLCC (Description Logic Class Constructors) benchmark consisting of two gold standards, one based on DBpedia, and one based on synthetically generated graphs. The latter allows for analyzing which ontological patterns in a knowledge graph can actually be learned by different embedding. With this evaluation, we observe that certain tailored RDF2vec variants can lead to improved performance on different downstream tasks, given the nature of the underlying problem, and that they, in particular, have a different behavior in modeling similarity and relatedness. The findings can be used to provide guidance in selecting a particular RDF2vec method for a given task.","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":"33 19","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3233/sw-233514","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 1

Abstract

Knowledge graph embeddings represent a group of machine learning techniques which project entities and relations of a knowledge graph to continuous vector spaces. RDF2vec is a scalable embedding approach rooted in the combination of random walks with a language model. It has been successfully used in various applications. Recently, multiple variants to the RDF2vec approach have been proposed, introducing variations both on the walk generation and on the language modeling side. The combination of those different approaches has lead to an increasing family of RDF2vec variants. In this paper, we evaluate a total of twelve RDF2vec variants on a comprehensive set of benchmark models, and compare them to seven existing knowledge graph embedding methods from the family of link prediction approaches. Besides the established GEval benchmark introducing various downstream machine learning tasks on the DBpedia knowledge graph, we also use the new DLCC (Description Logic Class Constructors) benchmark consisting of two gold standards, one based on DBpedia, and one based on synthetically generated graphs. The latter allows for analyzing which ontological patterns in a knowledge graph can actually be learned by different embedding. With this evaluation, we observe that certain tailored RDF2vec variants can lead to improved performance on different downstream tasks, given the nature of the underlying problem, and that they, in particular, have a different behavior in modeling similarity and relatedness. The findings can be used to provide guidance in selecting a particular RDF2vec method for a given task.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
知识图谱嵌入方法的 RDF2vec 系列
知识图谱嵌入是一组机器学习技术,可将知识图谱的实体和关系投射到连续向量空间。RDF2vec 是一种可扩展的嵌入方法,根植于随机行走与语言模型的结合。它已成功应用于多种领域。最近,人们提出了 RDF2vec 方法的多种变体,在随机游走生成和语言建模方面都引入了变化。这些不同方法的组合导致 RDF2vec 变体的数量不断增加。在本文中,我们在一组全面的基准模型上评估了总共十二种 RDF2vec 变体,并将它们与链接预测方法系列中的七种现有知识图嵌入方法进行了比较。除了在 DBpedia 知识图谱上引入各种下游机器学习任务的 GEval 基准外,我们还使用了新的 DLCC(描述逻辑类构造函数)基准,该基准由两个黄金标准组成,一个基于 DBpedia,另一个基于合成生成的图谱。后者可以分析知识图谱中哪些本体模式可以通过不同的嵌入学习到。通过这项评估,我们观察到,考虑到基础问题的性质,某些定制的 RDF2vec 变体可以在不同的下游任务中提高性能,尤其是在相似性和相关性建模方面有不同的表现。这些发现可为特定任务选择特定的 RDF2vec 方法提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.20
自引率
4.30%
发文量
567
期刊介绍: ACS Applied Electronic Materials is an interdisciplinary journal publishing original research covering all aspects of electronic materials. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrate knowledge in the areas of materials science, engineering, optics, physics, and chemistry into important applications of electronic materials. Sample research topics that span the journal's scope are inorganic, organic, ionic and polymeric materials with properties that include conducting, semiconducting, superconducting, insulating, dielectric, magnetic, optoelectronic, piezoelectric, ferroelectric and thermoelectric. Indexed/​Abstracted: Web of Science SCIE Scopus CAS INSPEC Portico
期刊最新文献
Issue Editorial Masthead Issue Publication Information Enhanced Zero-Bias Rectification in 1D Metal-Double-Insulator-Graphene Diodes for RF Energy Harvesting MnO2-Embedded PVDF-HFP/PEG Composite Membrane: A Multifunctional Approach for Electrochemical and Biomedical Systems Effect of Oxygen Substitution in the X Sublattice of Ti3C2Tx MXene on the Performance of Triboelectric Nanogenerators
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1