基于潜在空间模型生成链接的大规模信息网络嵌入评价

Shotaro Kawasaki, Ryosuke Motegi, Shogo Matsuno, Yoichi Seki
{"title":"基于潜在空间模型生成链接的大规模信息网络嵌入评价","authors":"Shotaro Kawasaki, Ryosuke Motegi, Shogo Matsuno, Yoichi Seki","doi":"10.1145/3520084.3520111","DOIUrl":null,"url":null,"abstract":"Graph representation learning encodes vertices as low-dimensional vectors that summarize their graph position and the structure of their local graph neighborhood. These methods give us beneficial representation in continuous space from big relational data. However, the algorithms are usually evaluated indirectly from the accuracy of applying the learning results to classification tasks because of not giving the correct answer when graph representation learning is applied. Therefore, this study proposes a method to evaluate graph representation learning algorithms by preparing correct learning results for the data by distributing objects in the latent space in advance and probabilistically generating relational graph data from the distributions in the latent space. Using this method, we evaluated LINE: Large-scale information network embedding, one of the most popular algorithms for learning graph representations. LINE consists of two algorithms optimizing two objective functions defined by first-order proximity and second-order proximity. We prepared two link-generating models suitable for these two objective functions and clarified that the corresponding LINE algorithm performed well for the link data generated by each model.","PeriodicalId":444957,"journal":{"name":"Proceedings of the 2022 5th International Conference on Software Engineering and Information Management","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Evaluation of Large-scale Information Network Embedding based on Latent Space Model Generating Links\",\"authors\":\"Shotaro Kawasaki, Ryosuke Motegi, Shogo Matsuno, Yoichi Seki\",\"doi\":\"10.1145/3520084.3520111\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph representation learning encodes vertices as low-dimensional vectors that summarize their graph position and the structure of their local graph neighborhood. These methods give us beneficial representation in continuous space from big relational data. However, the algorithms are usually evaluated indirectly from the accuracy of applying the learning results to classification tasks because of not giving the correct answer when graph representation learning is applied. Therefore, this study proposes a method to evaluate graph representation learning algorithms by preparing correct learning results for the data by distributing objects in the latent space in advance and probabilistically generating relational graph data from the distributions in the latent space. Using this method, we evaluated LINE: Large-scale information network embedding, one of the most popular algorithms for learning graph representations. LINE consists of two algorithms optimizing two objective functions defined by first-order proximity and second-order proximity. We prepared two link-generating models suitable for these two objective functions and clarified that the corresponding LINE algorithm performed well for the link data generated by each model.\",\"PeriodicalId\":444957,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Software Engineering and Information Management\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Software Engineering and Information Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3520084.3520111\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Software Engineering and Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3520084.3520111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

图表示学习将顶点编码为低维向量,总结了它们的图位置和局部图邻域的结构。这些方法为我们从大关系数据中获得连续空间的表示提供了有益的途径。然而,由于在应用图表示学习时不能给出正确的答案,因此通常从将学习结果应用于分类任务的准确性来间接评价算法。因此,本研究提出了一种评估图表示学习算法的方法,通过预先在潜在空间中分布对象,并从潜在空间的分布中概率地生成关系图数据,为数据准备正确的学习结果。使用这种方法,我们评估了LINE:大规模信息网络嵌入,这是学习图表示最流行的算法之一。LINE由两种算法组成,分别对一阶接近度和二阶接近度定义的两个目标函数进行优化。我们针对这两个目标函数准备了两个适合的链路生成模型,并阐明了对应的LINE算法对于每个模型生成的链路数据都表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Evaluation of Large-scale Information Network Embedding based on Latent Space Model Generating Links
Graph representation learning encodes vertices as low-dimensional vectors that summarize their graph position and the structure of their local graph neighborhood. These methods give us beneficial representation in continuous space from big relational data. However, the algorithms are usually evaluated indirectly from the accuracy of applying the learning results to classification tasks because of not giving the correct answer when graph representation learning is applied. Therefore, this study proposes a method to evaluate graph representation learning algorithms by preparing correct learning results for the data by distributing objects in the latent space in advance and probabilistically generating relational graph data from the distributions in the latent space. Using this method, we evaluated LINE: Large-scale information network embedding, one of the most popular algorithms for learning graph representations. LINE consists of two algorithms optimizing two objective functions defined by first-order proximity and second-order proximity. We prepared two link-generating models suitable for these two objective functions and clarified that the corresponding LINE algorithm performed well for the link data generated by each model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
HSAACE: Design a Cloud Platform Health Status Assessment Application to Support Continuous Evolution of Assessment Capabilities Development of Real-Time Hand Gesture for Volume Control Application using Python on Raspberry Pi Adapting the Scrum Framework to the Needs of Virtual Teams of Game Developers with Multi-site Members Impact of Remote Working During Covid-19 Pandemic on Scrum Team: Experts View on Indonesian E-Commerce Companies Case Analysis Factors that Influence the Increasing of Generation Z's Interest in Using Social Media as the Implementation of Online to Offline and Offline to Online Business Model in Pandemic Era at Indonesia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1