Efficient Unsupervised Graph Embedding With Attributed Graph Reduction and Dual-Level Loss

IF 10.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Knowledge and Data Engineering Pub Date : 2024-07-31 DOI:10.1109/TKDE.2024.3436076

Ziyang Liu;Chaokun Wang;Hao Feng;Ziyang Chen

{"title":"Efficient Unsupervised Graph Embedding With Attributed Graph Reduction and Dual-Level Loss","authors":"Ziyang Liu;Chaokun Wang;Hao Feng;Ziyang Chen","doi":"10.1109/TKDE.2024.3436076","DOIUrl":null,"url":null,"abstract":"Graph embedding aims to extract low-dimensional representation vectors, commonly referred to as embeddings, from graph data. The generated embeddings simplify subsequent data analysis and machine learning tasks. Recently, researchers have proposed the use of contrastive learning on graphs to extract node embeddings in an unsupervised manner. Although existing graph contrastive learning methods have significantly advanced this field, there is still potential for further exploration, particularly in optimizing \n<italic>training efficiency</i>\n and enhancing \n<italic>embedding quality</i>\n. In this paper, we propose an efficient unsupervised graph embedding method named GEARED. First, the method involves an attributed graph reduction module that converts the raw graph into a reduced graph, greatly improving model training efficiency. Second, GEARED employs a dual-level loss with adaptive scaling factors to ensure the acquisition of high-quality embeddings. Finally, we conduct a partial derivative analysis to elucidate the specific mechanisms through which GEARED is capable of generating high-quality embeddings. Extensive experimental evaluations on 14 benchmark datasets show that GEARED consistently outperforms state-of-the-art methods in terms of training efficiency and classification accuracy. For instance, GEARED achieves a training speedup of over 40 times on both the CS and Physics datasets while maintaining superior classification accuracy.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"36 12","pages":"8120-8134"},"PeriodicalIF":10.4000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10616385/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Graph embedding aims to extract low-dimensional representation vectors, commonly referred to as embeddings, from graph data. The generated embeddings simplify subsequent data analysis and machine learning tasks. Recently, researchers have proposed the use of contrastive learning on graphs to extract node embeddings in an unsupervised manner. Although existing graph contrastive learning methods have significantly advanced this field, there is still potential for further exploration, particularly in optimizing training efficiency and enhancing embedding quality . In this paper, we propose an efficient unsupervised graph embedding method named GEARED. First, the method involves an attributed graph reduction module that converts the raw graph into a reduced graph, greatly improving model training efficiency. Second, GEARED employs a dual-level loss with adaptive scaling factors to ensure the acquisition of high-quality embeddings. Finally, we conduct a partial derivative analysis to elucidate the specific mechanisms through which GEARED is capable of generating high-quality embeddings. Extensive experimental evaluations on 14 benchmark datasets show that GEARED consistently outperforms state-of-the-art methods in terms of training efficiency and classification accuracy. For instance, GEARED achieves a training speedup of over 40 times on both the CS and Physics datasets while maintaining superior classification accuracy.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用归属图缩减和双级损失实现高效无监督图嵌入

图嵌入旨在从图数据中提取低维表示向量，通常称为嵌入。生成的嵌入可以简化后续的数据分析和机器学习任务。最近，研究人员提出在图中使用对比学习，以无监督的方式提取节点嵌入。虽然现有的图对比学习方法大大推进了这一领域的发展，但仍有进一步探索的潜力，尤其是在优化训练效率和提高嵌入质量方面。在本文中，我们提出了一种名为 GEARED 的高效无监督图嵌入方法。首先，该方法包含一个归因图缩减模块，可将原始图转换为缩减图，从而大大提高模型训练效率。其次，GEARED 采用了具有自适应缩放因子的双级损失，以确保获得高质量的嵌入。最后，我们进行了偏导数分析，以阐明 GEARED 能够生成高质量嵌入的具体机制。在 14 个基准数据集上进行的广泛实验评估表明，GEARED 在训练效率和分类准确性方面始终优于最先进的方法。例如，GEARED 在 CS 和物理数据集上的训练速度提高了 40 多倍，同时保持了出色的分类准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.

期刊最新文献

2025 Reviewers List Toward Federated Learning of Deep Graph Neural Networks HCGBot: Learning Homophilous Context Graphs for Twitter Bot Detection Optimizing KBQA by Correcting LLM-Generated Non-Executable Logical Form Through Knowledge-Assisted Path Reconstruction Privacy-Preserving Graph Similarity Matching Query Over Encrypted Graph Database