Disentangled Active Learning on Graphs.

IF 6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neural Networks Pub Date : 2025-01-09 DOI:10.1016/j.neunet.2025.107130

Haoran Yang, Junli Wang, Rui Duan, Changwei Wang, Chungang Yan

{"title":"Disentangled Active Learning on Graphs.","authors":"Haoran Yang, Junli Wang, Rui Duan, Changwei Wang, Chungang Yan","doi":"10.1016/j.neunet.2025.107130","DOIUrl":null,"url":null,"abstract":"<p><p>Active learning on graphs (ALG) has emerged as a compelling research field due to its capacity to address the challenge of label scarcity. Existing ALG methods incorporate diversity into their query strategies to maximize the gains from node sampling, improving robustness and reducing redundancy in graph learning. However, they often overlook the complex entanglement of latent factors inherent in graph-structured data. This oversight can lead to a sampling process that fails to ensure diversity at a finer-grained level, thereby missing the opportunity to sample more valuable nodes. To this end, we propose a novel approach, Disentangled Active Learning on Graphs (DALG). In this work, we first design the Disenconv-AL layer to learn disentangled feature embedding, then construct the influence graph for each node and create a dedicated \"memory list\" to store the resultant influence weights. On this basis, our approach aims to make the model not excessively focus on a few latent factors during the sampling phase. Specifically, we prioritize addressing latent factors with the most significant impact on the sampled node in the previous round, thereby ensuring that current sampling can better focus on other latent factors. Compared with existing methodologies, our approach pioneers reach diversity from the latent factor that drives the formation of graph data at a finer-grained level, thereby enabling further improvements in the benefits delivered with a limited labeling budget. Extensive experiments across eight public datasets show that DALG surpasses state-of-the-art graph active learning methods, achieving an improvement of up to approximately 15% in both Micro-F1 and Macro-F1.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"185 ","pages":"107130"},"PeriodicalIF":6.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2025.107130","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Active learning on graphs (ALG) has emerged as a compelling research field due to its capacity to address the challenge of label scarcity. Existing ALG methods incorporate diversity into their query strategies to maximize the gains from node sampling, improving robustness and reducing redundancy in graph learning. However, they often overlook the complex entanglement of latent factors inherent in graph-structured data. This oversight can lead to a sampling process that fails to ensure diversity at a finer-grained level, thereby missing the opportunity to sample more valuable nodes. To this end, we propose a novel approach, Disentangled Active Learning on Graphs (DALG). In this work, we first design the Disenconv-AL layer to learn disentangled feature embedding, then construct the influence graph for each node and create a dedicated "memory list" to store the resultant influence weights. On this basis, our approach aims to make the model not excessively focus on a few latent factors during the sampling phase. Specifically, we prioritize addressing latent factors with the most significant impact on the sampled node in the previous round, thereby ensuring that current sampling can better focus on other latent factors. Compared with existing methodologies, our approach pioneers reach diversity from the latent factor that drives the formation of graph data at a finer-grained level, thereby enabling further improvements in the benefits delivered with a limited labeling budget. Extensive experiments across eight public datasets show that DALG surpasses state-of-the-art graph active learning methods, achieving an improvement of up to approximately 15% in both Micro-F1 and Macro-F1.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

图上的解纠缠主动学习。

图上主动学习（ALG）已经成为一个引人注目的研究领域，因为它能够解决标签稀缺的挑战。现有的ALG方法将多样性纳入其查询策略中，以最大限度地提高节点采样的收益，提高图学习的鲁棒性并减少冗余。然而，他们往往忽略了图结构数据中固有的潜在因素的复杂纠缠。这种疏忽可能导致采样过程无法确保细粒度级别上的多样性，从而失去对更有价值的节点进行采样的机会。为此，我们提出了一种新的方法——图上解纠缠主动学习（DALG）。在这项工作中，我们首先设计disenconvo - al层来学习解纠缠的特征嵌入，然后为每个节点构建影响图，并创建一个专用的“记忆列表”来存储得到的影响权重。在此基础上，我们的方法旨在使模型在采样阶段不会过度关注少数潜在因素。具体来说，我们优先处理对前一轮采样节点影响最大的潜在因素，从而确保当前采样能够更好地关注其他潜在因素。与现有的方法相比，我们的方法从驱动更细粒度的图形数据形成的潜在因素中获得了多样性，从而在有限的标签预算下实现了进一步的改进。在8个公共数据集上进行的广泛实验表明，DALG超越了最先进的图主动学习方法，在Micro-F1和Macro-F1中都实现了大约15%的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Neural Networks 工程技术-计算机：人工智能

CiteScore

13.90

自引率

7.70%

发文量

425

审稿时长

67 days

期刊介绍： Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.