Haoran Yang, Junli Wang, Rui Duan, Changwei Wang, Chungang Yan
{"title":"Disentangled Active Learning on Graphs.","authors":"Haoran Yang, Junli Wang, Rui Duan, Changwei Wang, Chungang Yan","doi":"10.1016/j.neunet.2025.107130","DOIUrl":null,"url":null,"abstract":"<p><p>Active learning on graphs (ALG) has emerged as a compelling research field due to its capacity to address the challenge of label scarcity. Existing ALG methods incorporate diversity into their query strategies to maximize the gains from node sampling, improving robustness and reducing redundancy in graph learning. However, they often overlook the complex entanglement of latent factors inherent in graph-structured data. This oversight can lead to a sampling process that fails to ensure diversity at a finer-grained level, thereby missing the opportunity to sample more valuable nodes. To this end, we propose a novel approach, Disentangled Active Learning on Graphs (DALG). In this work, we first design the Disenconv-AL layer to learn disentangled feature embedding, then construct the influence graph for each node and create a dedicated \"memory list\" to store the resultant influence weights. On this basis, our approach aims to make the model not excessively focus on a few latent factors during the sampling phase. Specifically, we prioritize addressing latent factors with the most significant impact on the sampled node in the previous round, thereby ensuring that current sampling can better focus on other latent factors. Compared with existing methodologies, our approach pioneers reach diversity from the latent factor that drives the formation of graph data at a finer-grained level, thereby enabling further improvements in the benefits delivered with a limited labeling budget. Extensive experiments across eight public datasets show that DALG surpasses state-of-the-art graph active learning methods, achieving an improvement of up to approximately 15% in both Micro-F1 and Macro-F1.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"185 ","pages":"107130"},"PeriodicalIF":6.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2025.107130","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Active learning on graphs (ALG) has emerged as a compelling research field due to its capacity to address the challenge of label scarcity. Existing ALG methods incorporate diversity into their query strategies to maximize the gains from node sampling, improving robustness and reducing redundancy in graph learning. However, they often overlook the complex entanglement of latent factors inherent in graph-structured data. This oversight can lead to a sampling process that fails to ensure diversity at a finer-grained level, thereby missing the opportunity to sample more valuable nodes. To this end, we propose a novel approach, Disentangled Active Learning on Graphs (DALG). In this work, we first design the Disenconv-AL layer to learn disentangled feature embedding, then construct the influence graph for each node and create a dedicated "memory list" to store the resultant influence weights. On this basis, our approach aims to make the model not excessively focus on a few latent factors during the sampling phase. Specifically, we prioritize addressing latent factors with the most significant impact on the sampled node in the previous round, thereby ensuring that current sampling can better focus on other latent factors. Compared with existing methodologies, our approach pioneers reach diversity from the latent factor that drives the formation of graph data at a finer-grained level, thereby enabling further improvements in the benefits delivered with a limited labeling budget. Extensive experiments across eight public datasets show that DALG surpasses state-of-the-art graph active learning methods, achieving an improvement of up to approximately 15% in both Micro-F1 and Macro-F1.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.