超越消息传递：通过特征扰动实现半监督节点分类的图神经网络泛化

IF 9.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE transactions on neural networks and learning systems Pub Date : 2024-11-25 DOI:10.1109/TNNLS.2024.3472897

Yoonhyuk Choi;Jiho Choi;Taewook Ko;Chong-Kwon Kim

{"title":"超越消息传递：通过特征扰动实现半监督节点分类的图神经网络泛化","authors":"Yoonhyuk Choi;Jiho Choi;Taewook Ko;Chong-Kwon Kim","doi":"10.1109/TNNLS.2024.3472897","DOIUrl":null,"url":null,"abstract":"Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"36 6","pages":"10271-10282"},"PeriodicalIF":9.7000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10767362","citationCount":"0","resultStr":"{\"title\":\"Beyond Message-Passing: Generalization of Graph Neural Networks via Feature Perturbation for Semi-Supervised Node Classification\",\"authors\":\"Yoonhyuk Choi;Jiho Choi;Taewook Ko;Chong-Kwon Kim\",\"doi\":\"10.1109/TNNLS.2024.3472897\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.\",\"PeriodicalId\":13303,\"journal\":{\"name\":\"IEEE transactions on neural networks and learning systems\",\"volume\":\"36 6\",\"pages\":\"10271-10282\"},\"PeriodicalIF\":9.7000,\"publicationDate\":\"2024-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10767362\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on neural networks and learning systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10767362/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10767362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

图神经网络（gnn）从邻居中收集信息，通常用于半监督学习环境。特别是，大量的研究致力于开发有效的图过滤器和聚合方法来过滤相邻节点的信息。尽管这些方法很有效，但由于训练节点的稀疏性，特别是当它们的特征被表示为稀疏向量（例如，词袋）时，这些方法可能会遇到挑战。这种情况可能导致第一个投影矩阵（超平面）内某些维度的过拟合，因为训练样本可能无法充分代表可学习参数的全部范围。为了解决这一限制，我们提出了一种创新的微扰技术。具体来说，我们通过修改初始特征和超平面来引入额外的训练可变性，这有助于通过更新整个维度来减少预测方差。据我们所知，我们的方法是第一个解决由稀疏节点特征沉淀的gnn过拟合问题的方法。在真实数据集和消融研究上的综合实验证实，我们提出的方法显著提高了节点分类性能，与GNN算法相比，改进幅度高达46.5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Beyond Message-Passing: Generalization of Graph Neural Networks via Feature Perturbation for Semi-Supervised Node Classification

Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.