{"title":"超越消息传递:通过特征扰动实现半监督节点分类的图神经网络泛化","authors":"Yoonhyuk Choi;Jiho Choi;Taewook Ko;Chong-Kwon Kim","doi":"10.1109/TNNLS.2024.3472897","DOIUrl":null,"url":null,"abstract":"Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"36 6","pages":"10271-10282"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10767362","citationCount":"0","resultStr":"{\"title\":\"Beyond Message-Passing: Generalization of Graph Neural Networks via Feature Perturbation for Semi-Supervised Node Classification\",\"authors\":\"Yoonhyuk Choi;Jiho Choi;Taewook Ko;Chong-Kwon Kim\",\"doi\":\"10.1109/TNNLS.2024.3472897\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.\",\"PeriodicalId\":13303,\"journal\":{\"name\":\"IEEE transactions on neural networks and learning systems\",\"volume\":\"36 6\",\"pages\":\"10271-10282\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2024-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10767362\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on neural networks and learning systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10767362/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10767362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Beyond Message-Passing: Generalization of Graph Neural Networks via Feature Perturbation for Semi-Supervised Node Classification
Graph neural networks (GNNs) that collect information from neighbors are commonly utilized in semi-supervised learning contexts. In particular, a significant body of research has been dedicated to developing effective graph filters and aggregation methods to filter the information from adjacent nodes. Despite their efficacy, these approaches may encounter challenges due to the sparsity of training nodes, especially when their features are represented as sparse vectors (e.g., bag-of-words). This condition can lead to the overfitting of certain dimensions within the first projection matrix (hyperplane), as the training samples may not adequately represent the full spectrum of learnable parameters. To solve this limitation, we propose an innovative perturbation technique. Specifically, we introduce additional training variability by modifying both the initial features and the hyperplane, which contributes to the reduction of prediction variance by updating the entire dimensions. To the best of our knowledge, our approach is the first to address the overfitting issue in GNNs precipitated by sparse node features. Comprehensive experiments on real-world datasets and ablation studies affirm that our proposed method significantly enhances node classification performance, with improvements of up to 46.5% in GNN algorithms.
期刊介绍:
The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.