首页 > 最新文献

IEEE transactions on neural networks and learning systems最新文献

英文 中文
Monte Carlo Marginalization: A Differentiable Method to Learn High-Dimensional Distributions. 蒙特卡罗边缘化:一种学习高维分布的可微方法。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-23 DOI: 10.1109/tnnls.2026.3682991
Chenqiu Zhao,Guanfang Dong,Anup Basu
Learning intractable distributions in high-dimensional spaces remains a fundamental challenge. While prevalent deep learning methods often rely on restrictive prior assumptions, we propose a novel differentiable method that approximates intractable distributions using a Gaussian mixture model (GMM) by minimizing Kullback-Leibler (KL) divergence. In particular, a novel Monte Carlo marginalization (MCMarg) method is proposed to address the computational complexity of the KL divergence, which is unacceptable in a high-dimensional space. In addition, kernel density estimation (KDE) is utilized to ensure the differentiability of the optimization process because the target distribution is intractable. The proposed approach is a powerful and differentiable tool for learning complex distributions, which shifts the paradigm from network-dependent approximation to direct, network-free distribution learning. Comprehensive experiments demonstrate the superior properties of the proposed approach. By replacing standard priors in pretrained VAEs, our method achieves a significant improvement of approximately 10 points in FID scores. Remarkably, the model enables image generation without using a neural network, achieving an FID of 22 on the MNIST dataset. On the CIFAR-10 benchmark, our method achieves an FID score of 2.69, outperforming several state-of-the-art deep generative models. To the best of our knowledge, the proposed MCMarg is the first attempt at image generation without using a deep learning network.
学习高维空间中难以处理的分布仍然是一个根本性的挑战。虽然流行的深度学习方法通常依赖于限制性先验假设,但我们提出了一种新的可微方法,该方法通过最小化Kullback-Leibler (KL)散度,使用高斯混合模型(GMM)近似难处理分布。特别是,提出了一种新的蒙特卡罗边缘化(MCMarg)方法来解决KL散度的计算复杂性,这在高维空间中是不可接受的。此外,由于目标分布难以处理,因此利用核密度估计(KDE)来确保优化过程的可微性。所提出的方法是学习复杂分布的强大且可微分的工具,它将范式从依赖网络的近似转移到直接的,无网络的分布学习。综合实验证明了该方法的优越性。通过在预训练的vae中替换标准先验,我们的方法在FID评分中取得了大约10分的显着提高。值得注意的是,该模型可以在不使用神经网络的情况下生成图像,在MNIST数据集上实现了FID为22。在CIFAR-10基准测试中,我们的方法达到了2.69的FID分数,优于几个最先进的深度生成模型。据我们所知,提出的MCMarg是第一次尝试在不使用深度学习网络的情况下生成图像。
{"title":"Monte Carlo Marginalization: A Differentiable Method to Learn High-Dimensional Distributions.","authors":"Chenqiu Zhao,Guanfang Dong,Anup Basu","doi":"10.1109/tnnls.2026.3682991","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3682991","url":null,"abstract":"Learning intractable distributions in high-dimensional spaces remains a fundamental challenge. While prevalent deep learning methods often rely on restrictive prior assumptions, we propose a novel differentiable method that approximates intractable distributions using a Gaussian mixture model (GMM) by minimizing Kullback-Leibler (KL) divergence. In particular, a novel Monte Carlo marginalization (MCMarg) method is proposed to address the computational complexity of the KL divergence, which is unacceptable in a high-dimensional space. In addition, kernel density estimation (KDE) is utilized to ensure the differentiability of the optimization process because the target distribution is intractable. The proposed approach is a powerful and differentiable tool for learning complex distributions, which shifts the paradigm from network-dependent approximation to direct, network-free distribution learning. Comprehensive experiments demonstrate the superior properties of the proposed approach. By replacing standard priors in pretrained VAEs, our method achieves a significant improvement of approximately 10 points in FID scores. Remarkably, the model enables image generation without using a neural network, achieving an FID of 22 on the MNIST dataset. On the CIFAR-10 benchmark, our method achieves an FID score of 2.69, outperforming several state-of-the-art deep generative models. To the best of our knowledge, the proposed MCMarg is the first attempt at image generation without using a deep learning network.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"25 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147735246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLIP Graph Adaptor: A Dual-Graph Adapted Visual-Language Model for Weakly Supervised Semantic Segmentation. CLIP图适配器:用于弱监督语义分割的双图适应视觉语言模型。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-23 DOI: 10.1109/tnnls.2026.3683363
Jia Zhang,Bo Peng,Xi Wu,Chenchen He
Recent advancements in weakly supervised semantic segmentation (WSSS) have shown promise by using the contrastive language-image pretraining (CLIP) model to generate pseudo-labels. However, directly applying the CLIP model without considering interclass relationships in downstream tasks has resulted in suboptimal transferability and generalization. To address these challenges, we propose CLIP graph adapter (CLIP-GA), a novel approach that integrates both textual and visual structural knowledge to generate high-quality initial class activation maps (CAMs) for each object class. Our method introduces a dual-graph adaptive strategy, comprising a textual subgraph and a visual subgraph and employs cross-modal graph attention (CGA) for effective fusion. The framework includes three specialized loss functions that help to capture more complete object regions while minimizing the activation of background areas closely related to foreground objects. In addition, we implement the superpixel consistency to refine pseudo-labels and introduce a graph reasoning attention (GRA) module to build global contextual relationships within visual features for the segmentation network. Extensive experiments on the PASCAL VOC 2012 and MS COCO 2014 datasets have convincingly demonstrated the effectiveness of CLIP-GA compared with other state-of-the-art methods. Our code is provided at: https://github.com/JIA-ZHANG666/CLIP-GA.
弱监督语义分割(WSSS)的最新进展表明,使用对比语言图像预训练(CLIP)模型来生成伪标签是有希望的。然而,直接应用CLIP模型而不考虑下游任务中的类间关系导致了次优的可转移性和泛化。为了解决这些挑战,我们提出了CLIP图形适配器(CLIP- ga),这是一种集成文本和视觉结构知识的新方法,可以为每个对象类生成高质量的初始类激活图(CAMs)。该方法引入了一种双图自适应策略,包括文本子图和视觉子图,并采用跨模态图注意(CGA)进行有效融合。该框架包括三个专门的损失函数,有助于捕获更完整的目标区域,同时最大限度地减少与前景目标密切相关的背景区域的激活。此外,我们实现了超像素一致性来改进伪标签,并引入了图推理注意(GRA)模块来构建分割网络的视觉特征之间的全局上下文关系。在PASCAL VOC 2012和MS COCO 2014数据集上进行的大量实验令人信服地证明了CLIP-GA与其他最先进方法相比的有效性。我们的代码提供在:https://github.com/JIA-ZHANG666/CLIP-GA。
{"title":"CLIP Graph Adaptor: A Dual-Graph Adapted Visual-Language Model for Weakly Supervised Semantic Segmentation.","authors":"Jia Zhang,Bo Peng,Xi Wu,Chenchen He","doi":"10.1109/tnnls.2026.3683363","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3683363","url":null,"abstract":"Recent advancements in weakly supervised semantic segmentation (WSSS) have shown promise by using the contrastive language-image pretraining (CLIP) model to generate pseudo-labels. However, directly applying the CLIP model without considering interclass relationships in downstream tasks has resulted in suboptimal transferability and generalization. To address these challenges, we propose CLIP graph adapter (CLIP-GA), a novel approach that integrates both textual and visual structural knowledge to generate high-quality initial class activation maps (CAMs) for each object class. Our method introduces a dual-graph adaptive strategy, comprising a textual subgraph and a visual subgraph and employs cross-modal graph attention (CGA) for effective fusion. The framework includes three specialized loss functions that help to capture more complete object regions while minimizing the activation of background areas closely related to foreground objects. In addition, we implement the superpixel consistency to refine pseudo-labels and introduce a graph reasoning attention (GRA) module to build global contextual relationships within visual features for the segmentation network. Extensive experiments on the PASCAL VOC 2012 and MS COCO 2014 datasets have convincingly demonstrated the effectiveness of CLIP-GA compared with other state-of-the-art methods. Our code is provided at: https://github.com/JIA-ZHANG666/CLIP-GA.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"27 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147735248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FedSKD: Aggregation-Free Model-Heterogeneous Federated Learning via Multidimensional Similarity Knowledge Distillation for Medical Image Classification. 基于多维相似知识蒸馏的无聚集模型异构联邦学习医学图像分类。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-22 DOI: 10.1109/tnnls.2026.3684321
Ziqiao Weng,Weidong Cai,Bo Zhou
Federated learning (FL) enables privacy-preserving collaborative model training without direct data sharing. Model-heterogeneous FL (MHFL) enables clients to train personalized models with heterogeneous architectures, but existing methods mainly rely on centralized aggregation or require partially identical architectures, limiting scalability and efficiency. Current peer-to-peer (P2P) FL frameworks, though removing server dependence, have not been adapted to heterogeneous models and suffer from model drift and knowledge dilution. To address these challenges, we propose FedSKD, a novel P2P MHFL framework for medical image classification that facilitates direct knowledge exchange through round-robin model circulation, eliminating the need for centralized aggregation while allowing fully heterogeneous model architectures across clients. FedSKD's key innovation lies in multidimensional similarity knowledge distillation (SKD), which enables bidirectional cross-client knowledge transfer at batch, pixel/voxel, and region levels for heterogeneous models in FL. This approach mitigates catastrophic forgetting and model drift through progressive reinforcement and distribution alignment while preserving model heterogeneity. Extensive evaluations on fMRI-based autism spectrum disorder (ASD) diagnosis and skin lesion classification demonstrate that FedSKD outperforms state-of-the-art heterogeneous and homogeneous FL baselines, achieving superior personalization and cross-institutional generalization. These findings underscore FedSKD's potential as a scalable and robust solution for real-world medical FL.
联邦学习(FL)可以在不直接共享数据的情况下实现保护隐私的协作模型训练。模型异构FL (MHFL)使客户能够使用异构体系结构训练个性化模型,但是现有的方法主要依赖于集中式聚合或需要部分相同的体系结构,限制了可伸缩性和效率。当前的点对点(P2P) FL框架虽然消除了对服务器的依赖,但没有适应异构模型,存在模型漂移和知识稀释的问题。为了解决这些挑战,我们提出了FedSKD,这是一种用于医学图像分类的新型P2P MHFL框架,通过循环模型循环促进直接知识交换,消除了集中聚合的需要,同时允许跨客户端完全异构的模型架构。FedSKD的关键创新在于多维相似知识升华(SKD),它可以在批处理、像素/体素和区域级别上实现双向跨客户端知识转移。这种方法通过渐进强化和分布对齐来减轻灾难性遗忘和模型漂移,同时保持模型异质性。基于fmri的自闭症谱系障碍(ASD)诊断和皮肤病变分类的广泛评估表明,FedSKD优于最先进的异质和同质FL基线,实现了卓越的个性化和跨机构推广。这些发现强调了FedSKD作为现实世界医疗FL的可扩展和强大解决方案的潜力。
{"title":"FedSKD: Aggregation-Free Model-Heterogeneous Federated Learning via Multidimensional Similarity Knowledge Distillation for Medical Image Classification.","authors":"Ziqiao Weng,Weidong Cai,Bo Zhou","doi":"10.1109/tnnls.2026.3684321","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3684321","url":null,"abstract":"Federated learning (FL) enables privacy-preserving collaborative model training without direct data sharing. Model-heterogeneous FL (MHFL) enables clients to train personalized models with heterogeneous architectures, but existing methods mainly rely on centralized aggregation or require partially identical architectures, limiting scalability and efficiency. Current peer-to-peer (P2P) FL frameworks, though removing server dependence, have not been adapted to heterogeneous models and suffer from model drift and knowledge dilution. To address these challenges, we propose FedSKD, a novel P2P MHFL framework for medical image classification that facilitates direct knowledge exchange through round-robin model circulation, eliminating the need for centralized aggregation while allowing fully heterogeneous model architectures across clients. FedSKD's key innovation lies in multidimensional similarity knowledge distillation (SKD), which enables bidirectional cross-client knowledge transfer at batch, pixel/voxel, and region levels for heterogeneous models in FL. This approach mitigates catastrophic forgetting and model drift through progressive reinforcement and distribution alignment while preserving model heterogeneity. Extensive evaluations on fMRI-based autism spectrum disorder (ASD) diagnosis and skin lesion classification demonstrate that FedSKD outperforms state-of-the-art heterogeneous and homogeneous FL baselines, achieving superior personalization and cross-institutional generalization. These findings underscore FedSKD's potential as a scalable and robust solution for real-world medical FL.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"23 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147733980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computing Node Failure Prediction Based on Continuous-Time Dynamic Graph. 基于连续时间动态图的计算节点故障预测。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-22 DOI: 10.1109/tnnls.2026.3684886
Binbin Huang,Teng Bao,Feiyi Chen,Lingbin Wang,Xunqing Huang,Yuyu Yin,Xiaoying Shi,Shangguang Wang,Shuiguang Deng
The growth of large models demands multinode cooperation during training and inference processes. The computing node failures can interrupt these processes, subsequently causing information loss and prolonging the execution time. To reduce the prohibitively large overhead incurred by the computing nodes failures, the accurate prediction of computing node failure is vital, which can help to avert potential large overhead, service interruptions, and negative customer experiences. Existing solutions of computing nodes failure prediction mainly focus on utilizing state-of-the-art time-series models to enhance the performance of computing node failure prediction. However, on the one hand, they could not capture the causal relationship between device over-utilization and node failures; On the other hand, they fail to extract the complex spatial-temporal cascading correlations among computing node failure events. These limits can degrade the performance of computing node failure prediction. To address these above problems, this article makes an effort to focus on designing a continuous-time dynamic graphs-based computing node failures prediction (CTDG-NFP) scheme, to accurately predict in dynamic cluster environments. Specifically, the CTDG-NFP scheme first designs a novel multiple-dimensional feature-biased neighbor sampling method, which jointly considers CPU utilization-biased, memory utilization-biased, temporal-biased and spatial-biased, to sample relevant context. Then, the CTDG-NFP scheme extracts diverse computing node failure motifs by multiple-dimensional feature-biased-based long-short-path walk method and set-based anonymization method. Finally, the CTDG-NFP scheme adopts time encoder to encode these motifs, and thereby extracting the complex spatial-temporal correlations among computing node failure events. On this basis, contrastive learning is adopted to train the computing node failure prediction model. Extensive evaluations with various real-world failure traces demonstrate the CTDG-NFP scheme can achieve superior performance in terms of six widely used performance metrics compared with the SOTA node failure prediction methods.
大型模型的增长需要在训练和推理过程中进行多节点合作。计算节点故障会导致这些进程中断,造成信息丢失,延长执行时间。为了减少计算节点故障带来的巨大开销,准确预测计算节点故障至关重要,这有助于避免潜在的巨大开销、服务中断和负面的客户体验。现有的计算节点故障预测方案主要是利用最先进的时间序列模型来提高计算节点故障预测的性能。然而,一方面,他们无法捕捉到设备过度使用和节点故障之间的因果关系;另一方面,它们无法提取计算节点故障事件之间复杂的时空级联关系。这些限制会降低计算节点故障预测的性能。为了解决上述问题,本文重点设计了一种基于连续时间动态图的计算节点故障预测(CTDG-NFP)方案,以便在动态集群环境下进行准确预测。具体而言,CTDG-NFP方案首先设计了一种新颖的多维特征偏差邻居采样方法,该方法联合考虑CPU利用率偏差、内存利用率偏差、时间偏差和空间偏差,对相关上下文进行采样。然后,CTDG-NFP方案采用基于多维特征偏差的长-短路径行走法和基于集合的匿名化方法提取不同的计算节点故障基元;最后,CTDG-NFP方案采用时间编码器对这些基元进行编码,从而提取计算节点故障事件之间复杂的时空相关性。在此基础上,采用对比学习训练计算节点故障预测模型。对各种实际故障轨迹的广泛评估表明,与SOTA节点故障预测方法相比,CTDG-NFP方案在6个广泛使用的性能指标方面具有优越的性能。
{"title":"Computing Node Failure Prediction Based on Continuous-Time Dynamic Graph.","authors":"Binbin Huang,Teng Bao,Feiyi Chen,Lingbin Wang,Xunqing Huang,Yuyu Yin,Xiaoying Shi,Shangguang Wang,Shuiguang Deng","doi":"10.1109/tnnls.2026.3684886","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3684886","url":null,"abstract":"The growth of large models demands multinode cooperation during training and inference processes. The computing node failures can interrupt these processes, subsequently causing information loss and prolonging the execution time. To reduce the prohibitively large overhead incurred by the computing nodes failures, the accurate prediction of computing node failure is vital, which can help to avert potential large overhead, service interruptions, and negative customer experiences. Existing solutions of computing nodes failure prediction mainly focus on utilizing state-of-the-art time-series models to enhance the performance of computing node failure prediction. However, on the one hand, they could not capture the causal relationship between device over-utilization and node failures; On the other hand, they fail to extract the complex spatial-temporal cascading correlations among computing node failure events. These limits can degrade the performance of computing node failure prediction. To address these above problems, this article makes an effort to focus on designing a continuous-time dynamic graphs-based computing node failures prediction (CTDG-NFP) scheme, to accurately predict in dynamic cluster environments. Specifically, the CTDG-NFP scheme first designs a novel multiple-dimensional feature-biased neighbor sampling method, which jointly considers CPU utilization-biased, memory utilization-biased, temporal-biased and spatial-biased, to sample relevant context. Then, the CTDG-NFP scheme extracts diverse computing node failure motifs by multiple-dimensional feature-biased-based long-short-path walk method and set-based anonymization method. Finally, the CTDG-NFP scheme adopts time encoder to encode these motifs, and thereby extracting the complex spatial-temporal correlations among computing node failure events. On this basis, contrastive learning is adopted to train the computing node failure prediction model. Extensive evaluations with various real-world failure traces demonstrate the CTDG-NFP scheme can achieve superior performance in terms of six widely used performance metrics compared with the SOTA node failure prediction methods.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"246 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147733983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forgettable Federated Linear Learning With Certified Data Unlearning 可遗忘的联邦线性学习与认证数据遗忘
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-22 DOI: 10.1109/tnnls.2026.3683398
Ruinan Jin, Minghui Chen, Qiong Zhang, Xiaoxiao Li
{"title":"Forgettable Federated Linear Learning With Certified Data Unlearning","authors":"Ruinan Jin, Minghui Chen, Qiong Zhang, Xiaoxiao Li","doi":"10.1109/tnnls.2026.3683398","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3683398","url":null,"abstract":"","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"22 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147735977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Node Classification in GNNs: Impact of Neighborhood Label Distribution on Homophily and Heterophily. gnn中的节点分类:邻域标签分布对同质性和异质性的影响。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-22 DOI: 10.1109/tnnls.2026.3680732
Zhili Zhao,Li Wan,Xupeng Liu,Ruiyi Yan,Shaomeng Wang
In node classification, traditional graph neural networks (GNNs) typically assume implicit homophily, indicating that intraclass nodes are likely connected. However, real-world graphs frequently exhibit heterophily, in which interclass nodes are also commonly connected. To address this challenge, recent methods have adopted approaches such as expanding local neighborhoods and employing adaptive message aggregation to enhance the GNN performance on heterophily graphs. Nevertheless, these methods are restricted by the homophily assumption and fail to effectively capture long-range dependencies (e.g., widely separated intraclass nodes) and insufficiently leverage the graph topology. This study investigates the performance differences of GNN when it is applied to both homophily and heterophily graphs and finds that the distinguishability of neighborhood label distributions (NLDs) exhibits a significant correlation with the accuracy of node classification. To assess the impact of NLD on node classification, this study proposes a novel homophily metric based on node distinguishability. Subsequently, this study introduces a new GNN model named NLD-based GNN (NLDGNN) for node classification. First, NLDGNN initializes node representations by integrating node features with node NLDs. To address long-range dependencies in heterophily graphs, NLDGNN utilizes the global label relationship matrix with low-rank characteristics for global message passing. By combining the attention scores derived from the initial node representations, NLDGNN constructs the global label relationship matrix for enhanced message passing, thereby improving the expressiveness of node representations. Experimental results indicate that NLDGNN outperforms existing GNN models on both real-world homophily and heterophily graphs. The code of this study is available at https://github.com/wanli6/NLDGNN.
在节点分类中,传统的图神经网络(gnn)通常假设隐式同态,这表明类内节点可能是连通的。然而,现实世界的图经常表现出异构性,其中类间节点也通常是连接的。为了解决这一挑战,最近的方法采用了扩展局部邻域和采用自适应消息聚合等方法来提高GNN在异质性图上的性能。然而,这些方法受到同态假设的限制,不能有效地捕获远程依赖关系(例如,广泛分离的类内节点),也不能充分利用图拓扑。本文研究了GNN在同态图和异态图上的性能差异,发现邻域标签分布(nld)的可分辨性与节点分类的准确性有显著的相关性。为了评估NLD对节点分类的影响,本研究提出了一种基于节点可分辨性的同态度量。随后,本研究引入了一种新的GNN模型,称为基于nld的GNN (NLDGNN),用于节点分类。首先,NLDGNN通过将节点特征与节点nld集成来初始化节点表示。为了解决异质性图中的远程依赖关系,NLDGNN利用具有低秩特征的全局标签关系矩阵进行全局消息传递。NLDGNN通过结合初始节点表示得到的关注分数,构建全局标签关系矩阵,增强消息传递,从而提高节点表示的表达性。实验结果表明,NLDGNN在真实世界同态图和异态图上都优于现有的GNN模型。本研究的代码可在https://github.com/wanli6/NLDGNN上获得。
{"title":"Node Classification in GNNs: Impact of Neighborhood Label Distribution on Homophily and Heterophily.","authors":"Zhili Zhao,Li Wan,Xupeng Liu,Ruiyi Yan,Shaomeng Wang","doi":"10.1109/tnnls.2026.3680732","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3680732","url":null,"abstract":"In node classification, traditional graph neural networks (GNNs) typically assume implicit homophily, indicating that intraclass nodes are likely connected. However, real-world graphs frequently exhibit heterophily, in which interclass nodes are also commonly connected. To address this challenge, recent methods have adopted approaches such as expanding local neighborhoods and employing adaptive message aggregation to enhance the GNN performance on heterophily graphs. Nevertheless, these methods are restricted by the homophily assumption and fail to effectively capture long-range dependencies (e.g., widely separated intraclass nodes) and insufficiently leverage the graph topology. This study investigates the performance differences of GNN when it is applied to both homophily and heterophily graphs and finds that the distinguishability of neighborhood label distributions (NLDs) exhibits a significant correlation with the accuracy of node classification. To assess the impact of NLD on node classification, this study proposes a novel homophily metric based on node distinguishability. Subsequently, this study introduces a new GNN model named NLD-based GNN (NLDGNN) for node classification. First, NLDGNN initializes node representations by integrating node features with node NLDs. To address long-range dependencies in heterophily graphs, NLDGNN utilizes the global label relationship matrix with low-rank characteristics for global message passing. By combining the attention scores derived from the initial node representations, NLDGNN constructs the global label relationship matrix for enhanced message passing, thereby improving the expressiveness of node representations. Experimental results indicate that NLDGNN outperforms existing GNN models on both real-world homophily and heterophily graphs. The code of this study is available at https://github.com/wanli6/NLDGNN.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"25 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147733981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic Abductive Reasoning via Diverse Relation Representations in Vector-Symbolic Architecture 向量符号建筑中基于不同关系表示的系统溯因推理
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-22 DOI: 10.1109/tnnls.2026.3684958
Zhong-Hua Sun, Ru-Yuan Zhang, Zonglei Zhen, Da-Hui Wang, Yong-Jie Li, Xiaohong Wan, Hongzhi You
{"title":"Systematic Abductive Reasoning via Diverse Relation Representations in Vector-Symbolic Architecture","authors":"Zhong-Hua Sun, Ru-Yuan Zhang, Zonglei Zhen, Da-Hui Wang, Yong-Jie Li, Xiaohong Wan, Hongzhi You","doi":"10.1109/tnnls.2026.3684958","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3684958","url":null,"abstract":"","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"55 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147735976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable and Efficient Deep Reinforcement Learning-Based Model Checker for Computation Tree Logic. 基于深度强化学习的可扩展高效计算树逻辑模型检查器。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-21 DOI: 10.1109/tnnls.2026.3683573
Ghalya Alwhishi,Jamal Bentahar,Amine Andam,Ahmed Elwhishi,Mustapha Hedabou
Formal verification using temporal logics such as computation tree logic (CTL) is essential for validating safety and correctness in complex systems. However, traditional model-checking techniques face severe scalability limitations due to the state explosion problem and their reliance on exhaustive symbolic traversal. Moreover, existing learning-based verification methods often lack formal guarantees and interpretability. These challenges create a pressing need for scalable, learning-based verification methods that preserve verification reliability while improving computational efficiency. This article introduces a novel deep reinforcement learning (DRL)-based model checking framework that learns to verify CTL formulas directly through interaction with system models. Unlike traditional symbolic model checkers such as NuSMV, the proposed DRL-CTL checker trained using proximal policy optimization (PPO) interprets CTL semantics over system models represented as Kripke structures without performing symbolic state-space traversal at inference time. Reward functions are designed for individual CTL operators, and fixed-point reasoning is incorporated to handle global temporal properties such as $AG(phi)$ and $EG(phi)$ . Experimental results show that the proposed method achieves near-constant inference time of approximately 2 ms per formula on an Intel Core i9-13900K CPU (24 cores, 3.0 GHz), 64 GB RAM, NVIDIA RTX 4090 GPU (24 GB VRAM), reduces verification time by up to 90% compared with traditional model checkers, and scales to models with more than $10^{1192}$ reachable states. The framework also produces witnesses and counterexamples and yields verification outcomes identical to those of symbolic checkers in our experiments. These results highlight the potential of DRL to serve as a scalable, efficient, and explainable alternative to classical CTL model checking.
使用时间逻辑(如计算树逻辑(CTL))进行形式化验证对于验证复杂系统的安全性和正确性至关重要。然而,由于状态爆炸问题和对穷举符号遍历的依赖,传统的模型检查技术面临严重的可扩展性限制。此外,现有的基于学习的验证方法往往缺乏正式的保证和可解释性。这些挑战产生了对可扩展的、基于学习的验证方法的迫切需求,这些方法在保持验证可靠性的同时提高了计算效率。本文介绍了一种新的基于深度强化学习(DRL)的模型检查框架,该框架通过与系统模型的交互学习直接验证CTL公式。与传统的符号模型检查器(如NuSMV)不同,本文提出的DRL-CTL检查器使用近端策略优化(PPO)训练,可以在Kripke结构表示的系统模型上解释CTL语义,而无需在推理时执行符号状态空间遍历。奖励函数是为单个CTL操作符设计的,并结合了定点推理来处理全局时间属性,如$AG(phi)$和$EG(phi)$。实验结果表明,该方法在Intel Core i9-13900K CPU(24核,3.0 GHz), 64 GB RAM, NVIDIA RTX 4090 GPU (24 GB VRAM)上实现了近似恒定的推理时间,每个公式约为2 ms,与传统模型检查器相比,验证时间缩短了90%,并且可扩展到超过$10^{1192}$可达状态的模型。该框架还产生了证人和反例,并产生了与我们实验中符号检查器相同的验证结果。这些结果突出了DRL作为经典CTL模型检查的可扩展、高效和可解释的替代方案的潜力。
{"title":"Scalable and Efficient Deep Reinforcement Learning-Based Model Checker for Computation Tree Logic.","authors":"Ghalya Alwhishi,Jamal Bentahar,Amine Andam,Ahmed Elwhishi,Mustapha Hedabou","doi":"10.1109/tnnls.2026.3683573","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3683573","url":null,"abstract":"Formal verification using temporal logics such as computation tree logic (CTL) is essential for validating safety and correctness in complex systems. However, traditional model-checking techniques face severe scalability limitations due to the state explosion problem and their reliance on exhaustive symbolic traversal. Moreover, existing learning-based verification methods often lack formal guarantees and interpretability. These challenges create a pressing need for scalable, learning-based verification methods that preserve verification reliability while improving computational efficiency. This article introduces a novel deep reinforcement learning (DRL)-based model checking framework that learns to verify CTL formulas directly through interaction with system models. Unlike traditional symbolic model checkers such as NuSMV, the proposed DRL-CTL checker trained using proximal policy optimization (PPO) interprets CTL semantics over system models represented as Kripke structures without performing symbolic state-space traversal at inference time. Reward functions are designed for individual CTL operators, and fixed-point reasoning is incorporated to handle global temporal properties such as $AG(phi)$ and $EG(phi)$ . Experimental results show that the proposed method achieves near-constant inference time of approximately 2 ms per formula on an Intel Core i9-13900K CPU (24 cores, 3.0 GHz), 64 GB RAM, NVIDIA RTX 4090 GPU (24 GB VRAM), reduces verification time by up to 90% compared with traditional model checkers, and scales to models with more than $10^{1192}$ reachable states. The framework also produces witnesses and counterexamples and yields verification outcomes identical to those of symbolic checkers in our experiments. These results highlight the potential of DRL to serve as a scalable, efficient, and explainable alternative to classical CTL model checking.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"13 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147731258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness, and Stability 随机LQR的完全数据驱动值迭代:收敛性、鲁棒性和稳定性
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-21 DOI: 10.1109/tnnls.2026.3675892
Leilei Cui, Zhong-Ping Jiang, Petter N. Kolm, Grégoire G. Macqueron
{"title":"A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness, and Stability","authors":"Leilei Cui, Zhong-Ping Jiang, Petter N. Kolm, Grégoire G. Macqueron","doi":"10.1109/tnnls.2026.3675892","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3675892","url":null,"abstract":"","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"21 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147731795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiscale Graph Redefining: Correlation-Based Multiscale Graph Clustering Network for Human Motion Prediction. 多尺度图重定义:基于关联的多尺度图聚类网络用于人体运动预测。
IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-04-21 DOI: 10.1109/tnnls.2026.3684128
Jianqi Zhong,Junyu Shi,Wenming Cao
Graph Convolutional Networks (GCNs) have exhibited considerable promise in 3-D skeleton-based human motion prediction. Based on the intuitive observation that human motion can be delineated through the physical interconnections among human joints, many previous works have designed multiscale graphs to learn the relationships and constraints between different graph scales, obtaining encouraging results for human motion prediction. However, these fixed multiscale graphs obtain new scale graphs by merging adjacent human joint information, ignoring implicit semantic information during dynamic movements. Furthermore, human joint correlations tend to vary randomly as the depth of the multiscale clustering graph increases, which contradicts the design concept of fixed multiscale graphs. To address these limitations, we explore a novel correlation-based multiscale graph clustering network (CMGC) for adaptive multiscale graph representation learning. Given a human joints graph, the goal of CMGC is first to generate more new graphs representing motion correlations adaptively at different scale levels and then selectively restore the derived graph scales to the original human joints graphs, which enables various motion features extraction. Moreover, we introduce the discrete wavelet transform (DWT) to compensate for the signal loss caused by discrete cosine transform (DCT) domain modeling from human motion. The CMGC gives rise to gratifying performances with the adaptive multiscale graph. Extensive experiments reveal that CMGC outperforms state-of-the-art methods by 11.2%, 10.1%, and 11.2% of 3-D mean per joint position error (MPJPE) on average on Human 3.6M, CMU Mocap, and 3DPW datasets, respectively. We also test the mean angle error (MAE) on Human3.6M, which is lower by 6.5% than previous methods. Our code is released at https://github.com/JunyuShi02/CMGC.
图卷积网络(GCNs)在基于三维骨骼的人体运动预测中显示出相当大的前景。基于对人体运动可以通过人体关节之间的物理联系来描绘的直观观察,许多前人的工作设计了多尺度图来学习不同图尺度之间的关系和约束,在人体运动预测方面取得了令人鼓舞的结果。然而,这些固定的多尺度图通过合并相邻的人体关节信息来获得新的尺度图,忽略了动态运动中隐含的语义信息。此外,随着多尺度聚类图深度的增加,人体关节相关性趋于随机变化,这与固定多尺度图的设计理念相矛盾。为了解决这些限制,我们探索了一种新的基于关联的多尺度图聚类网络(CMGC),用于自适应多尺度图表示学习。给定人体关节图,CMGC的目标是首先在不同尺度上自适应生成更多表示运动关联的新图,然后有选择地将导出的图尺度恢复到原始人体关节图,从而实现各种运动特征的提取。此外,我们引入了离散小波变换(DWT)来补偿离散余弦变换(DCT)对人体运动的域建模所造成的信号损失。该算法通过自适应多尺度图获得了令人满意的性能。大量实验表明,在Human 3.6M、CMU Mocap和3DPW数据集上,CMGC的平均每个关节位置误差(MPJPE)分别比最先进的方法高出11.2%、10.1%和11.2%。我们还在Human3.6M上测试了平均角度误差(MAE),比以前的方法降低了6.5%。我们的代码发布在https://github.com/JunyuShi02/CMGC。
{"title":"Multiscale Graph Redefining: Correlation-Based Multiscale Graph Clustering Network for Human Motion Prediction.","authors":"Jianqi Zhong,Junyu Shi,Wenming Cao","doi":"10.1109/tnnls.2026.3684128","DOIUrl":"https://doi.org/10.1109/tnnls.2026.3684128","url":null,"abstract":"Graph Convolutional Networks (GCNs) have exhibited considerable promise in 3-D skeleton-based human motion prediction. Based on the intuitive observation that human motion can be delineated through the physical interconnections among human joints, many previous works have designed multiscale graphs to learn the relationships and constraints between different graph scales, obtaining encouraging results for human motion prediction. However, these fixed multiscale graphs obtain new scale graphs by merging adjacent human joint information, ignoring implicit semantic information during dynamic movements. Furthermore, human joint correlations tend to vary randomly as the depth of the multiscale clustering graph increases, which contradicts the design concept of fixed multiscale graphs. To address these limitations, we explore a novel correlation-based multiscale graph clustering network (CMGC) for adaptive multiscale graph representation learning. Given a human joints graph, the goal of CMGC is first to generate more new graphs representing motion correlations adaptively at different scale levels and then selectively restore the derived graph scales to the original human joints graphs, which enables various motion features extraction. Moreover, we introduce the discrete wavelet transform (DWT) to compensate for the signal loss caused by discrete cosine transform (DCT) domain modeling from human motion. The CMGC gives rise to gratifying performances with the adaptive multiscale graph. Extensive experiments reveal that CMGC outperforms state-of-the-art methods by 11.2%, 10.1%, and 11.2% of 3-D mean per joint position error (MPJPE) on average on Human 3.6M, CMU Mocap, and 3DPW datasets, respectively. We also test the mean angle error (MAE) on Human3.6M, which is lower by 6.5% than previous methods. Our code is released at https://github.com/JunyuShi02/CMGC.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"322 1","pages":""},"PeriodicalIF":10.4,"publicationDate":"2026-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147731259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on neural networks and learning systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1