IEEE Transactions on Neural Networks and Learning Systems: Latest Articles

Computing Node Failure Prediction Based on Continuous-Time Dynamic Graph
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-22 | DOI: 10.1109/tnnls.2026.3684886
Binbin Huang, Teng Bao, Feiyi Chen, Lingbin Wang, Xunqing Huang, Yuyu Yin, Xiaoying Shi, Shangguang Wang, Shuiguang Deng
The growth of large models demands multinode cooperation during training and inference. Computing node failures can interrupt these processes, causing information loss and prolonging execution time. Accurate prediction of computing node failures is therefore vital: it helps avert large overhead, service interruptions, and negative customer experiences. Existing solutions mainly rely on state-of-the-art time-series models to improve prediction performance. However, they do not capture the causal relationship between device over-utilization and node failures, and they fail to extract the complex spatial-temporal cascading correlations among computing node failure events. These limitations can degrade prediction performance. To address these problems, this article designs a continuous-time dynamic graph-based computing node failure prediction (CTDG-NFP) scheme that predicts failures accurately in dynamic cluster environments. Specifically, the CTDG-NFP scheme first designs a novel multidimensional feature-biased neighbor sampling method, which jointly considers CPU utilization, memory utilization, temporal, and spatial biases, to sample relevant context. Then, the scheme extracts diverse computing node failure motifs using a multidimensional feature-biased long-short-path walk method and a set-based anonymization method. Finally, the scheme adopts a time encoder to encode these motifs, thereby extracting the complex spatial-temporal correlations among computing node failure events. On this basis, contrastive learning is adopted to train the computing node failure prediction model. Extensive evaluations on various real-world failure traces demonstrate that the CTDG-NFP scheme achieves superior performance on six widely used metrics compared with SOTA node failure prediction methods.
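The abstract does not give the sampling formula, so the following is only a rough sketch of what a feature-biased neighbor sampler over timestamped interactions could look like. The `Interaction` schema, the exponential weighting, and the coefficients `a_cpu`, `a_mem`, `a_time`, and `a_space` are all assumptions made for illustration; they are not taken from the CTDG-NFP paper.

```python
import math
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Interaction:
    """One timestamped edge in a continuous-time dynamic graph (hypothetical schema)."""
    neighbor: str      # id of the neighboring computing node
    timestamp: float   # time at which the interaction was observed
    cpu_util: float    # neighbor CPU utilization in [0, 1]
    mem_util: float    # neighbor memory utilization in [0, 1]
    distance: float    # spatial distance, e.g. rack hops (assumed definition)


def sampling_weights(history, now, a_cpu=1.0, a_mem=1.0, a_time=0.1, a_space=0.5):
    """Unnormalized bias that favors heavily utilized, recent, nearby neighbors."""
    weights = []
    for e in history:
        if e.timestamp >= now:
            weights.append(0.0)  # never sample interactions from the future
            continue
        score = (a_cpu * e.cpu_util + a_mem * e.mem_util
                 - a_time * (now - e.timestamp) - a_space * e.distance)
        weights.append(math.exp(score))
    return weights


def sample_neighbors(history, now, k, rng=random):
    """Draw k past interactions (with replacement) in proportion to their bias."""
    weights = sampling_weights(history, now)
    if sum(weights) == 0.0:
        return []  # no admissible past interaction
    return rng.choices(history, weights=weights, k=k)
```

Passing a seeded `random.Random` instance as `rng` makes the draw reproducible; the four coefficients trade off the utilization, temporal, and spatial biases the abstract names.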
Citations: 0
Forgettable Federated Linear Learning With Certified Data Unlearning
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-22 | DOI: 10.1109/tnnls.2026.3683398
Ruinan Jin, Minghui Chen, Qiong Zhang, Xiaoxiao Li
Citations: 0
Node Classification in GNNs: Impact of Neighborhood Label Distribution on Homophily and Heterophily
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-22 | DOI: 10.1109/tnnls.2026.3680732
Zhili Zhao, Li Wan, Xupeng Liu, Ruiyi Yan, Shaomeng Wang
In node classification, traditional graph neural networks (GNNs) typically assume implicit homophily, meaning that nodes of the same class are likely to be connected. However, real-world graphs frequently exhibit heterophily, in which nodes of different classes are also commonly connected. To address this challenge, recent methods expand local neighborhoods or employ adaptive message aggregation to improve GNN performance on heterophilic graphs. Nevertheless, these methods remain restricted by the homophily assumption: they fail to capture long-range dependencies (e.g., widely separated intraclass nodes) effectively and do not fully leverage the graph topology. This study investigates how GNN performance differs between homophilic and heterophilic graphs and finds that the distinguishability of neighborhood label distributions (NLDs) correlates significantly with node classification accuracy. To assess the impact of NLD on node classification, this study proposes a novel homophily metric based on node distinguishability. It then introduces a new GNN model, NLD-based GNN (NLDGNN), for node classification. First, NLDGNN initializes node representations by integrating node features with node NLDs. To address long-range dependencies in heterophilic graphs, NLDGNN uses a global label relationship matrix with low-rank characteristics for global message passing. NLDGNN constructs this matrix by combining attention scores derived from the initial node representations, which enhances message passing and improves the expressiveness of node representations. Experimental results indicate that NLDGNN outperforms existing GNN models on both homophilic and heterophilic real-world graphs. The code of this study is available at https://github.com/wanli6/NLDGNN.
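To make the notion of a neighborhood label distribution concrete, here is a minimal sketch that computes, for every node, the normalized histogram of its neighbors' labels. It also computes the standard edge homophily ratio for comparison. The function names are illustrative, and the edge homophily ratio is the commonly used baseline metric, not the new distinguishability-based metric the abstract proposes (which the abstract does not define).

```python
from collections import Counter


def neighborhood_label_distributions(edges, labels, num_classes):
    """Return {node: [p_0, ..., p_{C-1}]}, the label histogram of each node's neighbors."""
    neighbors = {node: [] for node in labels}
    for u, v in edges:  # edges are treated as undirected
        neighbors[u].append(v)
        neighbors[v].append(u)
    nld = {}
    for node, nbrs in neighbors.items():
        counts = Counter(labels[n] for n in nbrs)
        total = len(nbrs)
        # Isolated nodes get an all-zero distribution.
        nld[node] = [counts[c] / total if total else 0.0 for c in range(num_classes)]
    return nld


def edge_homophily(edges, labels):
    """Standard edge homophily ratio: fraction of edges joining same-label nodes."""
    if not edges:
        return 0.0
    same = sum(1 for u, v in edges if labels[u] == labels[v])
    return same / len(edges)
```

On a path graph 0-1-2-3 with labels 0, 0, 1, 1, node 1 has one neighbor of each class, so its distribution is `[0.5, 0.5]`, while two of the three edges join same-label nodes.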
Citations: 0
Systematic Abductive Reasoning via Diverse Relation Representations in Vector-Symbolic Architecture
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-22 | DOI: 10.1109/tnnls.2026.3684958
Zhong-Hua Sun, Ru-Yuan Zhang, Zonglei Zhen, Da-Hui Wang, Yong-Jie Li, Xiaohong Wan, Hongzhi You
Citations: 0
Scalable and Efficient Deep Reinforcement Learning-Based Model Checker for Computation Tree Logic
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-21 | DOI: 10.1109/tnnls.2026.3683573
Ghalya Alwhishi, Jamal Bentahar, Amine Andam, Ahmed Elwhishi, Mustapha Hedabou
Formal verification using temporal logics such as computation tree logic (CTL) is essential for validating safety and correctness in complex systems. However, traditional model-checking techniques face severe scalability limitations due to the state explosion problem and their reliance on exhaustive symbolic traversal. Moreover, existing learning-based verification methods often lack formal guarantees and interpretability. These challenges create a pressing need for scalable, learning-based verification methods that preserve verification reliability while improving computational efficiency. This article introduces a novel deep reinforcement learning (DRL)-based model checking framework that learns to verify CTL formulas directly through interaction with system models. Unlike traditional symbolic model checkers such as NuSMV, the proposed DRL-CTL checker, trained using proximal policy optimization (PPO), interprets CTL semantics over system models represented as Kripke structures without performing symbolic state-space traversal at inference time. Reward functions are designed for individual CTL operators, and fixed-point reasoning is incorporated to handle global temporal properties such as $AG(\phi)$ and $EG(\phi)$. Experimental results show that the proposed method achieves a near-constant inference time of approximately 2 ms per formula on a machine with an Intel Core i9-13900K CPU (24 cores, 3.0 GHz), 64 GB RAM, and an NVIDIA RTX 4090 GPU (24 GB VRAM), reduces verification time by up to 90% compared with traditional model checkers, and scales to models with more than $10^{1192}$ reachable states. The framework also produces witnesses and counterexamples and yields verification outcomes identical to those of symbolic checkers in the authors' experiments. These results highlight the potential of DRL to serve as a scalable, efficient, and explainable alternative to classical CTL model checking.
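For readers unfamiliar with the fixed-point view of $EG$ and $AG$ mentioned above, the sketch below shows the classical explicit-state computation on a small Kripke structure. This is the textbook greatest-fixed-point algorithm, not the DRL-based checker from the paper; the dictionary encoding of the transition relation (`succ`) and the assumption that every state has at least one successor are choices made for this illustration.

```python
def eg(succ, sat_phi):
    """States satisfying EG phi: greatest fixed point of Z = sat_phi AND EX Z."""
    z = set(sat_phi)
    while True:
        # Keep a state only if some successor is still in Z.
        nxt = {s for s in z if any(t in z for t in succ[s])}
        if nxt == z:
            return z
        z = nxt


def ag(succ, sat_phi):
    """States satisfying AG phi: greatest fixed point of Z = sat_phi AND AX Z."""
    z = set(sat_phi)
    while True:
        # Keep a state only if every successor is still in Z.
        nxt = {s for s in z if all(t in z for t in succ[s])}
        if nxt == z:
            return z
        z = nxt


# Tiny example: state 3 can loop on itself forever, state 2 is a sink.
succ = {0: [1], 1: [2], 2: [2], 3: [0, 3]}
sat_phi = {0, 2, 3}  # states where the atomic property phi holds
print(sorted(eg(succ, sat_phi)))  # states with SOME path staying in phi forever
print(sorted(ag(succ, sat_phi)))  # states where ALL paths stay in phi forever
```

Each iteration can only shrink `z`, so both loops terminate after at most as many rounds as there are states. This exhaustive sweep over the state set is the cost that grows with model size in classical checkers.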
Citations: 0
A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness, and Stability
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-21 | DOI: 10.1109/tnnls.2026.3675892
Leilei Cui, Zhong-Ping Jiang, Petter N. Kolm, Grégoire G. Macqueron
Citations: 0
Multiscale Graph Redefining: Correlation-Based Multiscale Graph Clustering Network for Human Motion Prediction
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-21 | DOI: 10.1109/tnnls.2026.3684128
Jianqi Zhong, Junyu Shi, Wenming Cao
Graph convolutional networks (GCNs) have shown considerable promise in 3-D skeleton-based human motion prediction. Based on the intuitive observation that human motion can be described through the physical interconnections among human joints, many previous works have designed multiscale graphs to learn the relationships and constraints between different graph scales, obtaining encouraging results for human motion prediction. However, these fixed multiscale graphs obtain new scale graphs by merging information from adjacent joints, ignoring implicit semantic information during dynamic movements. Furthermore, joint correlations tend to vary randomly as the depth of the multiscale clustering graph increases, which contradicts the design concept of fixed multiscale graphs. To address these limitations, we explore a novel correlation-based multiscale graph clustering network (CMGC) for adaptive multiscale graph representation learning. Given a human joint graph, CMGC first adaptively generates new graphs that represent motion correlations at different scale levels and then selectively restores the derived graph scales to the original human joint graph, which enables the extraction of various motion features. Moreover, we introduce the discrete wavelet transform (DWT) to compensate for the signal loss caused by modeling human motion in the discrete cosine transform (DCT) domain. Extensive experiments show that CMGC improves on state-of-the-art methods in 3-D mean per joint position error (MPJPE) by 11.2%, 10.1%, and 11.2% on average on the Human3.6M, CMU Mocap, and 3DPW datasets, respectively. We also evaluate the mean angle error (MAE) on Human3.6M, where it is 6.5% lower than that of previous methods. Our code is released at https://github.com/JunyuShi02/CMGC.
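The MPJPE metric reported above is the average Euclidean distance between predicted and ground-truth joint positions. A minimal sketch of that definition follows; the nested-list layout (frames, then joints, then `(x, y, z)` coordinates) is an assumption for illustration, and benchmark protocols may add details (units, joint subsets, per-horizon averaging) that the abstract does not specify.

```python
import math


def mpjpe(pred, target):
    """Mean per joint position error over all frames and joints.

    pred and target are sequences of frames; each frame is a sequence of
    (x, y, z) joint positions in the same order and the same units.
    """
    total, count = 0.0, 0
    for frame_pred, frame_true in zip(pred, target):
        for joint_pred, joint_true in zip(frame_pred, frame_true):
            total += math.dist(joint_pred, joint_true)  # Euclidean distance
            count += 1
    if count == 0:
        raise ValueError("no joints to compare")
    return total / count
```

For one frame with two joints displaced by 3 and 4 units respectively, the function returns their mean, 3.5.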
Citations: 0
Distributed Inertial k-Winners-Take-All Neural Network Based on Quadratic Optimization Problems
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-20 | DOI: 10.1109/tnnls.2026.3683360
Xiaohan Bo, Song Zhu, Zhen Zhang, Weiwei Luo, Shiping Wen, Chaoxu Mu
Citations: 0
Spectral–Spatial–Temporal Kolmogorov–Arnold Network for Hyperspectral Change Detection
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-20 | DOI: 10.1109/tnnls.2026.3680585
Puhong Duan, Wenxuan Wang, Xudong Kang, Shutao Li
Citations: 0
A Deep Neural Network Optimization Framework Based on Optimal Transport Bridge Feature Selection and Sparse Representation
IF 10.4 | Zone 1, Computer Science | Q1 Computer Science, Artificial Intelligence | Pub Date: 2026-04-17 | DOI: 10.1109/tnnls.2026.3678220
Guipeng Lan, Shuai Xiao, Jiabao Wen, Jiachen Yang, Wen Lu, Baihua Li, Qinggang Meng, Xinbo Gao
The performance of deep neural networks (DNNs) depends heavily on feature selection and sparse representation of high-dimensional data. Previous work has treated feature selection and sparse representation as separate mechanisms for improving DNN performance, focusing on identifying and leveraging informative features to enhance task-specific outcomes. However, few studies have established a connection between the two. To address this gap, this article proposes an optimization framework termed informative sparse transport (IST), which integrates feature selection and sparse coding into a unified multiobjective optimization framework. Using optimal transport as a bridge, the IST framework harmonizes the relationship between feature selection and sparse representation, offering an informational advantage. In the IST framework, feature selection aims to identify an optimal subset of features that maximizes mutual information or minimizes redundancy, while sparse representation seeks to approximate the data with as few features as possible. Although these objectives differ, they are fundamentally complementary, as both emphasize extracting task-relevant information while eliminating redundancy. By unifying feature selection and sparse representation, the IST framework mitigates the challenges posed by high-dimensional data and provides a robust solution for feature extraction and representation. We validate the IST framework on generative and classification tasks, demonstrating that it improves model performance through the complementary synergy of feature selection and sparse representation.
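The abstract does not say how the optimal transport problem inside IST is formulated or solved. As background only, the sketch below computes an entropy-regularized transport plan between two discrete distributions with the standard Sinkhorn iteration. The choice of Sinkhorn, the regularization strength `reg`, and the iteration count are assumptions for illustration and should not be read as the IST method.

```python
import math


def sinkhorn(cost, a, b, reg=0.5, iters=500):
    """Entropy-regularized optimal transport plan between histograms a and b.

    cost[i][j] is the cost of moving mass from source bin i to target bin j;
    a and b are nonnegative and each sums to 1.
    """
    n, m = len(a), len(b)
    # Gibbs kernel derived from the cost matrix.
    kernel = [[math.exp(-cost[i][j] / reg) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(iters):
        # Alternately rescale rows and columns to match the two marginals.
        u = [a[i] / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]
```

The returned plan has row sums close to `a` and column sums close to `b`; a smaller `reg` yields a sparser plan that is closer to unregularized optimal transport, at the price of slower convergence and possible numerical underflow.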
Citations: 0