首页 > 最新文献

Neural Networks最新文献

英文 中文
DH-MSVM: A hybrid algorithm for seeking quality support vectors in distributed learning DH-MSVM:分布式学习中寻找高质量支持向量的混合算法
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-10 DOI: 10.1016/j.neunet.2026.108569
Jiawen Gong , Beihao Xia , Qinmu Peng , Bin Zou , Xinge You
Data heterogeneity is a common yet complex challenge in distributed machine learning scenarios. However, current Distributed Support Vector Machines (DSVMs) lack effective mechanisms to identify suitable support vectors across diverse data structures, limiting their ability to dynamically adjust decision boundaries. To address this issue, we propose a Distributed Hybrid Learning based on Support Vector Machine (DH-SVM). This approach leverages global pre-learning to capture data structure information, which then guides local learning processes to identify higher-quality support vectors and adaptively refine decision boundaries. Furthermore, considering the computational overhead inherent in distributed learning, we enhance our algorithm by incorporating a Markov sampling technique (DH-MSVM). Theoretically, we derive the generalization bound of the algorithm based on uniformly ergodic Markov chain samples and establish a fast learning rate, demonstrating the robustness and scalability of DH-MSVM. Empirically, extensive experiments on real-world datasets validate the superior performance of the proposed algorithms.
在分布式机器学习场景中,数据异构是一个常见而复杂的挑战。然而,目前的分布式支持向量机(dsvm)缺乏有效的机制来识别不同数据结构中合适的支持向量,限制了它们动态调整决策边界的能力。为了解决这个问题,我们提出了一种基于支持向量机(DH-SVM)的分布式混合学习。该方法利用全局预学习来捕获数据结构信息,然后指导局部学习过程来识别更高质量的支持向量并自适应地细化决策边界。此外,考虑到分布式学习固有的计算开销,我们通过结合马尔可夫采样技术(DH-MSVM)来增强我们的算法。从理论上讲,我们基于均匀遍历马尔可夫链样本推导了算法的泛化界,并建立了快速的学习率,证明了DH-MSVM的鲁棒性和可扩展性。在经验上,对真实世界数据集的大量实验验证了所提出算法的优越性能。
{"title":"DH-MSVM: A hybrid algorithm for seeking quality support vectors in distributed learning","authors":"Jiawen Gong ,&nbsp;Beihao Xia ,&nbsp;Qinmu Peng ,&nbsp;Bin Zou ,&nbsp;Xinge You","doi":"10.1016/j.neunet.2026.108569","DOIUrl":"10.1016/j.neunet.2026.108569","url":null,"abstract":"<div><div>Data heterogeneity is a common yet complex challenge in distributed machine learning scenarios. However, current Distributed Support Vector Machines (DSVMs) lack effective mechanisms to identify suitable support vectors across diverse data structures, limiting their ability to dynamically adjust decision boundaries. To address this issue, we propose a Distributed Hybrid Learning based on Support Vector Machine (DH-SVM). This approach leverages global pre-learning to capture data structure information, which then guides local learning processes to identify higher-quality support vectors and adaptively refine decision boundaries. Furthermore, considering the computational overhead inherent in distributed learning, we enhance our algorithm by incorporating a Markov sampling technique (DH-MSVM). Theoretically, we derive the generalization bound of the algorithm based on uniformly ergodic Markov chain samples and establish a fast learning rate, demonstrating the robustness and scalability of DH-MSVM. Empirically, extensive experiments on real-world datasets validate the superior performance of the proposed algorithms.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108569"},"PeriodicalIF":6.3,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-stage active cleaning strategy for long-tail label noise 长尾标签噪声的两阶段主动清洗策略
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-10 DOI: 10.1016/j.neunet.2026.108585
Xiao Lin , Zeyu Rong , Yan Li , Qizhe Yang , Ping Li , Wei Huang
Long-tailed data is ubiquitous in real-world applications, posing significant challenges due to imbalanced class distribution and high levels of label noise. Previous methods to address long-tailed data with label noise often incur high computational and manual costs. To address these challenges, we propose a novel two-stage active label cleaning strategy, which extends active learning beyond traditional label acquisition to efficiently identify and correct mislabeled samples while minimizing annotation cost. Specifically, in the first stage, we propose a Balanced Class-Centered Contrastive Learning (BCCL) to enhance feature representation quality and identify potential label noise within long-tailed datasets. BCCL achieves this through a novel loss function that integrates contrastive learning with weighted average of class centers. The second stage employs an uncertainty-based active learning sorted sampling to retrain on potential label noise samples, focusing on high-uncertainty instances to determine the final noise samples needing to be relabeled. Our two-stage active label cleaning strategy minimizes the amount of data requiring re-annotation, ultimately improving classification performance through iterative re-labeling, while optimizing the use of annotation resources and reducing the annotation workload. Experimental results demonstrate the robustness of our proposed method across varying noise ratios and levels of imbalance, effectively enhancing discriminative capability on noisy data in multiple datasets and achieving superior classification performance on long-tailed data, particularly in high-noise scenarios. In experiments on the CIFAR10-LT dataset under imbalance ratio 10 and symmetric noise 0.6, we significantly outperform the state-of-the-art PCSE with a relative improvement of 5.17%. In addition, on the real-noise long-tail dataset Red Mini-ImageNet under imbalance ratio 100 and noise ratio 0.4, we achieve an accuracy of 38.37%, surpassing existing baselines.
长尾数据在现实应用中无处不在,由于类别分布不平衡和高水平的标签噪声,带来了重大挑战。以往处理带有标签噪声的长尾数据的方法往往需要耗费大量的计算和人工成本。为了解决这些挑战,我们提出了一种新的两阶段主动标签清洗策略,该策略将主动学习扩展到传统的标签获取之外,以有效地识别和纠正错误标记的样本,同时最大限度地降低标注成本。具体而言,在第一阶段,我们提出了平衡类中心对比学习(BCCL)来提高特征表示质量并识别长尾数据集中潜在的标签噪声。BCCL通过一种新颖的损失函数实现了这一目标,该函数将对比学习与类中心加权平均相结合。第二阶段采用基于不确定性的主动学习分类采样对潜在的标签噪声样本进行再训练,重点关注高不确定性实例,以确定需要重新标记的最终噪声样本。我们的两阶段主动标签清理策略最大限度地减少了需要重新标注的数据量,最终通过迭代重新标注提高了分类性能,同时优化了标注资源的使用,减少了标注工作量。实验结果表明,本文提出的方法在不同的噪声比和不平衡程度下具有鲁棒性,有效地增强了对多数据集中噪声数据的判别能力,并在长尾数据(特别是高噪声场景)上取得了优异的分类性能。在失衡比为10、对称噪声为0.6的CIFAR10-LT数据集上进行的实验中,我们显著优于最先进的PCSE,相对提高了5.17%。此外,在不平衡比为100、噪声比为0.4的实噪声长尾数据集Red Mini-ImageNet上,我们的准确率达到了38.37%,超过了现有的基线。
{"title":"A two-stage active cleaning strategy for long-tail label noise","authors":"Xiao Lin ,&nbsp;Zeyu Rong ,&nbsp;Yan Li ,&nbsp;Qizhe Yang ,&nbsp;Ping Li ,&nbsp;Wei Huang","doi":"10.1016/j.neunet.2026.108585","DOIUrl":"10.1016/j.neunet.2026.108585","url":null,"abstract":"<div><div>Long-tailed data is ubiquitous in real-world applications, posing significant challenges due to imbalanced class distribution and high levels of label noise. Previous methods to address long-tailed data with label noise often incur high computational and manual costs. To address these challenges, we propose a novel two-stage active label cleaning strategy, which extends active learning beyond traditional label acquisition to efficiently identify and correct mislabeled samples while minimizing annotation cost. Specifically, in the first stage, we propose a Balanced Class-Centered Contrastive Learning (BCCL) to enhance feature representation quality and identify potential label noise within long-tailed datasets. BCCL achieves this through a novel loss function that integrates contrastive learning with weighted average of class centers. The second stage employs an uncertainty-based active learning sorted sampling to retrain on potential label noise samples, focusing on high-uncertainty instances to determine the final noise samples needing to be relabeled. Our two-stage active label cleaning strategy minimizes the amount of data requiring re-annotation, ultimately improving classification performance through iterative re-labeling, while optimizing the use of annotation resources and reducing the annotation workload. Experimental results demonstrate the robustness of our proposed method across varying noise ratios and levels of imbalance, effectively enhancing discriminative capability on noisy data in multiple datasets and achieving superior classification performance on long-tailed data, particularly in high-noise scenarios. In experiments on the CIFAR10-LT dataset under imbalance ratio 10 and symmetric noise 0.6, we significantly outperform the state-of-the-art PCSE with a relative improvement of 5.17%. In addition, on the real-noise long-tail dataset Red Mini-ImageNet under imbalance ratio 100 and noise ratio 0.4, we achieve an accuracy of 38.37%, surpassing existing baselines.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108585"},"PeriodicalIF":6.3,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
YOFOR : You only focus on object regions for tiny object detection in aerial images YOFOR:在航拍图像中,你只关注物体区域进行微小物体检测
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-10 DOI: 10.1016/j.neunet.2026.108571
Heng Hu, Hao-Zhe Wang, Si-Bao Chen, Jin Tang
With development of deep learning methods, performance of object detection has been greatly improved. However, the high resolution of remotely sensed images, the complexity of the background, the uneven distribution of objects, and the uneven number of objects among them lead to unsatisfactory detection results of existing detectors. Facing these challenges, we propose YOFOR (You Only Focus on Object Regions), an adaptive local sensing enhancement network. It contains three components: adaptive local sensing module, fuzzy enhancement module and class balance module. Among them, adaptive local sensing module can adaptively localize dense object regions and dynamically crop dense object regions on view, which effectively solves problem of uneven distribution of objects. Fuzzy enhancement module further enhances object region by weakening the background interference, thus improving detection performance. Class balancing module, which analyzes dataset to obtain distribution of long-tailed classes, takes into account direction of tailed classes and distance around object, and operates on tailed classes within a certain range to alleviate long-tailed class problem and further improve detection performance. All three components are unsupervised and can be easily inserted into existing networks. Extensive experiments on the VisDrone, DOTA, and AI-TOD datasets demonstrate the effectiveness and adaptability of the method.
随着深度学习方法的发展,目标检测的性能得到了很大的提高。然而,由于遥感图像的高分辨率、背景的复杂性、目标分布的不均匀以及其中目标数量的不均匀,导致现有探测器的检测效果不理想。面对这些挑战,我们提出了一种自适应局部传感增强网络YOFOR (You Only Focus on Object Regions)。该系统由三个部分组成:自适应局部感知模块、模糊增强模块和类平衡模块。其中,自适应局部传感模块能够自适应定位密集目标区域,并动态裁剪可视密集目标区域,有效解决了目标分布不均匀的问题。模糊增强模块通过减弱背景干扰进一步增强目标区域,从而提高检测性能。类平衡模块通过对数据集进行分析得到长尾类的分布,考虑到尾类的方向和与目标的距离,在一定范围内对尾类进行操作,缓解长尾类问题,进一步提高检测性能。这三个组件都是无监督的,可以很容易地插入到现有的网络中。在VisDrone、DOTA和AI-TOD数据集上的大量实验证明了该方法的有效性和适应性。
{"title":"YOFOR : You only focus on object regions for tiny object detection in aerial images","authors":"Heng Hu,&nbsp;Hao-Zhe Wang,&nbsp;Si-Bao Chen,&nbsp;Jin Tang","doi":"10.1016/j.neunet.2026.108571","DOIUrl":"10.1016/j.neunet.2026.108571","url":null,"abstract":"<div><div>With development of deep learning methods, performance of object detection has been greatly improved. However, the high resolution of remotely sensed images, the complexity of the background, the uneven distribution of objects, and the uneven number of objects among them lead to unsatisfactory detection results of existing detectors. Facing these challenges, we propose YOFOR (You Only Focus on Object Regions), an adaptive local sensing enhancement network. It contains three components: adaptive local sensing module, fuzzy enhancement module and class balance module. Among them, adaptive local sensing module can adaptively localize dense object regions and dynamically crop dense object regions on view, which effectively solves problem of uneven distribution of objects. Fuzzy enhancement module further enhances object region by weakening the background interference, thus improving detection performance. Class balancing module, which analyzes dataset to obtain distribution of long-tailed classes, takes into account direction of tailed classes and distance around object, and operates on tailed classes within a certain range to alleviate long-tailed class problem and further improve detection performance. All three components are unsupervised and can be easily inserted into existing networks. Extensive experiments on the VisDrone, DOTA, and AI-TOD datasets demonstrate the effectiveness and adaptability of the method.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108571"},"PeriodicalIF":6.3,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Key-value pair-free continual learner via task-specific prompt-prototype 通过特定任务的提示原型实现无键值对的持续学习
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-10 DOI: 10.1016/j.neunet.2026.108576
Haihua Luo , Xuming Ran , Zhengji Li , Huiyan Xue , Tingting Jiang , Jiangrong Shen , Tommi Kärkkäinen , Qi Xu , Fengyu Cong
Continual learning aims to enable models to acquire new knowledge while retaining previously learned information. Prompt-based methods have shown remarkable performance in this domain; however, they typically rely on key-value pairing, which can introduce inter-task interference and hinder scalability. To overcome these limitations, we propose a novel approach employing task-specific Prompt-Prototype (ProP), thereby eliminating the need for key-value pairs. In our method, task-specific prompts facilitate more effective feature learning for the current task, while corresponding prototypes capture the representative features of the input. During inference, predictions are generated by binding each task-specific prompt with its associated prototype. Additionally, we introduce regularization constraints during prompt initialization to penalize excessively large values, thereby enhancing stability. Experiments on several widely used datasets demonstrate the effectiveness of the proposed method. In contrast to mainstream prompt-based approaches, our framework removes the dependency on key-value pairs, offering a fresh perspective for future continual learning research.
持续学习的目的是使模型在保留以前学习过的信息的同时获得新的知识。基于提示的方法在该领域表现出了显著的性能;然而,它们通常依赖于键值配对,这可能会引入任务间干扰并阻碍可伸缩性。为了克服这些限制,我们提出了一种使用特定于任务的提示原型(ProP)的新方法,从而消除了对键值对的需要。在我们的方法中,任务特定的提示有助于更有效地学习当前任务的特征,而相应的原型则捕获输入的代表性特征。在推理过程中,通过将每个特定于任务的提示与其相关的原型绑定来生成预测。此外,我们在提示初始化期间引入正则化约束,以惩罚过大的值,从而增强稳定性。在几个广泛使用的数据集上的实验证明了该方法的有效性。与主流的基于提示的方法相比,我们的框架消除了对键值对的依赖,为未来的持续学习研究提供了一个新的视角。
{"title":"Key-value pair-free continual learner via task-specific prompt-prototype","authors":"Haihua Luo ,&nbsp;Xuming Ran ,&nbsp;Zhengji Li ,&nbsp;Huiyan Xue ,&nbsp;Tingting Jiang ,&nbsp;Jiangrong Shen ,&nbsp;Tommi Kärkkäinen ,&nbsp;Qi Xu ,&nbsp;Fengyu Cong","doi":"10.1016/j.neunet.2026.108576","DOIUrl":"10.1016/j.neunet.2026.108576","url":null,"abstract":"<div><div>Continual learning aims to enable models to acquire new knowledge while retaining previously learned information. Prompt-based methods have shown remarkable performance in this domain; however, they typically rely on key-value pairing, which can introduce inter-task interference and hinder scalability. To overcome these limitations, we propose a novel approach employing task-specific <strong>Pro</strong>mpt-<strong>P</strong>rototype (<strong>ProP</strong>), thereby eliminating the need for key-value pairs. In our method, task-specific prompts facilitate more effective feature learning for the current task, while corresponding prototypes capture the representative features of the input. During inference, predictions are generated by binding each task-specific prompt with its associated prototype. Additionally, we introduce regularization constraints during prompt initialization to penalize excessively large values, thereby enhancing stability. Experiments on several widely used datasets demonstrate the effectiveness of the proposed method. In contrast to mainstream prompt-based approaches, our framework removes the dependency on key-value pairs, offering a fresh perspective for future continual learning research.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108576"},"PeriodicalIF":6.3,"publicationDate":"2026-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Point-Deeponet: Predicting nonlinear fields on non-Parametric geometries under variable load conditions 点-深度网络:在变载荷条件下预测非参数几何的非线性场
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108560
Jangseop Park , Namwoo Kang
Nonlinear structural analyses in engineering often require extensive finite element simulations, limiting their applicability in design optimization and real-time control. Conventional deep learning surrogates often struggle with complex, non-parametric three-dimensional (3D) geometries and directionally varying loads. This work presents Point-DeepONet, an operator-learning-based surrogate that integrates PointNet into the DeepONet framework to learn a mapping from non-parametric geometries and variable load conditions to physical response fields. By leveraging PointNet to learn a geometric representation from raw point clouds, our model circumvents the need for manual parameterization. This geometric embedding is then synergistically fused with load conditions within the DeepONet architecture to accurately predict three-dimensional displacement and von Mises stress fields. Trained on a large-scale dataset, Point-DeepONet demonstrates high fidelity, achieving a coefficient of determination (R²) reaching 0.987 for displacement and 0.923 for von Mises stress. Furthermore, to rigorously validate its generalization capabilities, we conducted additional experiments on unseen, randomly oriented load directions, where the model maintained exceptional accuracy. Compared to nonlinear finite element analyses that require about 19.32 minutes per case, Point-DeepONet provides predictions in mere seconds-approximately 400 times faster-while maintaining excellent scalability. These findings, validated through extensive experiments and ablation studies, highlight the potential of Point-DeepONet to enable rapid, high-fidelity structural analyses for complex engineering workflows.
工程中的非线性结构分析往往需要大量的有限元模拟,限制了其在设计优化和实时控制中的适用性。传统的深度学习替代品经常与复杂的非参数三维(3D)几何形状和方向变化的负载作斗争。这项工作提出了Point-DeepONet,这是一个基于算子学习的代理,它将PointNet集成到DeepONet框架中,以学习从非参数几何形状和可变负载条件到物理响应场的映射。通过利用PointNet从原始点云中学习几何表示,我们的模型避免了手动参数化的需要。然后,这种几何嵌入与DeepONet架构中的负载条件协同融合,以准确预测三维位移和von Mises应力场。在大规模数据集上训练后,Point-DeepONet显示出高保真度,位移的决定系数(R²)达到0.987,von Mises应力的决定系数(R²)达到0.923。此外,为了严格验证其泛化能力,我们在看不见的、随机定向的负载方向上进行了额外的实验,其中模型保持了出色的准确性。与每次需要19.32分钟的非线性有限元分析相比,Point-DeepONet在几秒钟内提供预测——大约快了400倍——同时保持了出色的可扩展性。这些发现经过了大量实验和消融研究的验证,突显了Point-DeepONet在复杂工程工作流程中实现快速、高保真结构分析的潜力。
{"title":"Point-Deeponet: Predicting nonlinear fields on non-Parametric geometries under variable load conditions","authors":"Jangseop Park ,&nbsp;Namwoo Kang","doi":"10.1016/j.neunet.2026.108560","DOIUrl":"10.1016/j.neunet.2026.108560","url":null,"abstract":"<div><div>Nonlinear structural analyses in engineering often require extensive finite element simulations, limiting their applicability in design optimization and real-time control. Conventional deep learning surrogates often struggle with complex, non-parametric three-dimensional (3D) geometries and directionally varying loads. This work presents Point-DeepONet, an operator-learning-based surrogate that integrates PointNet into the DeepONet framework to learn a mapping from non-parametric geometries and variable load conditions to physical response fields. By leveraging PointNet to learn a geometric representation from raw point clouds, our model circumvents the need for manual parameterization. This geometric embedding is then synergistically fused with load conditions within the DeepONet architecture to accurately predict three-dimensional displacement and von Mises stress fields. Trained on a large-scale dataset, Point-DeepONet demonstrates high fidelity, achieving a coefficient of determination (R²) reaching 0.987 for displacement and 0.923 for von Mises stress. Furthermore, to rigorously validate its generalization capabilities, we conducted additional experiments on unseen, randomly oriented load directions, where the model maintained exceptional accuracy. Compared to nonlinear finite element analyses that require about 19.32 minutes per case, Point-DeepONet provides predictions in mere seconds-approximately 400 times faster-while maintaining excellent scalability. These findings, validated through extensive experiments and ablation studies, highlight the potential of Point-DeepONet to enable rapid, high-fidelity structural analyses for complex engineering workflows.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108560"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReTri: Progressive domain bridging via representation disentanglement and triple-level consistency-driven feature alignment for unsupervised domain adaptive medical image segmentation 基于表示解纠缠和三级一致性驱动特征对齐的渐进式域桥接无监督域自适应医学图像分割
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108564
Xiaoru Gao , Guoyan Zheng
Unsupervised domain adaptation (UDA) in medical image segmentation presents significant challenges due to substantial cross-domain disparities and the inherent absence of target domain annotations. In this study, to address these challenges, we propose an end-to-end progressive domain bridging framework based on representation disentanglement and triple-level consistency-driven feature alignment, referred to as ReTri, that synergistically integrates a representation disentanglement-based image alignment (RDIA) module with a novel triple-level consistency-driven feature alignment (TCFA) module. In particular, the RDIA module aims to establish an initial domain bridge by decoupling and aligning fundamental visual disparities through disentangled representation learning, while the novel TCFA module hierarchically bridges remaining cross-domain semantic discrepancies and feature distribution disparities via two novel consistency-driven alignment mechanisms: 1) attention-guided semantics-level consistency alignment, where we purposely design a bi-attentive semantic feature extraction (BSFE) component coupled with an attention-adaptive semantic consistency (ASC) loss function, facilitating dynamic alignment of high-level semantic representations across domains, and 2) multi-view dual-level mixing consistency alignment, consisting of Feature-Cut consistent self-ensembling (FCCS) and Trans-Cut consistent self-ensembling (TCCS) components. These two components operate within intermediate mixing spaces to ensure robust knowledge transfer through complementary feature- and prediction-level consistency regularization. Extensive experimental evaluations are conducted on four challenging datasets (Lumbar Spine CT-MR, Cardiac CT-MR, Cross-domain Echocardiography, and Multi-center Prostate MR) across seven UDA-based segmentation scenarios and two external validation scenarios. Our framework achieves superior performance over the best state-of-the-art (SOTA) methods on following UDA-based segmentation scenarios: +2.9% DSC for spine CT → MR segmentation, +3.6% and +2.4% DSC for bidirectional cardiac CT↔MR segmentation, +1.7% and +2.3% DSC for bidirectional cross-center cross-vendor Echocardiography (CAMUS↔EchoNet-Dynamic) segmentation, and +12.2% and +12.0% DSC for bidirectional multi-center prostate MR segmentation. The source code and the datasets are publicly available at https://github.com/xiaorugao999/ReTri.
无监督域自适应(UDA)在医学图像分割中存在较大的跨域差异和缺乏目标域标注等问题。在本研究中,为了解决这些挑战,我们提出了一个基于表示解除纠缠和三级一致性驱动特征对齐的端到端渐进式领域桥接框架(ReTri),该框架将基于表示解除纠缠的图像对齐(RDIA)模块与新型三级一致性驱动的特征对齐(TCFA)模块协同集成。特别是,RDIA模块旨在通过解耦表示学习来解耦和对齐基本的视觉差异,从而建立一个初始的领域桥梁,而新颖的TCFA模块通过两种新颖的一致性驱动对齐机制分层地桥接剩余的跨领域语义差异和特征分布差异:1)注意引导语义级一致性对齐,其中设计了双注意语义特征提取(BSFE)组件和注意自适应语义一致性(ASC)损失函数,促进跨域高级语义表示的动态对齐;2)多视图双级别混合一致性对齐,由特征切割一致自集成(FCCS)和横切一致自集成(TCCS)组件组成。这两个组件在中间混合空间中运行,通过互补的特征级和预测级一致性正则化来确保稳健的知识转移。在四个具有挑战性的数据集(腰椎CT-MR,心脏CT-MR,跨域超声心动图和多中心前列腺MR)上进行了广泛的实验评估,包括七个基于uda的分割场景和两个外部验证场景。我们的框架在以下基于uda的分割方案上实现了比最先进的(SOTA)方法更优越的性能:脊柱CT → MR分割+2.9% DSC,双向心脏CT↔MR分割+3.6%和+2.4% DSC,双向跨中心跨供应商超声心动图(CAMUS↔EchoNet-Dynamic)分割+1.7%和+2.3% DSC,双向多中心前列腺MR分割+12.2%和+12.0% DSC。源代码和数据集可以在https://github.com/xiaorugao999/ReTri上公开获得。
{"title":"ReTri: Progressive domain bridging via representation disentanglement and triple-level consistency-driven feature alignment for unsupervised domain adaptive medical image segmentation","authors":"Xiaoru Gao ,&nbsp;Guoyan Zheng","doi":"10.1016/j.neunet.2026.108564","DOIUrl":"10.1016/j.neunet.2026.108564","url":null,"abstract":"<div><div>Unsupervised domain adaptation (UDA) in medical image segmentation presents significant challenges due to substantial cross-domain disparities and the inherent absence of target domain annotations. In this study, to address these challenges, we propose an end-to-end progressive domain bridging framework based on representation disentanglement and triple-level consistency-driven feature alignment, referred to as ReTri, that synergistically integrates a representation disentanglement-based image alignment (RDIA) module with a novel triple-level consistency-driven feature alignment (TCFA) module. In particular, the RDIA module aims to establish an initial domain bridge by decoupling and aligning fundamental visual disparities through disentangled representation learning, while the novel TCFA module hierarchically bridges remaining cross-domain semantic discrepancies and feature distribution disparities via two novel consistency-driven alignment mechanisms: 1) attention-guided semantics-level consistency alignment, where we purposely design a bi-attentive semantic feature extraction (BSFE) component coupled with an attention-adaptive semantic consistency (ASC) loss function, facilitating dynamic alignment of high-level semantic representations across domains, and 2) multi-view dual-level mixing consistency alignment, consisting of Feature-Cut consistent self-ensembling (FCCS) and Trans-Cut consistent self-ensembling (TCCS) components. These two components operate within intermediate mixing spaces to ensure robust knowledge transfer through complementary feature- and prediction-level consistency regularization. Extensive experimental evaluations are conducted on four challenging datasets (Lumbar Spine CT-MR, Cardiac CT-MR, Cross-domain Echocardiography, and Multi-center Prostate MR) across seven UDA-based segmentation scenarios and two external validation scenarios. Our framework achieves superior performance over the best state-of-the-art (SOTA) methods on following UDA-based segmentation scenarios: +2.9% DSC for spine CT → MR segmentation, +3.6% and +2.4% DSC for bidirectional cardiac CT↔MR segmentation, +1.7% and +2.3% DSC for bidirectional cross-center cross-vendor Echocardiography (CAMUS↔EchoNet-Dynamic) segmentation, and +12.2% and +12.0% DSC for bidirectional multi-center prostate MR segmentation. The source code and the datasets are publicly available at <span><span>https://github.com/xiaorugao999/ReTri</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108564"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MT-IDS: A multi-task information decoupling strategy for identifying lymph node metastasis in the mediastinal region MT-IDS:一种识别纵隔区淋巴结转移的多任务信息解耦策略
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108541
Wei Zhou , Yining Xie , Fengjiao Wang , Jing Zhao , Jiayi Ma
Accurately identifying the mediastinal regions where metastatic lymph nodes are located is critical for the staging diagnosis of lung cancer. This identification task involves two distinct detection dimensions: mediastinal region identification and lymph node metastasis assessment. Traditional single-task image classification algorithms struggle to manage the interference between different classification dimensions within a single task. Existing multi-task learning methods struggle to balance the relationship between shared and task-specific features, and often fail to effectively fit the underlying data distributions and task characteristics during gradient adjustment. To address these challenges, we propose a Multi-Task Information Decoupling Strategy (MT-IDS). MT-IDS decomposes the main task into multiple auxiliary tasks along different feature dimensions, forming a unified multi-task system to optimize detection performance across tasks. A Dual-control Branch Routing Gate Mechanism (DBR) is employed in MT-IDS to compute the weighting of shared and task-specific features, thereby enabling more precise expert selection and feature extraction for each task. Additionally, a Dual-Dimensional Gradient Balancing Algorithm (DD-GB) is introduced in MT-IDS, whereby gradient balance is achieved through alignment of gradient directions and dynamic scaling of magnitudes, while the distribution of inter-task gradient characteristics is maintained. The significant advantages demonstrated by MT-IDS in both ablation and comparative experiments indicate its potential as an innovative solution for multi-dimensional medical image classification problems.
准确识别转移性淋巴结所在的纵隔区域对于肺癌的分期诊断至关重要。这项鉴定任务涉及两个不同的检测维度:纵隔区域鉴定和淋巴结转移评估。传统的单任务图像分类算法难以处理单个任务中不同分类维度之间的干扰。现有的多任务学习方法难以平衡共享特征和任务特定特征之间的关系,在梯度调整过程中往往不能有效地拟合底层数据分布和任务特征。为了解决这些挑战,我们提出了一种多任务信息解耦策略(MT-IDS)。MT-IDS将主任务沿不同特征维度分解为多个辅助任务,形成统一的多任务系统,跨任务优化检测性能。MT-IDS采用双控制分支路由门机制(DBR)来计算共享特征和特定任务特征的权重,从而为每个任务提供更精确的专家选择和特征提取。此外,在MT-IDS中引入了一种二维梯度平衡算法(DD-GB),通过梯度方向对齐和幅度的动态缩放来实现梯度平衡,同时保持任务间梯度特征的分布。MT-IDS在消融和对比实验中所显示的显著优势表明,它有潜力成为多维医学图像分类问题的创新解决方案。
{"title":"MT-IDS: A multi-task information decoupling strategy for identifying lymph node metastasis in the mediastinal region","authors":"Wei Zhou ,&nbsp;Yining Xie ,&nbsp;Fengjiao Wang ,&nbsp;Jing Zhao ,&nbsp;Jiayi Ma","doi":"10.1016/j.neunet.2026.108541","DOIUrl":"10.1016/j.neunet.2026.108541","url":null,"abstract":"<div><div>Accurately identifying the mediastinal regions where metastatic lymph nodes are located is critical for the staging diagnosis of lung cancer. This identification task involves two distinct detection dimensions: mediastinal region identification and lymph node metastasis assessment. Traditional single-task image classification algorithms struggle to manage the interference between different classification dimensions within a single task. Existing multi-task learning methods struggle to balance the relationship between shared and task-specific features, and often fail to effectively fit the underlying data distributions and task characteristics during gradient adjustment. To address these challenges, we propose a Multi-Task Information Decoupling Strategy (MT-IDS). MT-IDS decomposes the main task into multiple auxiliary tasks along different feature dimensions, forming a unified multi-task system to optimize detection performance across tasks. A Dual-control Branch Routing Gate Mechanism (DBR) is employed in MT-IDS to compute the weighting of shared and task-specific features, thereby enabling more precise expert selection and feature extraction for each task. Additionally, a Dual-Dimensional Gradient Balancing Algorithm (DD-GB) is introduced in MT-IDS, whereby gradient balance is achieved through alignment of gradient directions and dynamic scaling of magnitudes, while the distribution of inter-task gradient characteristics is maintained. The significant advantages demonstrated by MT-IDS in both ablation and comparative experiments indicate its potential as an innovative solution for multi-dimensional medical image classification problems.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108541"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Taming polarized fitting: BLINEX-Pcomp with asymmetric risk penalty for robust Pcomp classification 驯服极化拟合:具有不对称风险惩罚的BLINEX-Pcomp稳健Pcomp分类
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108580
Long Tang , Xin Si , Yingjie Tian , Panos M Pardalos
As a novel paradigm for learning with inexact supervision, Pcomp classification reduces the annotation costs of training a binary classifier by using ordered pairwise samples without requiring precise labels. However, existing methods fail to fully account for sign differences in empirical risk at the level of individual sample pairs, resulting in polarized fitting where the risks of overfitting and underfitting coexist. Actually, positive and negative empirical risks indicate varying degrees of training difficulty, necessitating differentiated treatments. In this work, we propose a BLINEX-Pcomp model that employs a bounded linear-exponential function to impose distinct penalties on positive and negative risks for each sample pair. The BLINEX-Pcomp model dynamically shifts the training focus toward challenging sample pairs, well balancing pairwise-level risks of overfitting and underfitting. Additionally, a multi-view version of BLINEX-Pcomp (MV-BLINEX-Pcomp) is developed to further enhance performance by integrating multi-view features. We have theoretically verified that MV-BLINEX-Pcomp degrades to BLINEX-Pcomp when only a single view of features is available. A dual-stage solver is designed to train the MV-BLINEX-Pcomp model. Exciting numerical results from comparative experiments validate the effectiveness of our methods in tackling Pcomp classification.
作为一种新的非精确监督学习范式,Pcomp分类通过使用有序成对样本而不需要精确标记来减少训练二分类器的标注成本。然而,现有方法未能充分考虑个体样本对水平上经验风险的符号差异,导致过度拟合和欠拟合风险并存的极化拟合。实际上,正、负经验风险表明了不同程度的训练难度,需要区别对待。在这项工作中,我们提出了一个BLINEX-Pcomp模型,该模型采用有界线性指数函数对每个样本对的正风险和负风险施加不同的惩罚。BLINEX-Pcomp模型动态地将训练重点转向具有挑战性的样本对,很好地平衡了过拟合和欠拟合的成对水平风险。此外,还开发了多视图版本的BLINEX-Pcomp (MV-BLINEX-Pcomp),通过集成多视图功能进一步提高性能。我们已经从理论上验证了当只有一个特征视图可用时,MV-BLINEX-Pcomp会降级为BLINEX-Pcomp。设计了一种双级求解器来训练MV-BLINEX-Pcomp模型。对比实验的令人振奋的数值结果验证了我们的方法在处理Pcomp分类方面的有效性。
{"title":"Taming polarized fitting: BLINEX-Pcomp with asymmetric risk penalty for robust Pcomp classification","authors":"Long Tang ,&nbsp;Xin Si ,&nbsp;Yingjie Tian ,&nbsp;Panos M Pardalos","doi":"10.1016/j.neunet.2026.108580","DOIUrl":"10.1016/j.neunet.2026.108580","url":null,"abstract":"<div><div>As a novel paradigm for learning with inexact supervision, Pcomp classification reduces the annotation costs of training a binary classifier by using ordered pairwise samples without requiring precise labels. However, existing methods fail to fully account for sign differences in empirical risk at the level of individual sample pairs, resulting in polarized fitting where the risks of overfitting and underfitting coexist. Actually, positive and negative empirical risks indicate varying degrees of training difficulty, necessitating differentiated treatments. In this work, we propose a BLINEX-Pcomp model that employs a bounded linear-exponential function to impose distinct penalties on positive and negative risks for each sample pair. The BLINEX-Pcomp model dynamically shifts the training focus toward challenging sample pairs, well balancing pairwise-level risks of overfitting and underfitting. Additionally, a multi-view version of BLINEX-Pcomp (MV-BLINEX-Pcomp) is developed to further enhance performance by integrating multi-view features. We have theoretically verified that MV-BLINEX-Pcomp degrades to BLINEX-Pcomp when only a single view of features is available. A dual-stage solver is designed to train the MV-BLINEX-Pcomp model. Exciting numerical results from comparative experiments validate the effectiveness of our methods in tackling Pcomp classification.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108580"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep multi-view clustering based on instance-level adaptive structural contrastive learning 基于实例级自适应结构对比学习的深度多视图聚类
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108581
Zengbiao Yang, Yihua Tan
Deep multi-view clustering has rapidly developed in recent years, leveraging the powerful representation capabilities of deep neural networks. Among them, instance-level feature contrastive learning is widely used in deep multi-view clustering to align the embedded features of the same samples across views. However, it overlooks the constraint on the clustering structure consistency across views. Drawing inspiration from the instance-level feature contrastive learning mentioned above, we propose deep multi-view clustering based on instance-level adaptive structural contrastive learning. First, considering that different views may have varying impacts on the clustering results of different multi-view samples, we utilize Transformer Pooling module to adaptively fuse different views of different samples, obtaining the fused view. The final clustering results are derived from the fused view as well. Secondly, we propose a clue-consistency-based approach to identify sample pairs that exhibit consistent clustering structures between each view and the fused view, forming local and global consistency clustering information. Incorporating global consistency clustering information, we construct adjacency matrices for each view and the fused view. Since adjacency matrices record the clustering structure of each sample with others, we propose the instance-level adaptive structural contrastive learning, leveraging the above local consistency information to align the overall clustering structures of the same samples across different views and the fused view. By comparing the results of proposed method with several state-of-the-art methods on multiple multi-view datasets, we demonstrate the superiority of the proposed approach.
近年来,利用深度神经网络强大的表示能力,深度多视图聚类得到了迅速发展。其中,实例级特征对比学习被广泛应用于深度多视图聚类中,用于跨视图对齐相同样本的嵌入特征。但是,它忽略了跨视图集群结构一致性的约束。受上述实例级特征对比学习的启发,我们提出了基于实例级自适应结构对比学习的深度多视图聚类。首先,考虑到不同视图对不同多视图样本聚类结果的影响不同,利用Transformer Pooling模块对不同样本的不同视图进行自适应融合,得到融合视图;最后的聚类结果也是由融合视图导出的。其次,我们提出了一种基于线索一致性的方法来识别每个视图和融合视图之间具有一致聚类结构的样本对,形成局部和全局一致性聚类信息。结合全局一致性聚类信息,构造各视图和融合视图的邻接矩阵。由于邻接矩阵记录了每个样本与其他样本的聚类结构,我们提出了实例级自适应结构对比学习,利用上述局部一致性信息在不同视图和融合视图中对齐相同样本的整体聚类结构。通过将该方法与几种最新方法在多个多视图数据集上的结果进行比较,我们证明了该方法的优越性。
{"title":"Deep multi-view clustering based on instance-level adaptive structural contrastive learning","authors":"Zengbiao Yang,&nbsp;Yihua Tan","doi":"10.1016/j.neunet.2026.108581","DOIUrl":"10.1016/j.neunet.2026.108581","url":null,"abstract":"<div><div>Deep multi-view clustering has rapidly developed in recent years, leveraging the powerful representation capabilities of deep neural networks. Among them, instance-level feature contrastive learning is widely used in deep multi-view clustering to align the embedded features of the same samples across views. However, it overlooks the constraint on the clustering structure consistency across views. Drawing inspiration from the instance-level feature contrastive learning mentioned above, we propose deep multi-view clustering based on instance-level adaptive structural contrastive learning. First, considering that different views may have varying impacts on the clustering results of different multi-view samples, we utilize Transformer Pooling module to adaptively fuse different views of different samples, obtaining the fused view. The final clustering results are derived from the fused view as well. Secondly, we propose a clue-consistency-based approach to identify sample pairs that exhibit consistent clustering structures between each view and the fused view, forming local and global consistency clustering information. Incorporating global consistency clustering information, we construct adjacency matrices for each view and the fused view. Since adjacency matrices record the clustering structure of each sample with others, we propose the instance-level adaptive structural contrastive learning, leveraging the above local consistency information to align the overall clustering structures of the same samples across different views and the fused view. By comparing the results of proposed method with several state-of-the-art methods on multiple multi-view datasets, we demonstrate the superiority of the proposed approach.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108581"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Curriculum-guided divergence scheduling improves single-cell clustering robustness 课程导向的发散调度提高了单细胞聚类的鲁棒性
IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1016/j.neunet.2026.108592
Meihua Zhou , Tianlong Zheng , Baihua Wang , Xinyu Tong , Wai kin Fung , Li Yang
Deep clustering of single-cell RNA-seq data faces significant challenges due to extreme sparsity and noise. We present DAGCL (Dynamic Attention-enhanced Graph Embedding with Curriculum Learning), a dynamic graph embedding framework that reframes representation learning as a coarse-to-fine evolutionary process. Unlike conventional static paradigms, DAGCL employs a curriculum-guided scheduling mechanism that actively modulates both attention intensity and supervision stringency throughout training. This strategy aligns model complexity with feature maturity, effectively mitigating early-stage confirmation bias. To further stabilize optimization, we incorporate an entropy-regularized Sinkhorn projection that enforces globally balanced soft assignments. Extensive experiments on 27 benchmarks demonstrate that DAGCL consistently outperforms baselines in clustering accuracy and robustness. Our work establishes a principled strategy for unsupervised learning where structural constraints and supervisory pressure co-evolve with learned representations.
单细胞RNA-seq数据的深度聚类由于极端稀疏性和噪声而面临重大挑战。我们提出DAGCL(动态注意增强图嵌入课程学习),这是一个动态图嵌入框架,它将表示学习重新定义为一个从粗到精的进化过程。与传统的静态范例不同,DAGCL采用课程指导的调度机制,在整个培训过程中积极调节注意力强度和监督严格性。该策略将模型复杂性与特征成熟度结合起来,有效地减轻了早期阶段的确认偏差。为了进一步稳定优化,我们结合了一个熵正则化的Sinkhorn投影,强制全局平衡软分配。在27个基准测试上进行的大量实验表明,DAGCL在聚类准确性和鲁棒性方面始终优于基线。我们的工作为无监督学习建立了一个原则性策略,其中结构约束和监督压力与学习表征共同演变。
{"title":"Curriculum-guided divergence scheduling improves single-cell clustering robustness","authors":"Meihua Zhou ,&nbsp;Tianlong Zheng ,&nbsp;Baihua Wang ,&nbsp;Xinyu Tong ,&nbsp;Wai kin Fung ,&nbsp;Li Yang","doi":"10.1016/j.neunet.2026.108592","DOIUrl":"10.1016/j.neunet.2026.108592","url":null,"abstract":"<div><div>Deep clustering of single-cell RNA-seq data faces significant challenges due to extreme sparsity and noise. We present DAGCL (Dynamic Attention-enhanced Graph Embedding with Curriculum Learning), a dynamic graph embedding framework that reframes representation learning as a coarse-to-fine evolutionary process. Unlike conventional static paradigms, DAGCL employs a curriculum-guided scheduling mechanism that actively modulates both attention intensity and supervision stringency throughout training. This strategy aligns model complexity with feature maturity, effectively mitigating early-stage confirmation bias. To further stabilize optimization, we incorporate an entropy-regularized Sinkhorn projection that enforces globally balanced soft assignments. Extensive experiments on 27 benchmarks demonstrate that DAGCL consistently outperforms baselines in clustering accuracy and robustness. Our work establishes a principled strategy for unsupervised learning where structural constraints and supervisory pressure co-evolve with learned representations.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"198 ","pages":"Article 108592"},"PeriodicalIF":6.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1