This article investigates a novel policy-adjustable Q-learning (PA-QL) algorithm aimed at addressing the optimal tracking control (OTC) problem for nonlinear discrete-time (DT) systems with enhanced adaptability and flexibility. A novel iteration scheme is developed that integrates the control weights into the augmented neural network (NN) input, thereby reformulating the learning process to explicitly characterize the optimal policy as a function of the adjustable weights. Consequently, the learned control policy is not constrained by predetermined weights, allowing for dynamic adjustment after offline training is completed. Moreover, such adjustments can be performed online seamlessly, offering substantially greater flexibility in adapting to changes in operating conditions or control objectives. Finally, the effectiveness of the proposed algorithm is established through rigorous theoretical analysis and further validated by simulation studies.
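The core mechanism described here, feeding the adjustable cost weights into the critic alongside the state, can be illustrated with a minimal numerical sketch. All dimensions, the one-hidden-layer critic, and the quadratic stage cost below are illustrative assumptions, not the article's implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def q_network(aug_input, W1, W2):
    # One-hidden-layer critic: tanh hidden units, linear output.
    return np.tanh(aug_input @ W1) @ W2

def augmented_input(x, ref, u, weights):
    # Key idea of PA-QL as the abstract describes it: the cost weights
    # are part of the network input, so one trained critic covers a
    # whole family of cost functions.
    return np.concatenate([x, ref, u, weights])

# Hypothetical dimensions: 2-D state, 2-D reference, 1-D control, and
# 2 adjustable weights (state-error weight q, control weight r).
dim_in, dim_h = 2 + 2 + 1 + 2, 16
W1 = rng.normal(scale=0.1, size=(dim_in, dim_h))
W2 = rng.normal(scale=0.1, size=(dim_h, 1))

x, ref, u = np.array([0.5, -0.2]), np.array([1.0, 0.0]), np.array([0.1])
for weights in (np.array([1.0, 0.1]), np.array([1.0, 10.0])):
    q, r = weights
    stage_cost = q * np.sum((x - ref) ** 2) + r * np.sum(u ** 2)
    gamma = 0.95
    # Bellman-style target for the tracking Q-function; the
    # continuation value is stubbed with the same critic evaluated at
    # the current augmented input, purely for brevity.
    target = stage_cost + gamma * q_network(
        augmented_input(x, ref, u, weights), W1, W2)[0]
    print(weights, float(target))
```

Changing `weights` after training requires no retraining in this scheme; only the critic's input changes, which is the adjustability the abstract emphasizes.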
Title: Policy-Adjustable Q-Learning for Data-Driven Nonlinear Optimal Tracking Control
Authors: Jiaoyuan Chen, Dawei Gong, Yuyang Zhao, Shijie Song, Minglei Zhu
DOI: 10.1109/tnnls.2026.3672136
Journal: IEEE Transactions on Neural Networks and Learning Systems (IF 10.4)
Pub Date: 2026-03-16
Pub Date: 2026-03-16 | DOI: 10.1109/tnnls.2026.3670269
Qifen Yang,Yuhui Deng,Jiande Huang,Lijuan Lu,Peng Zhou,Geyong Min
Ensuring model fairness by preventing potential biases based on any sensitive attribute is crucial for the societal acceptance of artificial intelligence in critical applications. Among various fairness concepts, counterfactual fairness has gained prominence as it is grounded in causal inference. This concept requires that an individual's prediction in the original world remains consistent with that in the counterfactual world where the sensitive feature value is modified. In this article, we aim to mitigate counterfactual biases of the model through causal intervention. Specifically, we first achieve effective causal intervention and counterfactual generation by proposing the causal inference tabular generative adversarial network (CITGAN) architecture. Unlike prior approaches based on variational autoencoders (VAEs) that inherently lack structural causal model (SCM) fidelity due to simultaneous generation, CITGAN strictly enforces causal consistency via an end-to-end topological generation process. By integrating exogenous variable inference with sequential generation, CITGAN ensures that functional dependencies are structurally preserved. Building on CITGAN, we propose the CIRCUS framework, a causal intervention-based framework designed to intuitively enhance the counterfactual fairness of trained classifiers. CIRCUS generates counterfactually discriminatory samples (CDSs) via causal intervention, guided by gradients and feature contributions, and subsequently applies bias correction preprocessing to their labels for classifier retraining. Experimental results demonstrate that CIRCUS effectively enhances counterfactual fairness while maintaining robust classification performance. Specifically, for the deep neural network (DNN) model, the $\text{MMD}_{\text{L}}$ and $\text{MMD}_{\text{K}}$ values are reduced by averages of 39.7% and 40.4%, respectively, compared with the second-best result. For the residual network (ResNet) model, these reductions amount to 56.7% and 54.5%, respectively.
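As a rough illustration of the MMD metrics reported above, the sketch below computes linear- and RBF-kernel maximum mean discrepancy estimates between model predictions on factual and counterfactual inputs. The sample data, kernel bandwidth, and estimator form are invented for illustration and are not the article's exact evaluation protocol:

```python
import numpy as np

def mmd_linear(p, q):
    # Linear-kernel MMD between two 1-D prediction samples reduces to
    # the absolute difference of sample means.
    return float(np.abs(p.mean() - q.mean()))

def mmd_rbf(p, q, sigma=1.0):
    # Biased RBF-kernel MMD^2 estimate.
    def k(a, b):
        return np.exp(-(a[:, None] - b[None, :]) ** 2 / (2 * sigma ** 2))
    return float(k(p, p).mean() + k(q, q).mean() - 2 * k(p, q).mean())

rng = np.random.default_rng(1)
# Hypothetical model outputs on factual vs. counterfactual inputs
# (sensitive attribute flipped); a counterfactually fair model drives
# both metrics toward zero.
pred_factual = rng.normal(0.6, 0.1, 500)
pred_counterfactual = rng.normal(0.4, 0.1, 500)
print(mmd_linear(pred_factual, pred_counterfactual))
print(mmd_rbf(pred_factual, pred_counterfactual))
```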
Title: CIRCUS: A Causal Intervention-Based Framework for Enhancing Counterfactual Fairness in Trained Classifiers
Pub Date: 2026-03-13 | DOI: 10.1109/tnnls.2026.3672118
Junpei Yang,Zhan Li,Weibing Li
Time-varying quadratic programming (TVQP) problems pose a challenging issue in a wide range of engineering applications, frequently incorporating equality, inequality, and bound constraints. By integrating a nonlinear complementarity problem (NCP) formulation, zeroing neural networks (ZNNs) can be generalized to solve TVQP problems effectively with such a full set of constraints. However, three issues may limit this approach's computational efficiency and practical applicability: 1) the inherent formulation of NCP functions for handling equality/inequality and boundary constraints substantially expands the dimensions of the coefficient matrices/vectors and solution-space variables; 2) the conventional ZNN (CZNN) framework inevitably necessitates matrix inversion operations for real-time solutions; and 3) insufficient robustness against noise interference compromises solution accuracy in practical implementations. To overcome these issues and improve solution performance, this article develops an enhanced lower dimension NCP low-computational-complexity (LCC) ZNN (ELNCP-LCCZNN) model for solving TVQP problems with time-varying equality, inequality, and variable boundary constraints. An ELNCP function is designed to reduce the matrix/vector coefficients of the model, and the LCCZNN model is utilized to construct a new dynamic model that eliminates the need for matrix inversion during solution. Furthermore, a nonlinear activation function is incorporated to guarantee predefined-time convergence and strengthen robustness against noise. The theoretical properties of the proposed ELNCP-LCCZNN model are validated through numerical simulations and experiments on robotic manipulator kinematic control. The results corroborate the analysis and demonstrate improved computational efficiency, enhanced noise robustness, and practical implementability compared with existing approaches.
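For context, the conventional ZNN design that the article improves upon can be sketched as follows for a time-varying linear system A(t)x = b(t): with error e = Ax - b and design rule de/dt = -gamma*e, the state update requires exactly the matrix solve that the proposed LCCZNN model is said to avoid. The specific A(t), b(t), gain, and Euler discretization are illustrative choices:

```python
import numpy as np

def cznn_step(x, t, dt, gamma=10.0):
    # Conventional ZNN for A(t) x = b(t): from de/dt = -gamma * e one
    # obtains A dx/dt = -dA/dt x + db/dt - gamma e, solved here with a
    # linear solve per step (the inversion burden the abstract notes).
    A = np.array([[2 + np.sin(t), 0.2], [0.2, 2 + np.cos(t)]])
    dA = np.array([[np.cos(t), 0.0], [0.0, -np.sin(t)]])
    b = np.array([np.sin(t), np.cos(t)])
    db = np.array([np.cos(t), -np.sin(t)])
    e = A @ x - b
    dx = np.linalg.solve(A, -dA @ x + db - gamma * e)
    return x + dt * dx, np.linalg.norm(e)

x, dt = np.zeros(2), 1e-3
for k in range(5000):
    x, err = cznn_step(x, k * dt, dt)
print(err)  # residual norm after 5 s of simulated time
```

The exponential error decay enforced by the design rule is what predefined-time activation functions, as in the article, replace with convergence in a user-specified time.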
Title: An Enhanced Low-Computational-Complexity Predefined-Time Convergent Zeroing Neural Network for Constrained Time-Varying Quadratic Programming With Kinematic Control of Robotic Manipulator
Pub Date: 2026-03-13 | DOI: 10.1109/tnnls.2026.3670947
Qinglong Chen,Fei Zhu
Most existing adversarial training methods in reinforcement learning (RL) offer limited robustness and remain vulnerable to novel attacks. To address this limitation, we propose robust RL via leveraging historically optimal policy with regulation of performance (HORP), an approach that enhances policy robustness by using the historically optimal policy to guide policy optimization and by generating diverse adversarial perturbations. Unlike other approaches that rely solely on trial-and-error interactions, HORP constructs a guidance value function by simultaneously considering value gaps and policy distribution divergence, thereby focusing prioritized learning on promising action spaces. It also incorporates an adaptive performance-aware optimization mechanism that triggers timely corrections, preventing the agent from deviating from optimal performance. Furthermore, HORP dynamically modulates perturbation entropy through controlled uncertainty injection, thereby improving the agent's generalized defensive capabilities. Experiments demonstrate that HORP achieves superior performance in most cases regarding both natural performance and robustness against various state attacks.
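One plausible reading of the guidance value function described above, combining a value gap with a policy-distribution divergence, can be sketched as follows. The functional form, the choice of KL divergence, and all numbers are assumptions for illustration, not HORP's actual definition:

```python
import numpy as np

def guidance_value(q_current, q_best, pi_current, pi_best, beta=1.0):
    # Sketch of a guidance signal in the spirit of the abstract:
    # combine the value gap to the historically best policy with the
    # divergence between current and best action distributions.
    value_gap = q_best - q_current
    kl = np.sum(pi_best * np.log((pi_best + 1e-12) / (pi_current + 1e-12)))
    return value_gap + beta * kl

# Hypothetical 3-action distributions and Q-value estimates.
pi_best = np.array([0.7, 0.2, 0.1])
pi_current = np.array([0.3, 0.4, 0.3])
print(guidance_value(q_current=1.2, q_best=1.5,
                     pi_current=pi_current, pi_best=pi_best))
```

A signal of this shape is large when the current policy both underperforms and disagrees with the historical optimum, which matches the abstract's stated intent of steering learning toward promising action spaces.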
Title: Robust Reinforcement Learning via Leveraging Historically Optimal Policy With Regulation of Performance
Pub Date: 2026-03-12 | DOI: 10.1109/tnnls.2026.3670013
Yan Zhong,Lei Ma,Xiaomeng Yan
Decentralized federated learning (DFL) has gained significant attention as a framework for analyzing large-scale data distributed across multiple sites, where communication between sites is constrained by a decentralized graph structure. Due to privacy concerns and high communication costs, reducing the number of communication rounds in DFL has become an important area of research. This article investigates the effects of the decentralized graph's topology on the convergence rate of DFL algorithms and introduces a novel tensor-based multiple-gossip-steps (T-MGS) method to optimize communication efficiency from the topology perspective. The core idea of this method is to use gossip tensors to guide the information flow between sites and enable dynamic adjustments to the transmitted content at each communication step without increasing its volume. The proposed method minimizes the second-largest absolute eigenvalue of the equivalent gossip matrix, a key factor influencing convergence speed. Experimental results on both simulated and real datasets demonstrate that the proposed T-MGS outperforms existing strategies in terms of communication efficiency, reducing the number of communication rounds without compromising model accuracy.
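The quantity being minimized, the second-largest absolute eigenvalue of the gossip matrix, is straightforward to compute. The sketch below evaluates it for a uniform-weight ring topology as a baseline; the topology and weights are illustrative and are not the article's T-MGS construction:

```python
import numpy as np

def second_largest_abs_eigenvalue(W):
    # W: symmetric doubly stochastic gossip matrix; the magnitude of
    # its second-largest eigenvalue governs gossip convergence speed.
    eigvals = np.sort(np.abs(np.linalg.eigvalsh(W)))[::-1]
    return eigvals[1]

def ring_gossip_matrix(n, w=1 / 3):
    # Uniform-weight ring: each node averages with its two neighbors.
    W = np.eye(n) * (1 - 2 * w)
    for i in range(n):
        W[i, (i + 1) % n] += w
        W[i, (i - 1) % n] += w
    return W

W = ring_gossip_matrix(8)
lam2 = second_largest_abs_eigenvalue(W)
# Running k gossip steps shrinks the consensus error by a factor of
# lam2 per step, so k steps act like one matrix with eigenvalue lam2**k.
print(lam2, lam2 ** 3)
```

This makes concrete why shrinking this eigenvalue (here, by multiple gossip steps or by a better topology) directly reduces the number of communication rounds needed for a given accuracy.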
Title: Topology-Optimal Multiple Gossip Steps for Decentralized Federated Learning via Gossip Tensor
Pub Date: 2026-03-12 | DOI: 10.1109/tnnls.2026.3670396
Jun Fu,Yupeng Liu,Qi Wang,Meiting Pan,Tonglei Cheng
Stimulated Raman scattering (SRS) plays a pivotal role in applications such as optical communications, fiber optic sensing, and spectral analysis. However, traditional modeling methods like the split-step Fourier method (SSFM) are computationally demanding. In response to these challenges, we propose a novel deep learning framework based on a hybrid neural network, specifically architected to capture the complex spatio-temporal dependencies inherent in nonlinear pulse propagation. Our model offers rapid and precise predictions of SRS behavior, alleviating the need for computationally expensive simulations like SSFM. To validate the model's performance, we conducted experiments using chalcogenide microstructured optical fibers (MOFs), which are attracting attention due to their high Raman gain coefficient and wide spectral range in the mid-infrared (MIR) region. Specifically, we demonstrate the first successful generation of MIR SRS in a 2-$\mu$m direct-pumped suspended-core As$_2$S$_3$ MOF, which provides a crucial real-world dataset for model validation. The results demonstrate that our hybrid neural network is 116 times faster on a GPU and 44 times faster on a CPU compared to SSFM while maintaining accuracy and generalization. This significant acceleration paves the way for real-time analysis and inverse design of nonlinear photonic systems, tasks previously intractable with traditional methods.
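To see why SSFM is costly, here is a minimal single step of the symmetric split-step Fourier method for the scalar nonlinear Schrödinger equation, alternating dispersion in the frequency domain with Kerr nonlinearity in the time domain; thousands of such steps per pulse are typical. A faithful SRS model would add a Raman response term, and all fiber parameters below are placeholders, not values from the article:

```python
import numpy as np

def ssfm_step(A, dz, dt, beta2, gamma_nl):
    # One symmetric split-step: half dispersion, full nonlinearity,
    # half dispersion. Both operators are unimodular phases, so the
    # pulse energy is conserved to machine precision.
    w = 2 * np.pi * np.fft.fftfreq(A.size, d=dt)
    half_disp = np.exp(0.5j * (beta2 / 2) * w ** 2 * dz)
    A = np.fft.ifft(half_disp * np.fft.fft(A))
    A = A * np.exp(1j * gamma_nl * np.abs(A) ** 2 * dz)
    A = np.fft.ifft(half_disp * np.fft.fft(A))
    return A

# Hypothetical normalized units: sech input pulse, anomalous dispersion.
t = np.linspace(-10, 10, 1024)
A = 1.0 / np.cosh(t)
for _ in range(100):
    A = ssfm_step(A, dz=1e-3, dt=t[1] - t[0], beta2=-1.0, gamma_nl=1.0)
print(np.max(np.abs(A)))
```

Each step costs two FFT pairs on the full time grid, which is the per-step work the article's neural surrogate amortizes away.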
Title: A Deep Learning Approach for Dynamic Modeling of Stimulated Raman Scattering in Chalcogenide Microstructured Optical Fibers
Sharpness-aware minimization (SAM) enhances generalization by minimizing max-sharpness (MaxS). Despite its practical success, we empirically found that the MaxS behind SAM's generalization enhancements faces the "flatness indicator problem" (FIP), where SAM only considers the flatness in the direction of gradient ascent. This leads to high Hessian eigenvalues for the deep neural network (DNN), indicating insufficient flatness in the solution region. A better flatness indicator (FI) would lower these Hessian eigenvalues, resulting in a flatter minimum and improved generalization of the network, but SAM, being inherently a greedy search method, cannot provide one. In this article, we propose to utilize the difference between the training loss and the minimum loss over the neighborhood surrounding the current weight, which we denote as min-sharpness (MinS). By merging MaxS and MinS, we create a better FI that indicates a flatter direction during optimization. Specifically, we combine this FI with SAM into the proposed bilateral SAM (BSAM), which finds a flatter minimum than SAM. The theoretical analysis demonstrates that BSAM converges to a local minimum. Extensive experiments demonstrate that BSAM offers superior generalization performance and robustness compared to vanilla SAM across various tasks, i.e., classification, transfer learning, human pose estimation, semantic segmentation, and network quantization.
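The MaxS/MinS decomposition can be made concrete with a toy sketch on a 2-D quadratic. The loss, the one-sided approximation of MinS along the negative-gradient direction, and the SAM-style update are simplifying assumptions, not the article's exact BSAM algorithm:

```python
import numpy as np

def loss(w):
    # Toy 2-D loss: mild curvature in w[0], sharp curvature in w[1].
    return 0.5 * w[0] ** 2 + 5.0 * w[1] ** 2

def grad(w):
    return np.array([w[0], 10.0 * w[1]])

def bsam_step(w, lr=0.05, rho=0.1):
    g = grad(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)
    # Max-sharpness (SAM): loss rise at the adversarial point w + eps.
    max_s = loss(w + eps) - loss(w)
    # Min-sharpness (sketch of the article's MinS): loss drop toward
    # the best nearby point, crudely approximated along -eps.
    min_s = loss(w) - loss(w - eps)
    # Bilateral flatness indicator MaxS + MinS; the parameter update
    # itself follows SAM (gradient taken at the ascent point).
    w_new = w - lr * grad(w + eps)
    return w_new, max_s + min_s

w = np.array([2.0, 2.0])
for _ in range(200):
    w, fi = bsam_step(w)
print(loss(w), fi)
```

On this surface the bilateral indicator `fi` is dominated by the sharp w[1] direction, illustrating how a two-sided probe detects curvature that a single ascent direction can miss.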
Title: Bilateral Sharpness-Aware Minimization for Flatter Minima
Authors: Jiaxin Deng, Junbiao Pang, Baochang Zhang, Qingming Huang
DOI: 10.1109/tnnls.2026.3671361
Pub Date: 2026-03-12