Residual-like multi-kernel block and dynamic attention for deep neural networks

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2025-03-04 DOI:10.1016/j.engappai.2025.110456

Hanxiang Wang , Yanfen Li , Tan N. Nguyen , L. Minh Dang

{"title":"Residual-like multi-kernel block and dynamic attention for deep neural networks","authors":"Hanxiang Wang , Yanfen Li , Tan N. Nguyen , L. Minh Dang","doi":"10.1016/j.engappai.2025.110456","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional network architectures struggled with a uniform approach to receptive field (RF) sizes, leading to suboptimal performance across scales. Although recent advances have addressed the problem by utilizing different RF sizes, a balance between accuracy and complexity remains elusive. In addition, the existing group attention mechanism that simply uses the squeeze-and-excitation method neglects the spatial position information in the feature selection and fusion process. Therefore, this research introduces a lightweight and efficient architecture named Split-Dense Adaptive Network (SDANet) to cope with these limitations. In the proposed network, a residual-like multi-kernel method is implemented to enable better feature extraction under diverse RF sizes. Next, a new grouped attention module processes features dynamically and highlight the location information. Also, the constructed feature augmentation structure strengthens the model's representation. Furthermore, a new channel split and merge strategy is utilized for computation reduction. Compared with state-of-the-art methods, our model achieved better generalization ability, less computational complexity, and superior precision based on various public datasets. The introduced network shows a promising general applicability in the field of computer vision, and further inspires research on supervised deep learning.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"148 ","pages":"Article 110456"},"PeriodicalIF":8.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625004567","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional network architectures struggled with a uniform approach to receptive field (RF) sizes, leading to suboptimal performance across scales. Although recent advances have addressed the problem by utilizing different RF sizes, a balance between accuracy and complexity remains elusive. In addition, the existing group attention mechanism that simply uses the squeeze-and-excitation method neglects the spatial position information in the feature selection and fusion process. Therefore, this research introduces a lightweight and efficient architecture named Split-Dense Adaptive Network (SDANet) to cope with these limitations. In the proposed network, a residual-like multi-kernel method is implemented to enable better feature extraction under diverse RF sizes. Next, a new grouped attention module processes features dynamically and highlight the location information. Also, the constructed feature augmentation structure strengthens the model's representation. Furthermore, a new channel split and merge strategy is utilized for computation reduction. Compared with state-of-the-art methods, our model achieved better generalization ability, less computational complexity, and superior precision based on various public datasets. The introduced network shows a promising general applicability in the field of computer vision, and further inspires research on supervised deep learning.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

深度神经网络的类残差多核块与动态关注

传统的网络架构难以采用统一的接受场（RF）大小方法，导致跨规模的性能不理想。尽管最近的进展已经通过使用不同的射频尺寸解决了这个问题，但准确性和复杂性之间的平衡仍然难以捉摸。此外，现有的群体注意机制在特征选择和融合过程中，单纯采用挤压激励方法，忽略了空间位置信息。因此，本研究引入了一种轻量级和高效的架构，称为分裂密集自适应网络（SDANet）来应对这些限制。在所提出的网络中，实现了一种类残差多核方法，以便在不同RF尺寸下更好地提取特征。接下来，一个新的分组注意模块动态处理特征并突出显示位置信息。同时，构造的特征增强结构增强了模型的表征性。此外，为了减少计算量，采用了一种新的信道分割合并策略。与现有方法相比，该模型具有更好的泛化能力、更低的计算复杂度和更高的精度。所引入的网络在计算机视觉领域显示出很好的普遍适用性，并进一步激发了监督深度学习的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.