Residual-like multi-kernel block and dynamic attention for deep neural networks

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2025-03-04 DOI:10.1016/j.engappai.2025.110456
Hanxiang Wang , Yanfen Li , Tan N. Nguyen , L. Minh Dang
{"title":"Residual-like multi-kernel block and dynamic attention for deep neural networks","authors":"Hanxiang Wang ,&nbsp;Yanfen Li ,&nbsp;Tan N. Nguyen ,&nbsp;L. Minh Dang","doi":"10.1016/j.engappai.2025.110456","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional network architectures struggled with a uniform approach to receptive field (RF) sizes, leading to suboptimal performance across scales. Although recent advances have addressed the problem by utilizing different RF sizes, a balance between accuracy and complexity remains elusive. In addition, the existing group attention mechanism that simply uses the squeeze-and-excitation method neglects the spatial position information in the feature selection and fusion process. Therefore, this research introduces a lightweight and efficient architecture named Split-Dense Adaptive Network (SDANet) to cope with these limitations. In the proposed network, a residual-like multi-kernel method is implemented to enable better feature extraction under diverse RF sizes. Next, a new grouped attention module processes features dynamically and highlight the location information. Also, the constructed feature augmentation structure strengthens the model's representation. Furthermore, a new channel split and merge strategy is utilized for computation reduction. Compared with state-of-the-art methods, our model achieved better generalization ability, less computational complexity, and superior precision based on various public datasets. The introduced network shows a promising general applicability in the field of computer vision, and further inspires research on supervised deep learning.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"148 ","pages":"Article 110456"},"PeriodicalIF":8.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625004567","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional network architectures struggled with a uniform approach to receptive field (RF) sizes, leading to suboptimal performance across scales. Although recent advances have addressed the problem by utilizing different RF sizes, a balance between accuracy and complexity remains elusive. In addition, the existing group attention mechanism that simply uses the squeeze-and-excitation method neglects the spatial position information in the feature selection and fusion process. Therefore, this research introduces a lightweight and efficient architecture named Split-Dense Adaptive Network (SDANet) to cope with these limitations. In the proposed network, a residual-like multi-kernel method is implemented to enable better feature extraction under diverse RF sizes. Next, a new grouped attention module processes features dynamically and highlight the location information. Also, the constructed feature augmentation structure strengthens the model's representation. Furthermore, a new channel split and merge strategy is utilized for computation reduction. Compared with state-of-the-art methods, our model achieved better generalization ability, less computational complexity, and superior precision based on various public datasets. The introduced network shows a promising general applicability in the field of computer vision, and further inspires research on supervised deep learning.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
深度神经网络的类残差多核块与动态关注
传统的网络架构难以采用统一的接受场(RF)大小方法,导致跨规模的性能不理想。尽管最近的进展已经通过使用不同的射频尺寸解决了这个问题,但准确性和复杂性之间的平衡仍然难以捉摸。此外,现有的群体注意机制在特征选择和融合过程中,单纯采用挤压激励方法,忽略了空间位置信息。因此,本研究引入了一种轻量级和高效的架构,称为分裂密集自适应网络(SDANet)来应对这些限制。在所提出的网络中,实现了一种类残差多核方法,以便在不同RF尺寸下更好地提取特征。接下来,一个新的分组注意模块动态处理特征并突出显示位置信息。同时,构造的特征增强结构增强了模型的表征性。此外,为了减少计算量,采用了一种新的信道分割合并策略。与现有方法相比,该模型具有更好的泛化能力、更低的计算复杂度和更高的精度。所引入的网络在计算机视觉领域显示出很好的普遍适用性,并进一步激发了监督深度学习的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
期刊最新文献
Multiphysics response and internal leakage prediction of seismic hydraulic systems considering structural clearance effects Machine learning-based prediction of ductility of strain-hardening fiber-reinforced cementitious composites Neighborhood constrained attention for lightweight image super-resolution A quantum group decision-making model for patient-capital project selection integrating cumulative prospect theory under linear Diophantine fuzzy uncertainty Forecast-enhanced bilevel real-time pricing for microgrids via hybrid-action reinforcement learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1