GACFNet: A global attention cross-level feature fusion network for aerial image object detection

IF 4.9 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Computers & Electrical Engineering Pub Date : 2025-04-01 Epub Date: 2025-01-06 DOI:10.1016/j.compeleceng.2024.110042
Xingzhu Liang , Mengyuan Li , Yu-e Lin , Xianjin Fang
{"title":"GACFNet: A global attention cross-level feature fusion network for aerial image object detection","authors":"Xingzhu Liang ,&nbsp;Mengyuan Li ,&nbsp;Yu-e Lin ,&nbsp;Xianjin Fang","doi":"10.1016/j.compeleceng.2024.110042","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time object detection in aerial images is challenging, primarily due to small and densely packed objects, accompanied by significant scale variations. Previous methods have addressed these issues by employing fusion structures similar to feature pyramid networks. However, these fusion structures overlook the complementary relationship between feature information from non-adjacent layers. To tackle this, we propose a global attention cross-layer feature fusion network (GACFNet). Firstly, we design a global attention cross-layer feature fusion (GACF) module, which obtains global information by fusing features at different scales, using the attention mechanism to highlight foreground information in the global feature map. Additionally, we connect the global attention feature map with other layers to establish correlations between non-adjacent layers. Secondly, a large-kernel separable pooling pyramid fusion (LKSPPF) module is proposed to capture a wider receptive field and enhance context information. Thirdly, to better preserve small object information in low-resolution feature maps, we improve the cross-stage partial fusion module (C2f) of the baseline using a deformable convolution technique (DCNv2). Finally, we design a hybrid regression function (NGIoU loss) to improve object localization and sample allocation in aerial images while accelerating model convergence. Extensive experiments were conducted on three publicly available aerial image datasets. The experimental results show that the method significantly improves the accuracy of object detection in aerial images. The average precision (<span><math><mrow><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></mrow></math></span>) of the three datasets reaches 52.7%, 81.8%, and 33.0%, respectively, while a real-time performance of 69.9 frames per second is achieved. The code will be available online <span><span>https://github.com/JSJ515-Group/GACFNet/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"123 ","pages":"Article 110042"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790624009674","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Real-time object detection in aerial images is challenging, primarily due to small and densely packed objects, accompanied by significant scale variations. Previous methods have addressed these issues by employing fusion structures similar to feature pyramid networks. However, these fusion structures overlook the complementary relationship between feature information from non-adjacent layers. To tackle this, we propose a global attention cross-layer feature fusion network (GACFNet). Firstly, we design a global attention cross-layer feature fusion (GACF) module, which obtains global information by fusing features at different scales, using the attention mechanism to highlight foreground information in the global feature map. Additionally, we connect the global attention feature map with other layers to establish correlations between non-adjacent layers. Secondly, a large-kernel separable pooling pyramid fusion (LKSPPF) module is proposed to capture a wider receptive field and enhance context information. Thirdly, to better preserve small object information in low-resolution feature maps, we improve the cross-stage partial fusion module (C2f) of the baseline using a deformable convolution technique (DCNv2). Finally, we design a hybrid regression function (NGIoU loss) to improve object localization and sample allocation in aerial images while accelerating model convergence. Extensive experiments were conducted on three publicly available aerial image datasets. The experimental results show that the method significantly improves the accuracy of object detection in aerial images. The average precision (AP50) of the three datasets reaches 52.7%, 81.8%, and 33.0%, respectively, while a real-time performance of 69.9 frames per second is achieved. The code will be available online https://github.com/JSJ515-Group/GACFNet/.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GACFNet:一种用于航空图像目标检测的全局关注跨层特征融合网络
航空图像中的实时目标检测具有挑战性,主要是由于小而密集的物体,伴随着显著的尺度变化。以前的方法通过采用类似于特征金字塔网络的融合结构来解决这些问题。然而,这些融合结构忽略了非相邻层特征信息之间的互补关系。为了解决这个问题,我们提出了一个全局注意力跨层特征融合网络(GACFNet)。首先,设计了全局关注跨层特征融合(GACF)模块,通过融合不同尺度的特征获取全局信息,利用关注机制突出全局特征图中的前景信息;此外,我们将全局注意力特征图与其他层连接起来,以建立非相邻层之间的相关性。其次,提出了一种大核可分离池金字塔融合(LKSPPF)模块,以捕获更广泛的接受场并增强上下文信息。第三,为了更好地保留低分辨率特征图中的小目标信息,我们使用可变形卷积技术(DCNv2)改进了基线的跨阶段部分融合模块(C2f)。最后,我们设计了一个混合回归函数(NGIoU loss)来提高航测图像的目标定位和样本分配,同时加速模型收敛。在三个公开的航空图像数据集上进行了广泛的实验。实验结果表明,该方法显著提高了航拍图像中目标检测的精度。三组数据集的平均精度(AP50)分别达到52.7%、81.8%和33.0%,实时性达到69.9帧/秒。代码将在网上提供https://github.com/JSJ515-Group/GACFNet/。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
期刊最新文献
EVerGen: Optimal path planning for electric vehicle using modified genetic algorithm in internet of vehicular things SRAM PUF-based logic locking framework for IoT authentication and IP protection Neuroauthnet: A brainwave-based authentication framework using BCI and deep learning for privacy-preserving identity verification A novel hybrid gated recurrent ensemble model for stage-based clinical diagnosis of neurological disorders Chaotic atom search optimized fractional-order proportional-integral-derivative control for energy-efficient electric vehicles regenerative braking
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1