General Optimization Methods for YOLO Series Object Detection in Remote Sensing Images

IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Signal Processing Letters Pub Date : 2024-09-27 DOI:10.1109/LSP.2024.3469787
Guozheng Nan;Yue Zhao;Chengxing Lin;Qiaolin Ye
{"title":"General Optimization Methods for YOLO Series Object Detection in Remote Sensing Images","authors":"Guozheng Nan;Yue Zhao;Chengxing Lin;Qiaolin Ye","doi":"10.1109/LSP.2024.3469787","DOIUrl":null,"url":null,"abstract":"The You Only Look Once (YOLO) series of object detection algorithms has attracted considerable attention for its notable advantages in speed and accuracy, resulting in widespread applications in various real-world scenarios. However, achieving outstanding accuracy on remote sensing images with densely arranged small targets and complex backgrounds remains a challenging task. To address this issue, this letter proposes two easily integrated modules suitable for the YOLO architecture, namely global semantic information extraction (GSIE) and adaptive feature fusion (AFF). The GSIE module is designed to overcome the limitation of local information in traditional methods and facilitate global semantic information interaction by introducing multi-angle feature rotation to extend the receptive field. The AFF module effectively captures fine-grained features of objects by dynamically adjusting fusion weights, thereby reducing the loss of deep semantic information during feature transfer and fusion. The experimental results on the VEDAI and LEVIR remote sensing datasets demonstrate that when embedding these two modules into YOLO series algorithms that only use the small-scale detector, there is a significant improvement in performance while reducing computational complexity.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10697274/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The You Only Look Once (YOLO) series of object detection algorithms has attracted considerable attention for its notable advantages in speed and accuracy, resulting in widespread applications in various real-world scenarios. However, achieving outstanding accuracy on remote sensing images with densely arranged small targets and complex backgrounds remains a challenging task. To address this issue, this letter proposes two easily integrated modules suitable for the YOLO architecture, namely global semantic information extraction (GSIE) and adaptive feature fusion (AFF). The GSIE module is designed to overcome the limitation of local information in traditional methods and facilitate global semantic information interaction by introducing multi-angle feature rotation to extend the receptive field. The AFF module effectively captures fine-grained features of objects by dynamically adjusting fusion weights, thereby reducing the loss of deep semantic information during feature transfer and fusion. The experimental results on the VEDAI and LEVIR remote sensing datasets demonstrate that when embedding these two modules into YOLO series algorithms that only use the small-scale detector, there is a significant improvement in performance while reducing computational complexity.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
遥感图像中 YOLO 系列物体检测的一般优化方法
只看一次(YOLO)系列物体检测算法因其在速度和精度方面的显著优势而备受关注,并在现实世界的各种场景中得到了广泛应用。然而,在具有密集排列的小目标和复杂背景的遥感图像上实现出色的精度仍然是一项具有挑战性的任务。针对这一问题,本文提出了适合 YOLO 架构的两个易于集成的模块,即全局语义信息提取(GSIE)和自适应特征融合(AFF)。GSIE 模块旨在克服传统方法中局部信息的局限性,通过引入多角度特征旋转来扩展感受野,从而促进全局语义信息的交互。AFF 模块通过动态调整融合权重,有效捕捉物体的细粒度特征,从而减少特征转移和融合过程中深层语义信息的损失。在 VEDAI 和 LEVIR 遥感数据集上的实验结果表明,将这两个模块嵌入到只使用小尺度探测器的 YOLO 系列算法中,可以显著提高性能,同时降低计算复杂度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Signal Processing Letters
IEEE Signal Processing Letters 工程技术-工程:电子与电气
CiteScore
7.40
自引率
12.80%
发文量
339
审稿时长
2.8 months
期刊介绍: The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.
期刊最新文献
KFA: Keyword Feature Augmentation for Open Set Keyword Spotting RFI-Aware and Low-Cost Maximum Likelihood Imaging for High-Sensitivity Radio Telescopes Audio Mamba: Bidirectional State Space Model for Audio Representation Learning System-Informed Neural Network for Frequency Detection Order Estimation of Linear-Phase FIR Filters for DAC Equalization in Multiple Nyquist Bands
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1