A Lightweight Fusion Strategy With Enhanced Interlayer Feature Correlation for Small Object Detection

IF 7.5 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-09-10 DOI:10.1109/TGRS.2024.3457155
Yao Xiao;Tingfa Xu;Xin Yu;Yuqiang Fang;Jianan Li
{"title":"A Lightweight Fusion Strategy With Enhanced Interlayer Feature Correlation for Small Object Detection","authors":"Yao Xiao;Tingfa Xu;Xin Yu;Yuqiang Fang;Jianan Li","doi":"10.1109/TGRS.2024.3457155","DOIUrl":null,"url":null,"abstract":"Detecting small objects in drone imagery is challenging due to low resolution and background blending, leading to limited feature information. Multiscale feature fusion can enhance detection by capturing information at different scales, but traditional strategies fall short. Simple concatenation or addition operations do not fully utilize multiscale fusion advantages, resulting in insufficient correlation between features. This inadequacy hinders the detection of small objects, especially in complex backgrounds and densely populated areas. To address this issue and efficiently utilize the limited computational resources, we propose a lightweight fusion strategy based on enhanced interlayer feature correlation (EFC) to replace the traditional feature fusion strategy in feature pyramid network (FPN). The semantic expressions of different layers in the feature pyramid are inconsistent. In EFC, the grouped feature focus unit (GFF) enhances the feature correlation of each layer by focusing on the contextual information of different features. The multilevel feature reconstruction module (MFR) effectively reconstructs and transforms the strength and weakness information of each layer in the pyramid to reduce redundant feature fusion and retain more information about small targets in deep networks. It is noteworthy that the proposed method is plug-and-play and can be widely applied to various base networks. Extensive experiments and comprehensive evaluations on VisDrone, unmanned aerial vehicle benchmark object detection and tracking (UAVDT), and microsoft common objects in context (COCO) demonstrate the effectiveness. Using generalized focal loss (GFL) as the baseline on the VisDrone dataset with a large number of small targets, the proposed method improves the detection mean average precision (mAP) by 1.7%, surpassing many lightweight state-of-the-art methods and significantly reducing the Params and GFLOPs at the neck end. The code will be available at \n<uri>https://github.com/nuliweixiao/EFC.git</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10671587/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting small objects in drone imagery is challenging due to low resolution and background blending, leading to limited feature information. Multiscale feature fusion can enhance detection by capturing information at different scales, but traditional strategies fall short. Simple concatenation or addition operations do not fully utilize multiscale fusion advantages, resulting in insufficient correlation between features. This inadequacy hinders the detection of small objects, especially in complex backgrounds and densely populated areas. To address this issue and efficiently utilize the limited computational resources, we propose a lightweight fusion strategy based on enhanced interlayer feature correlation (EFC) to replace the traditional feature fusion strategy in feature pyramid network (FPN). The semantic expressions of different layers in the feature pyramid are inconsistent. In EFC, the grouped feature focus unit (GFF) enhances the feature correlation of each layer by focusing on the contextual information of different features. The multilevel feature reconstruction module (MFR) effectively reconstructs and transforms the strength and weakness information of each layer in the pyramid to reduce redundant feature fusion and retain more information about small targets in deep networks. It is noteworthy that the proposed method is plug-and-play and can be widely applied to various base networks. Extensive experiments and comprehensive evaluations on VisDrone, unmanned aerial vehicle benchmark object detection and tracking (UAVDT), and microsoft common objects in context (COCO) demonstrate the effectiveness. Using generalized focal loss (GFL) as the baseline on the VisDrone dataset with a large number of small targets, the proposed method improves the detection mean average precision (mAP) by 1.7%, surpassing many lightweight state-of-the-art methods and significantly reducing the Params and GFLOPs at the neck end. The code will be available at https://github.com/nuliweixiao/EFC.git .
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
增强层间特征相关性的轻量级小物体检测融合策略
由于分辨率低和背景混杂,导致特征信息有限,因此在无人机图像中检测小型物体具有挑战性。多尺度特征融合可通过捕捉不同尺度的信息来增强检测能力,但传统策略存在不足。简单的连接或加法操作不能充分利用多尺度融合的优势,导致特征之间的相关性不足。这种不足阻碍了对小型物体的检测,尤其是在复杂背景和人口稠密地区。为解决这一问题并有效利用有限的计算资源,我们提出了一种基于增强层间特征相关性(EFC)的轻量级融合策略,以取代特征金字塔网络(FPN)中的传统特征融合策略。特征金字塔中不同层的语义表达是不一致的。在 EFC 中,分组特征聚焦单元(GFF)通过聚焦不同特征的上下文信息来增强各层的特征相关性。多层次特征重构模块(MFR)可有效重构和转换金字塔中各层的强弱信息,从而减少冗余特征融合,保留更多深度网络中的小目标信息。值得一提的是,所提出的方法即插即用,可广泛应用于各种基础网络。在 VisDrone、无人机基准目标检测与跟踪(UAVDT)和微软上下文中的常见物体(COCO)上进行的大量实验和综合评估证明了该方法的有效性。在具有大量小型目标的 VisDrone 数据集上,以广义焦点损失(GFL)为基准,所提出的方法将检测平均精度(mAP)提高了 1.7%,超过了许多轻量级的先进方法,并显著降低了颈端的 Params 和 GFLOPs。代码将发布在 https://github.com/nuliweixiao/EFC.git 网站上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Remote sensing images destriping via nonconvex regularization and fast regional decomposition An Enhanced Global Feature-Guided Network Based on Multiple Filtering Noise Reduction for Remote Sensing Image Compression SHBGAN: Hybrid Bilateral Attention GAN for Seismic Image Super-Resolution Reconstruction FDBANet: A Fusion Frequency Domain Denoising and Multi-Scale Boundary Attention Network for Sonar Image Semantic Segmentation Remote-Sensing Image Scene Classification via Graph Template Enhancement and Supplementation Network with Dual-Teacher Knowledge Distillation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1