YOLO-Faster: An efficient remote sensing object detection method based on AMFFN.

IF 2.6 4区 综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES Science Progress Pub Date : 2024-10-01 DOI:10.1177/00368504241280765
Yicheng Tong, Guan Yue, Longfei Fan, Guosen Lyu, Deya Zhu, Yan Liu, Boyuan Meng, Shu Liu, Xiaokai Mu, Congling Tian
{"title":"YOLO-Faster: An efficient remote sensing object detection method based on AMFFN.","authors":"Yicheng Tong, Guan Yue, Longfei Fan, Guosen Lyu, Deya Zhu, Yan Liu, Boyuan Meng, Shu Liu, Xiaokai Mu, Congling Tian","doi":"10.1177/00368504241280765","DOIUrl":null,"url":null,"abstract":"<p><p>As a pivotal task within computer vision, object detection finds application across a diverse spectrum of industrial scenarios. The advent of deep learning technologies has significantly elevated the accuracy of object detectors designed for general-purpose applications. Nevertheless, in contrast to conventional terrestrial environments, remote sensing object detection scenarios pose formidable challenges, including intricate and diverse backgrounds, fluctuating object scales, and pronounced interference from background noise, rendering remote sensing object detection an enduringly demanding task. In addition, despite the superior detection performance of deep learning-based object detection networks compared to traditional counterparts, their substantial parameter and computational demands curtail their feasibility for deployment on mobile devices equipped with low-power processors. In response to the aforementioned challenges, this paper introduces an enhanced lightweight remote sensing object detection network, denoted as YOLO-Faster, built upon the foundation of YOLOv5. Firstly, the lightweight design and inference speed of the object detection network is augmented by incorporating the lightweight network as the foundational network within YOLOv5, satisfying the demand for real-time detection on mobile devices. Moreover, to tackle the issue of detecting objects of different scales in large and complex backgrounds, an adaptive multiscale feature fusion network is introduced, which dynamically adjusts the large receptive field to capture dependencies among objects of different scales, enabling better modeling of object detection scenarios in remote sensing scenes. At last, the robustness of the object detection network under background noise is enhanced through incorporating a decoupled detection head that separates the classification and regression processes of the detection network. The results obtained from the public remote sensing object detection dataset DOTA show that the proposed method has a mean average precision of 71.4% and a detection speed of 38 frames per second.</p>","PeriodicalId":56061,"journal":{"name":"Science Progress","volume":"107 4","pages":"368504241280765"},"PeriodicalIF":2.6000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11475215/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Progress","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1177/00368504241280765","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

As a pivotal task within computer vision, object detection finds application across a diverse spectrum of industrial scenarios. The advent of deep learning technologies has significantly elevated the accuracy of object detectors designed for general-purpose applications. Nevertheless, in contrast to conventional terrestrial environments, remote sensing object detection scenarios pose formidable challenges, including intricate and diverse backgrounds, fluctuating object scales, and pronounced interference from background noise, rendering remote sensing object detection an enduringly demanding task. In addition, despite the superior detection performance of deep learning-based object detection networks compared to traditional counterparts, their substantial parameter and computational demands curtail their feasibility for deployment on mobile devices equipped with low-power processors. In response to the aforementioned challenges, this paper introduces an enhanced lightweight remote sensing object detection network, denoted as YOLO-Faster, built upon the foundation of YOLOv5. Firstly, the lightweight design and inference speed of the object detection network is augmented by incorporating the lightweight network as the foundational network within YOLOv5, satisfying the demand for real-time detection on mobile devices. Moreover, to tackle the issue of detecting objects of different scales in large and complex backgrounds, an adaptive multiscale feature fusion network is introduced, which dynamically adjusts the large receptive field to capture dependencies among objects of different scales, enabling better modeling of object detection scenarios in remote sensing scenes. At last, the robustness of the object detection network under background noise is enhanced through incorporating a decoupled detection head that separates the classification and regression processes of the detection network. The results obtained from the public remote sensing object detection dataset DOTA show that the proposed method has a mean average precision of 71.4% and a detection speed of 38 frames per second.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
YOLO-Faster:基于 AMFFN 的高效遥感物体检测方法。
作为计算机视觉领域的一项重要任务,物体检测被广泛应用于各种工业场景。深度学习技术的出现大大提高了为通用应用设计的物体检测器的精度。然而,与传统的地面环境相比,遥感物体检测场景面临着巨大的挑战,包括复杂多样的背景、波动的物体尺度以及明显的背景噪声干扰,这使得遥感物体检测成为一项长期艰巨的任务。此外,尽管基于深度学习的物体检测网络与传统网络相比具有更优越的检测性能,但其对参数和计算的大量需求限制了其在配备低功耗处理器的移动设备上部署的可行性。针对上述挑战,本文在 YOLOv5 的基础上推出了一种增强型轻量级遥感物体检测网络,简称为 YOLO-Faster。首先,通过将轻量级网络作为 YOLOv5 的基础网络,增强了物体检测网络的轻量级设计和推理速度,满足了移动设备实时检测的需求。此外,针对大型复杂背景中不同尺度物体的检测问题,引入了自适应多尺度特征融合网络,动态调整大感受野以捕捉不同尺度物体之间的依赖关系,从而更好地模拟遥感场景中的物体检测场景。最后,通过将检测网络的分类和回归过程分离的解耦检测头,增强了物体检测网络在背景噪声下的鲁棒性。从公共遥感物体检测数据集 DOTA 获得的结果表明,所提出的方法的平均精度为 71.4%,检测速度为每秒 38 帧。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Science Progress
Science Progress Multidisciplinary-Multidisciplinary
CiteScore
3.80
自引率
0.00%
发文量
119
期刊介绍: Science Progress has for over 100 years been a highly regarded review publication in science, technology and medicine. Its objective is to excite the readers'' interest in areas with which they may not be fully familiar but which could facilitate their interest, or even activity, in a cognate field.
期刊最新文献
A voltage mode grounded capacitance multiplier with widely tunable gain for ultra-low cutoff frequency filter. Appropriate dose of tranexamic acid in the topical treatment of anterior epistaxis, 500 mg vs 1000 mg: A double-blind randomized controlled trial. Research status and prospect of flexible optimization design methodology of propeller CNC polishing machines. Sliding mode control with self-adaptive parameters of a 5-DOF hybrid robot. Spoofing attack recognition for GNSS-based train positioning using a BO-LightGBM method.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1