DIR-YOLOv5: a real-time drone-perspective infrared object detection method based on YOLOv5

Yuexing Wang, Chuangang Xu, Jianzhong Su, Jinwen Tian
{"title":"DIR-YOLOv5: a real-time drone-perspective infrared object detection method based on YOLOv5","authors":"Yuexing Wang, Chuangang Xu, Jianzhong Su, Jinwen Tian","doi":"10.1117/12.2692731","DOIUrl":null,"url":null,"abstract":"With the advancement of drone technology, object detection from the perspective of drones has found extensive applications in various fields, including surveillance, search operations, and reconnaissance tasks. Currently, most drones in the market are equipped with visible light imagers, while some high-end drones are equipped with infrared imaging detectors capable of performing infrared object detection tasks. Infrared imaging utilizes a passive imaging mode, enabling it to detect thermal radiation emitted by objects. As a result, it offers the distinct advantage of continuous operation without being restricted by daylight conditions. In comparison to visible imaging, infrared imaging uses longer wavelengths and possesses a certain level of penetration capability through clouds and smoke. Consequently, infrared object detection represents a significant research area within the field of object detection. However, detecting infrared objects, especially small ones, remains challenging due to the complexity of background information, lower resolution compared to visible images, and the lack of shape and texture information in infrared images. In response to these challenges, this study proposes a real-time drone-perspective infrared (IR) object detection method based on the YOLOv5 framework, known as DIR-YOLOv5. To effectively address the challenge of infrared vehicles occupying fewer pixels in the drone’s perspective image and making objects difficult to detect, the coordinate attention (CA) for feature enhancement is introduced. we also introduce a Spatial-Channel dynamic and query-aware sparse attention mechanism (SCBiFormer), which is optimized based on BiFormer. Additionally, we redefine the loss function as the Repulsion Loss function to tackle the problem of infrared vehicle objects gathering and overlapping occlusion in scenarios like parking lots. Furthermore, we expand the ISVD infrared image object detection dataset to include multiple scenarios and conduct experiments using this dataset. The experimental results demonstrate the excellent performance of the proposed method in infrared image object detection tasks, showing improved object detection accuracy and reduced false detection rate compared to current mainstream methods.","PeriodicalId":298662,"journal":{"name":"Applied Optics and Photonics China","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Optics and Photonics China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2692731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the advancement of drone technology, object detection from the perspective of drones has found extensive applications in various fields, including surveillance, search operations, and reconnaissance tasks. Currently, most drones in the market are equipped with visible light imagers, while some high-end drones are equipped with infrared imaging detectors capable of performing infrared object detection tasks. Infrared imaging utilizes a passive imaging mode, enabling it to detect thermal radiation emitted by objects. As a result, it offers the distinct advantage of continuous operation without being restricted by daylight conditions. In comparison to visible imaging, infrared imaging uses longer wavelengths and possesses a certain level of penetration capability through clouds and smoke. Consequently, infrared object detection represents a significant research area within the field of object detection. However, detecting infrared objects, especially small ones, remains challenging due to the complexity of background information, lower resolution compared to visible images, and the lack of shape and texture information in infrared images. In response to these challenges, this study proposes a real-time drone-perspective infrared (IR) object detection method based on the YOLOv5 framework, known as DIR-YOLOv5. To effectively address the challenge of infrared vehicles occupying fewer pixels in the drone’s perspective image and making objects difficult to detect, the coordinate attention (CA) for feature enhancement is introduced. we also introduce a Spatial-Channel dynamic and query-aware sparse attention mechanism (SCBiFormer), which is optimized based on BiFormer. Additionally, we redefine the loss function as the Repulsion Loss function to tackle the problem of infrared vehicle objects gathering and overlapping occlusion in scenarios like parking lots. Furthermore, we expand the ISVD infrared image object detection dataset to include multiple scenarios and conduct experiments using this dataset. The experimental results demonstrate the excellent performance of the proposed method in infrared image object detection tasks, showing improved object detection accuracy and reduced false detection rate compared to current mainstream methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DIR-YOLOv5:基于 YOLOv5 的实时无人机透视红外物体探测方法
随着无人机技术的发展,从无人机的角度进行物体探测已广泛应用于各个领域,包括监视、搜索行动和侦察任务。目前,市场上大多数无人机都配备了可见光成像仪,而一些高端无人机则配备了红外成像探测器,能够执行红外物体探测任务。红外成像采用被动成像模式,能够探测物体发出的热辐射。因此,它具有不受日光条件限制,可连续工作的显著优势。与可见光成像相比,红外成像使用的波长更长,具有一定的穿透云层和烟雾的能力。因此,红外物体检测是物体检测领域的一个重要研究领域。然而,由于背景信息复杂、分辨率低于可见光图像以及红外图像中缺乏形状和纹理信息,检测红外物体(尤其是小物体)仍然具有挑战性。针对这些挑战,本研究提出了一种基于 YOLOv5 框架的无人机实时透视红外(IR)物体检测方法,即 DIR-YOLOv5。为了有效解决红外飞行器在无人机透视图像中占据较少像素而导致物体难以检测的难题,我们引入了用于特征增强的坐标注意(CA)。我们还引入了一种基于 BiFormer 优化的空间通道动态和查询感知稀疏注意机制(SCBiFormer)。此外,我们将损失函数重新定义为斥力损失函数,以解决停车场等场景中红外车辆物体聚集和重叠遮挡的问题。此外,我们将 ISVD 红外图像物体检测数据集扩展到多个场景,并使用该数据集进行了实验。实验结果证明了所提出的方法在红外图像物体检测任务中的优异性能,与当前主流方法相比,提高了物体检测精度,降低了误检率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Composition disorder in InAs/InAsSb superlattice by STM Optical true time delay technique with bidirectional consistency based on unidirectional optical amplifier Large curvature concave metallic mesh with high optical transmittance and strong electromagnetic interference shielding efficiency DP-OOK to QPSK conversion based on vector phase-sensitive amplification bridging core and access networks Real-time digitized RoF transceiver technology based on FPGA
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1