Lightweight Low-Altitude UAV Object Detection Based on Improved YOLOv5s

International Journal of Advanced Network, Monitoring and Controls Pub Date : 2024-01-01 DOI:10.2478/ijanmc-2024-0009

Haokai Zeng, Jing Li, Liping Qu

{"title":"Lightweight Low-Altitude UAV Object Detection Based on Improved YOLOv5s","authors":"Haokai Zeng, Jing Li, Liping Qu","doi":"10.2478/ijanmc-2024-0009","DOIUrl":null,"url":null,"abstract":"\n In the context of rapid developments in drone technology, the significance of recognizing and detecting low-altitude unmanned aerial vehicles (UAVs) has grown. Although conventional algorithmic enhancements have increased the detection rate of low-altitude UAV targets, they tend to neglect the intricate nature and computational demands of the algorithms. This paper introduces ATD-YOLO, an enhanced target detection model based on the YOLOv5s architecture, aimed at tackling this issue. Firstly, a realistic low-altitude UAV dataset is fashioned by amalgamating various publicly available datasets. Secondly, a C3F module grounded in FasterNet, incorporating Partial Convolution (PConv), is introduced to decrease model parameters while upholding detection accuracy. Furthermore, the backbone network incorporates an Efficient Multi-Scale Attention (EMA) module to extract essential image information while filtering out irrelevant details, facilitating adaptive feature fusion. Additionally, the universal upsampling operator CARAFE (Content-aware reassembly of features) is utilized instead of nearest-neighbor upsampling. This enhancement boosts the performance of the feature pyramid network by expanding the receptive field for data feature fusion. Lastly, the Slim-Neck network is introduced to fine-tune the feature fusion network, thereby reducing the model’s floating-point calculations and parameters. Experimental findings demonstrate that the improved ATD-YOLO model achieves an accuracy of 92.8%, with a 31.4% decrease in parameters and a 28.7% decrease in floating-point calculations compared to the original model. The detection speed reaches 75.37 frames per second (FPS). These experiments affirm that the proposed enhancement method meets the deployment requirements for low computational power while maintaining high precision.","PeriodicalId":193299,"journal":{"name":"International Journal of Advanced Network, Monitoring and Controls","volume":"15 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Network, Monitoring and Controls","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ijanmc-2024-0009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In the context of rapid developments in drone technology, the significance of recognizing and detecting low-altitude unmanned aerial vehicles (UAVs) has grown. Although conventional algorithmic enhancements have increased the detection rate of low-altitude UAV targets, they tend to neglect the intricate nature and computational demands of the algorithms. This paper introduces ATD-YOLO, an enhanced target detection model based on the YOLOv5s architecture, aimed at tackling this issue. Firstly, a realistic low-altitude UAV dataset is fashioned by amalgamating various publicly available datasets. Secondly, a C3F module grounded in FasterNet, incorporating Partial Convolution (PConv), is introduced to decrease model parameters while upholding detection accuracy. Furthermore, the backbone network incorporates an Efficient Multi-Scale Attention (EMA) module to extract essential image information while filtering out irrelevant details, facilitating adaptive feature fusion. Additionally, the universal upsampling operator CARAFE (Content-aware reassembly of features) is utilized instead of nearest-neighbor upsampling. This enhancement boosts the performance of the feature pyramid network by expanding the receptive field for data feature fusion. Lastly, the Slim-Neck network is introduced to fine-tune the feature fusion network, thereby reducing the model’s floating-point calculations and parameters. Experimental findings demonstrate that the improved ATD-YOLO model achieves an accuracy of 92.8%, with a 31.4% decrease in parameters and a 28.7% decrease in floating-point calculations compared to the original model. The detection speed reaches 75.37 frames per second (FPS). These experiments affirm that the proposed enhancement method meets the deployment requirements for low computational power while maintaining high precision.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于改进型 YOLOv5s 的轻量级低空无人机物体探测技术

在无人机技术飞速发展的背景下，识别和探测低空无人飞行器（UAV）的重要性与日俱增。虽然传统的增强算法提高了低空无人飞行器目标的探测率，但它们往往忽视了算法的复杂性和计算需求。本文介绍了基于 YOLOv5s 架构的增强型目标检测模型 ATD-YOLO，旨在解决这一问题。首先，通过合并各种公开可用的数据集，建立了一个逼真的低空无人机数据集。其次，在 FasterNet 中引入 C3F 模块，结合部分卷积（PConv），以减少模型参数，同时保持检测精度。此外，骨干网络还采用了高效多尺度注意力（EMA）模块，以提取重要的图像信息，同时过滤掉无关细节，促进自适应特征融合。此外，还使用了通用上采样算子 CARAFE（内容感知特征重组）来替代近邻上采样。这一改进扩大了数据特征融合的感受野，从而提高了特征金字塔网络的性能。最后，还引入了 Slim-Neck 网络来微调特征融合网络，从而减少了模型的浮点运算和参数。实验结果表明，改进后的 ATD-YOLO 模型准确率达到 92.8%，与原始模型相比，参数减少了 31.4%，浮点运算减少了 28.7%。检测速度达到每秒 75.37 帧（FPS）。这些实验证实，所提出的增强方法既能满足低计算功率的部署要求，又能保持高精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Advanced Network, Monitoring and Controls

自引率

0.00%

发文量