DS-YOLO：基于倒置瓶颈和多尺度融合网络的密集小目标检测算法

Biomimetic Intelligence and Robotics Pub Date : 2024-10-26 DOI:10.1016/j.birob.2024.100190

Hongyu Zhang , Guoliang Li , Dapeng Wan , Ziyue Wang , Jinshun Dong , Shoujun Lin , Lixia Deng , Haiying Liu

{"title":"DS-YOLO：基于倒置瓶颈和多尺度融合网络的密集小目标检测算法","authors":"Hongyu Zhang , Guoliang Li , Dapeng Wan , Ziyue Wang , Jinshun Dong , Shoujun Lin , Lixia Deng , Haiying Liu","doi":"10.1016/j.birob.2024.100190","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of security, intelligent surveillance tasks often involve a large number of dense and small objects, with severe occlusion between them, making detection particularly challenging. To address this significant challenge, Dense and Small YOLO (DS-YOLO), a dense small object detection algorithm based on YOLOv8s, is proposed in this paper. Firstly, to enhance the dense small objects’ feature extraction capability of backbone network, the paper proposes a lightweight backbone. The improved C2fUIB is employed to create a lightweight model and expand the receptive field, enabling the capture of richer contextual information and reducing the impact of occlusion on detection accuracy. Secondly, to enhance the feature fusion capability of model, a multi-scale feature fusion network, Light-weight Full Scale PAFPN (LFS-PAFPN), combined with the DO-C2f module, is introduced. The new module successfully reduces the miss rate of dense small objects while ensuring the accuracy of detecting large objects. Finally, to minimize feature loss of dense objects during network transmission, a dynamic upsampling module, DySample, is implemented. DS-YOLO was trained and tested on the CrowdHuman and VisDrone2019 datasets, which contain a large number of densely populated pedestrians, vehicles and other objects. Experimental evaluations demonstrated that DS-YOLO has advantages in dense small object detection tasks. Compared with YOLOv8s, the Recall and [email protected] are increased by 4.9% and 4.2% on CrowdHuman dataset, 4.6% and 5% on VisDrone2019, respectively. Simultaneously, DS-YOLO does not introduce a substantial amount of computing overhead, maintaining low hardware requirements.</div></div>","PeriodicalId":100184,"journal":{"name":"Biomimetic Intelligence and Robotics","volume":"4 4","pages":"Article 100190"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DS-YOLO: A dense small object detection algorithm based on inverted bottleneck and multi-scale fusion network\",\"authors\":\"Hongyu Zhang , Guoliang Li , Dapeng Wan , Ziyue Wang , Jinshun Dong , Shoujun Lin , Lixia Deng , Haiying Liu\",\"doi\":\"10.1016/j.birob.2024.100190\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the field of security, intelligent surveillance tasks often involve a large number of dense and small objects, with severe occlusion between them, making detection particularly challenging. To address this significant challenge, Dense and Small YOLO (DS-YOLO), a dense small object detection algorithm based on YOLOv8s, is proposed in this paper. Firstly, to enhance the dense small objects’ feature extraction capability of backbone network, the paper proposes a lightweight backbone. The improved C2fUIB is employed to create a lightweight model and expand the receptive field, enabling the capture of richer contextual information and reducing the impact of occlusion on detection accuracy. Secondly, to enhance the feature fusion capability of model, a multi-scale feature fusion network, Light-weight Full Scale PAFPN (LFS-PAFPN), combined with the DO-C2f module, is introduced. The new module successfully reduces the miss rate of dense small objects while ensuring the accuracy of detecting large objects. Finally, to minimize feature loss of dense objects during network transmission, a dynamic upsampling module, DySample, is implemented. DS-YOLO was trained and tested on the CrowdHuman and VisDrone2019 datasets, which contain a large number of densely populated pedestrians, vehicles and other objects. Experimental evaluations demonstrated that DS-YOLO has advantages in dense small object detection tasks. Compared with YOLOv8s, the Recall and [email protected] are increased by 4.9% and 4.2% on CrowdHuman dataset, 4.6% and 5% on VisDrone2019, respectively. Simultaneously, DS-YOLO does not introduce a substantial amount of computing overhead, maintaining low hardware requirements.</div></div>\",\"PeriodicalId\":100184,\"journal\":{\"name\":\"Biomimetic Intelligence and Robotics\",\"volume\":\"4 4\",\"pages\":\"Article 100190\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomimetic Intelligence and Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667379724000482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomimetic Intelligence and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667379724000482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在安防领域，智能监控任务往往涉及大量密集的小型物体，而且这些物体之间存在严重的遮挡，因此检测工作尤其具有挑战性。针对这一重大挑战，本文提出了一种基于 YOLOv8s 的密集小物体检测算法--密集小物体 YOLO（Dense and Small YOLO，DS-YOLO）。首先，为了增强骨干网的密集小目标特征提取能力，本文提出了一种轻量级骨干网。采用改进的 C2fUIB 创建轻量级模型并扩大感受野，从而能够捕获更丰富的上下文信息，降低遮挡对检测精度的影响。其次，为增强模型的特征融合能力，引入了多尺度特征融合网络--轻量级全尺度 PAFPN（LFS-PAFPN），并与 DO-C2f 模块相结合。新模块成功降低了高密度小物体的漏检率，同时保证了大物体的检测精度。最后，为了最大限度地减少密集物体在网络传输过程中的特征损失，还实施了动态上采样模块 DySample。DS-YOLO 在 CrowdHuman 和 VisDrone2019 数据集上进行了训练和测试，这两个数据集包含大量密集的行人、车辆和其他物体。实验评估表明，DS-YOLO 在密集小物体检测任务中具有优势。与 YOLOv8s 相比，DS-YOLO 在 CrowdHuman 数据集上的 Recall 和 [email protected] 分别提高了 4.9% 和 4.2%，在 VisDrone2019 上分别提高了 4.6% 和 5%。同时，DS-YOLO 没有引入大量计算开销，保持了较低的硬件要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DS-YOLO: A dense small object detection algorithm based on inverted bottleneck and multi-scale fusion network

In the field of security, intelligent surveillance tasks often involve a large number of dense and small objects, with severe occlusion between them, making detection particularly challenging. To address this significant challenge, Dense and Small YOLO (DS-YOLO), a dense small object detection algorithm based on YOLOv8s, is proposed in this paper. Firstly, to enhance the dense small objects’ feature extraction capability of backbone network, the paper proposes a lightweight backbone. The improved C2fUIB is employed to create a lightweight model and expand the receptive field, enabling the capture of richer contextual information and reducing the impact of occlusion on detection accuracy. Secondly, to enhance the feature fusion capability of model, a multi-scale feature fusion network, Light-weight Full Scale PAFPN (LFS-PAFPN), combined with the DO-C2f module, is introduced. The new module successfully reduces the miss rate of dense small objects while ensuring the accuracy of detecting large objects. Finally, to minimize feature loss of dense objects during network transmission, a dynamic upsampling module, DySample, is implemented. DS-YOLO was trained and tested on the CrowdHuman and VisDrone2019 datasets, which contain a large number of densely populated pedestrians, vehicles and other objects. Experimental evaluations demonstrated that DS-YOLO has advantages in dense small object detection tasks. Compared with YOLOv8s, the Recall and [email protected] are increased by 4.9% and 4.2% on CrowdHuman dataset, 4.6% and 5% on VisDrone2019, respectively. Simultaneously, DS-YOLO does not introduce a substantial amount of computing overhead, maintaining low hardware requirements.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biomimetic Intelligence and Robotics

CiteScore

1.80

自引率

0.00%

发文量