A Large-scale Detection Algorithm and Application Based on YOLOv4

2021 IEEE 24th International Conference on Computational Science and Engineering (CSE) Pub Date : 2021-10-01 DOI:10.1109/CSE53436.2021.00026

Xiangbin Shi, Jinwen Peng

{"title":"A Large-scale Detection Algorithm and Application Based on YOLOv4","authors":"Xiangbin Shi, Jinwen Peng","doi":"10.1109/CSE53436.2021.00026","DOIUrl":null,"url":null,"abstract":"In this paper, we addressed the problems of occlusion and crowding in large-scale object detection. First, large-scale detection is more complex and diverse than traditional object detection. The number of targets to be detected is larger and often clustered together. This will produce occlusion and dense detection problems, which brings a serious challenge to object detection. Secondly, current dominant object detection is rarely trained and inferred on large-scale labeled dataset, so it is unable to evaluate the performance of these detection models on large dataset. To solve the above problems, we propose L-YOLO large-scale object detection algorithm. We modified the structure of feature pyramid network, then the receptive field was increased by using four-scale detection. Next, we propose a new loss function designed specifically for large-scale scenarios, which keeps the prediction box that is not the target as far away from the target as possible. It prevents the fusion of adjacent boundary boxes in the inference process and improves the detection performance in the case of occlusion effectively. At last, we use a new non-maximum suppression rule to prevent suppression of the correct detection box during infer. We annotated new dataset for large-scale detection, retrained and evaluated our model. Experiments on our dataset show the superiority of our model. Compared to the original YOLOv4, our improved model increases 1.8% mAP.","PeriodicalId":6838,"journal":{"name":"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)","volume":"54 1","pages":"116-122"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSE53436.2021.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we addressed the problems of occlusion and crowding in large-scale object detection. First, large-scale detection is more complex and diverse than traditional object detection. The number of targets to be detected is larger and often clustered together. This will produce occlusion and dense detection problems, which brings a serious challenge to object detection. Secondly, current dominant object detection is rarely trained and inferred on large-scale labeled dataset, so it is unable to evaluate the performance of these detection models on large dataset. To solve the above problems, we propose L-YOLO large-scale object detection algorithm. We modified the structure of feature pyramid network, then the receptive field was increased by using four-scale detection. Next, we propose a new loss function designed specifically for large-scale scenarios, which keeps the prediction box that is not the target as far away from the target as possible. It prevents the fusion of adjacent boundary boxes in the inference process and improves the detection performance in the case of occlusion effectively. At last, we use a new non-maximum suppression rule to prevent suppression of the correct detection box during infer. We annotated new dataset for large-scale detection, retrained and evaluated our model. Experiments on our dataset show the superiority of our model. Compared to the original YOLOv4, our improved model increases 1.8% mAP.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于YOLOv4的大规模检测算法及应用

在本文中，我们解决了大规模目标检测中的遮挡和拥挤问题。首先，与传统的目标检测相比，大规模检测更加复杂和多样。需要检测的目标数量较大，并且经常聚集在一起。这会产生遮挡和密集检测问题，给目标检测带来严峻的挑战。其次，目前的主导目标检测很少在大规模标记数据集上进行训练和推断，因此无法评估这些检测模型在大型数据集上的性能。为了解决上述问题，我们提出了L-YOLO大规模目标检测算法。首先对特征金字塔网络结构进行修正，然后采用四尺度检测增加接收野。接下来，我们提出了一种专门为大规模场景设计的新的损失函数，使非目标的预测框尽可能远离目标。在推理过程中防止了相邻边界框的融合，有效提高了遮挡情况下的检测性能。最后，我们采用了一种新的非最大抑制规则来防止正确检测框在推断过程中被抑制。我们为大规模检测标注了新的数据集，重新训练和评估了我们的模型。在我们的数据集上的实验表明了我们模型的优越性。与原来的YOLOv4相比，我们改进的模型增加了1.8%的mAP。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)

自引率

0.00%

发文量