{"title":"基于YOLOv4的轻量化改进目标检测算法","authors":"Rui Chen, Zhenzhong Li, Yuzhao Zhang, Yuehang Li","doi":"10.1145/3573942.3574027","DOIUrl":null,"url":null,"abstract":"To address the problem that the existing object detection network models are large in size and complex in operation and cannot satisfy both detection speed and accuracy under the limited resources and small size platform. Based on YOLOv4 as the benchmark network, a lightweight object detection model LW-YOLO is proposed. Firstly, the backbone feature extraction network is replaced with MobileNetv1, while the number of feature fusion network parameters is significantly reduced by the depth separable convolutional module. Then the BN layer coefficients are used as scaling factors for the importance of the convolutional channels, the scaling factors are sparse using polarization regularization, the errors before and after pruning are reconstructed using least squares and channel weighting methods. The appropriate pruning thresholds are obtained by minimizing the reconstructed errors, the channels with small scaling factor values are eliminated to achieve the lightweight. The experimental results on the VOC (Visual Object Classes) dataset show that the detection accuracy of LW-YOLO is 87.00%, and the FPS(Frames Per Second ) reaches 48.89, which is better than the original YOLOv4 algorithm. It also significantly reduces the number of parameters, computation, and model size, which is more suitable for application in resource-poor embedded mobile devices.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight Improved Based on YOLOv4 Object Detection Algorithm\",\"authors\":\"Rui Chen, Zhenzhong Li, Yuzhao Zhang, Yuehang Li\",\"doi\":\"10.1145/3573942.3574027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problem that the existing object detection network models are large in size and complex in operation and cannot satisfy both detection speed and accuracy under the limited resources and small size platform. Based on YOLOv4 as the benchmark network, a lightweight object detection model LW-YOLO is proposed. Firstly, the backbone feature extraction network is replaced with MobileNetv1, while the number of feature fusion network parameters is significantly reduced by the depth separable convolutional module. Then the BN layer coefficients are used as scaling factors for the importance of the convolutional channels, the scaling factors are sparse using polarization regularization, the errors before and after pruning are reconstructed using least squares and channel weighting methods. The appropriate pruning thresholds are obtained by minimizing the reconstructed errors, the channels with small scaling factor values are eliminated to achieve the lightweight. The experimental results on the VOC (Visual Object Classes) dataset show that the detection accuracy of LW-YOLO is 87.00%, and the FPS(Frames Per Second ) reaches 48.89, which is better than the original YOLOv4 algorithm. It also significantly reduces the number of parameters, computation, and model size, which is more suitable for application in resource-poor embedded mobile devices.\",\"PeriodicalId\":103293,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3573942.3574027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573942.3574027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
针对现有目标检测网络模型规模大、操作复杂,在有限的资源和小型平台下无法同时满足检测速度和精度的问题。以YOLOv4为基准网络,提出了一种轻量级的目标检测模型LW-YOLO。首先,用MobileNetv1取代主干特征提取网络,同时利用深度可分卷积模块显著减少特征融合网络参数的数量;然后将BN层系数作为卷积信道重要性的标度因子,利用极化正则化对标度因子进行稀疏处理,利用最小二乘法和信道加权法重构剪枝前后的误差。通过最小化重构误差获得合适的剪枝阈值,剔除比例因子较小的通道,实现轻量化。在VOC (Visual Object Classes)数据集上的实验结果表明,LW-YOLO的检测准确率为87.00%,FPS(Frames Per Second)达到48.89,优于原来的YOLOv4算法。它还显著减少了参数数量、计算量和模型尺寸,更适合在资源贫乏的嵌入式移动设备中应用。
Lightweight Improved Based on YOLOv4 Object Detection Algorithm
To address the problem that the existing object detection network models are large in size and complex in operation and cannot satisfy both detection speed and accuracy under the limited resources and small size platform. Based on YOLOv4 as the benchmark network, a lightweight object detection model LW-YOLO is proposed. Firstly, the backbone feature extraction network is replaced with MobileNetv1, while the number of feature fusion network parameters is significantly reduced by the depth separable convolutional module. Then the BN layer coefficients are used as scaling factors for the importance of the convolutional channels, the scaling factors are sparse using polarization regularization, the errors before and after pruning are reconstructed using least squares and channel weighting methods. The appropriate pruning thresholds are obtained by minimizing the reconstructed errors, the channels with small scaling factor values are eliminated to achieve the lightweight. The experimental results on the VOC (Visual Object Classes) dataset show that the detection accuracy of LW-YOLO is 87.00%, and the FPS(Frames Per Second ) reaches 48.89, which is better than the original YOLOv4 algorithm. It also significantly reduces the number of parameters, computation, and model size, which is more suitable for application in resource-poor embedded mobile devices.