Vehicle detection algorithm based on multi-scale features and normalization attention model

Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics
Pub Date: 2022-06-24
DOI: 10.1145/3548608.3559196

Yu-Shuai Duan, Huarong Xu, Lifen Weng



Abstract


Vehicle detection algorithm based on multi-scale features and normalization attention model

As a key technology in the perception module of autonomous driving, vehicle detection in complex scenes must acquire the position and distance of surrounding vehicles accurately and in real time to ensure passenger safety. The CenterNet algorithm performs well in vehicle detection, striking a balance between accuracy and speed, but the network extracts target features only from the last layer of the feature map, which leads to missed and false detections. This paper therefore proposes a Vehicle-CenterNet detection model. It captures more detailed information by modifying the original ResNet: hierarchical connections are constructed within a single residual block, and stacked convolution operators enlarge the receptive field of each layer. In addition, the Mish activation function replaces ReLU; its smoothness allows information to propagate better through the network, yielding better accuracy and generalization. A normalization-based attention module (NAM) is also incorporated to suppress non-target features and further improve detection accuracy. Experimental results on the VOC and KITTI datasets show that the mean average precision (mAP) and F1 score of the proposed method improve to varying degrees, and that its overall performance is better than that of the original CenterNet algorithm.
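
To illustrate the activation swap mentioned in the abstract, the sketch below compares Mish with ReLU on scalar inputs. It assumes the commonly used definition mish(x) = x * tanh(softplus(x)); the function names are illustrative and are not taken from the authors' implementation, which is not shown on this page.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable softplus, log(1 + exp(x))."""
    # max(x, 0) + log1p(exp(-|x|)) avoids overflow for large |x|.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x: float) -> float:
    """Mish activation (assumed standard form): x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))


def relu(x: float) -> float:
    """ReLU activation: max(0, x)."""
    return max(0.0, x)


if __name__ == "__main__":
    # Print both activations side by side for a few sample inputs.
    for x in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(f"x={x:+.1f}  relu={relu(x):.4f}  mish={mish(x):.4f}")
```

Unlike ReLU, which outputs exactly zero for every negative input, Mish is smooth and lets small negative values through, which is the property the abstract credits for better information flow.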
