基于多尺度特征和规范化注意力模型的车辆检测算法

Yu-Shuai Duan, Huarong Xu, Lifen Weng
{"title":"基于多尺度特征和规范化注意力模型的车辆检测算法","authors":"Yu-Shuai Duan, Huarong Xu, Lifen Weng","doi":"10.1145/3548608.3559196","DOIUrl":null,"url":null,"abstract":"As the key technology of automatic driving perception module, vehicle detection in complex scenes requires real-time and accurate acquisition of the position and distance information of surrounding vehicles, so as to ensure the safety of passengers. Centernet algorithm performs well in vehicle detection, achieving a trade-off between accuracy and speed, but the network only extracts features of the target at the last layer of the feature map, leading to the problem of missed and false detections during detection. Therefore, this paper proposes a Vehicle-CenterNet detection model, which obtains more detailed information by modifying the original ResNet, constructing layered connections within a single residual block, and increasing the perceptual field size of each layer by stacking convolution operators. In addition, the Mish activation function is used instead of the ReLU activation function, and the smoothed activation function allows better information penetration into the neural network, resulting in better accuracy and generalization. The normalization-based attention module (NAM) is also incorporated to suppress non-target features and further improve the detection accuracy of the model. Experimental results on VOC dataset and KITTI dataset show that the mean average precision (mAP) and F1 Score of the proposed method are improved to different degrees, and the comprehensive performance is better than the original CenterNet algorithm.","PeriodicalId":201434,"journal":{"name":"Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vehicle detection algorithm based on multi-scale features and normalization attention model\",\"authors\":\"Yu-Shuai Duan, Huarong Xu, Lifen Weng\",\"doi\":\"10.1145/3548608.3559196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the key technology of automatic driving perception module, vehicle detection in complex scenes requires real-time and accurate acquisition of the position and distance information of surrounding vehicles, so as to ensure the safety of passengers. Centernet algorithm performs well in vehicle detection, achieving a trade-off between accuracy and speed, but the network only extracts features of the target at the last layer of the feature map, leading to the problem of missed and false detections during detection. Therefore, this paper proposes a Vehicle-CenterNet detection model, which obtains more detailed information by modifying the original ResNet, constructing layered connections within a single residual block, and increasing the perceptual field size of each layer by stacking convolution operators. In addition, the Mish activation function is used instead of the ReLU activation function, and the smoothed activation function allows better information penetration into the neural network, resulting in better accuracy and generalization. The normalization-based attention module (NAM) is also incorporated to suppress non-target features and further improve the detection accuracy of the model. Experimental results on VOC dataset and KITTI dataset show that the mean average precision (mAP) and F1 Score of the proposed method are improved to different degrees, and the comprehensive performance is better than the original CenterNet algorithm.\",\"PeriodicalId\":201434,\"journal\":{\"name\":\"Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3548608.3559196\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 2nd International Conference on Control and Intelligent Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3548608.3559196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

复杂场景下的车辆检测作为自动驾驶感知模块的关键技术,需要实时准确地获取周围车辆的位置和距离信息,以保证乘客的安全。Centernet算法在车辆检测方面表现良好,实现了精度和速度之间的权衡,但该网络只提取了特征图最后一层的目标特征,导致检测过程中存在漏检和误检问题。因此,本文提出了一种Vehicle-CenterNet检测模型,该模型通过修改原始ResNet,在单个残差块内构建分层连接,并通过叠加卷积算子增加每层的感知场大小,从而获得更详细的信息。此外,使用Mish激活函数代替ReLU激活函数,平滑的激活函数可以更好地将信息渗透到神经网络中,从而获得更好的准确性和泛化性。同时加入了基于归一化的注意模块(NAM)来抑制非目标特征,进一步提高了模型的检测精度。在VOC数据集和KITTI数据集上的实验结果表明,本文方法的平均精度(mAP)和F1 Score均有不同程度的提高,综合性能优于原有的CenterNet算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Vehicle detection algorithm based on multi-scale features and normalization attention model
As the key technology of automatic driving perception module, vehicle detection in complex scenes requires real-time and accurate acquisition of the position and distance information of surrounding vehicles, so as to ensure the safety of passengers. Centernet algorithm performs well in vehicle detection, achieving a trade-off between accuracy and speed, but the network only extracts features of the target at the last layer of the feature map, leading to the problem of missed and false detections during detection. Therefore, this paper proposes a Vehicle-CenterNet detection model, which obtains more detailed information by modifying the original ResNet, constructing layered connections within a single residual block, and increasing the perceptual field size of each layer by stacking convolution operators. In addition, the Mish activation function is used instead of the ReLU activation function, and the smoothed activation function allows better information penetration into the neural network, resulting in better accuracy and generalization. The normalization-based attention module (NAM) is also incorporated to suppress non-target features and further improve the detection accuracy of the model. Experimental results on VOC dataset and KITTI dataset show that the mean average precision (mAP) and F1 Score of the proposed method are improved to different degrees, and the comprehensive performance is better than the original CenterNet algorithm.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Study on Optimization of cold chain logistics distribution path of agricultural products in Hefei Design and Implementation of a Batteryless Pedometer based on a Motion Tracking Sensor Rapid visual positioning of sheet metal parts based on electronic drawing templates An analysis of hot topics and trends in foreign 3D printing technology research——analysis of knowledge graphs based on citation indexes such as SSCI Tibetan Jiu Chess Game Algorithm based on Expert Knowledge
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1