{"title":"基于卷积神经网络的多类目标检测算法","authors":"Yanjuan Wang, H. Niu, Xiao Wang, Liang Chen","doi":"10.1109/ICIASE45644.2019.9074015","DOIUrl":null,"url":null,"abstract":"In order to improve the accurate recognition rate and localization rate of multi-class object detection, a new network structure, Res-YOLO-R., based on the combination of Residual Network (ResNet) and You Only Look Once (YOLO) detection network, is proposed. To improve the location ability and speed up the convergence of the network, the number and size of prediction boxes for YOLO network are redesigned by clustering analysis algorithm. Removing part of the pool layer and using convolution layer to raise or reduce the dimension of the feature to improve the ability of feature extraction and computing of the network. ResNet is designed as the feature extraction part, and the final average pool layer and the full connection layer are removed, and combines with the improved YOLO detection network to improve the degradation problem caused by the increasement of the network depth. In order to make the network learn object context information better, the ROUTE and REORG layers are used to fuse feature from different layers, and the feature map is reorganized. Through the comparison of experiments on commodity data sets, the network structure can effectively reduce the false detection rate and miss detection rate, improve the detection accuracy, positioning ability and recall rate of commodities, and have good real-time and generalization ability and strong practicability.","PeriodicalId":206741,"journal":{"name":"2019 IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Multi-class Object Detection Algorithm Based on Convolutional Neural Network\",\"authors\":\"Yanjuan Wang, H. Niu, Xiao Wang, Liang Chen\",\"doi\":\"10.1109/ICIASE45644.2019.9074015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the accurate recognition rate and localization rate of multi-class object detection, a new network structure, Res-YOLO-R., based on the combination of Residual Network (ResNet) and You Only Look Once (YOLO) detection network, is proposed. To improve the location ability and speed up the convergence of the network, the number and size of prediction boxes for YOLO network are redesigned by clustering analysis algorithm. Removing part of the pool layer and using convolution layer to raise or reduce the dimension of the feature to improve the ability of feature extraction and computing of the network. ResNet is designed as the feature extraction part, and the final average pool layer and the full connection layer are removed, and combines with the improved YOLO detection network to improve the degradation problem caused by the increasement of the network depth. In order to make the network learn object context information better, the ROUTE and REORG layers are used to fuse feature from different layers, and the feature map is reorganized. Through the comparison of experiments on commodity data sets, the network structure can effectively reduce the false detection rate and miss detection rate, improve the detection accuracy, positioning ability and recall rate of commodities, and have good real-time and generalization ability and strong practicability.\",\"PeriodicalId\":206741,\"journal\":{\"name\":\"2019 IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIASE45644.2019.9074015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIASE45644.2019.9074015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
为了提高多类目标检测的准确识别率和定位率,提出了一种新的网络结构Res-YOLO-R。提出了一种基于残余网络(ResNet)和You Only Look Once (YOLO)检测网络相结合的检测网络。为了提高网络的定位能力,加快网络的收敛速度,利用聚类分析算法重新设计了YOLO网络预测盒的数量和大小。去除部分池层,利用卷积层提高或降低特征维数,提高网络的特征提取和计算能力。设计了ResNet作为特征提取部分,去除最终的平均池层和全连接层,结合改进的YOLO检测网络,改善了网络深度增加带来的退化问题。为了使网络更好地学习对象上下文信息,利用ROUTE层和REORG层融合不同层的特征,对特征映射进行重组。通过对商品数据集的实验对比,该网络结构能够有效降低误检率和漏检率,提高商品的检测精度、定位能力和召回率,具有良好的实时性和泛化能力,实用性强。
Multi-class Object Detection Algorithm Based on Convolutional Neural Network
In order to improve the accurate recognition rate and localization rate of multi-class object detection, a new network structure, Res-YOLO-R., based on the combination of Residual Network (ResNet) and You Only Look Once (YOLO) detection network, is proposed. To improve the location ability and speed up the convergence of the network, the number and size of prediction boxes for YOLO network are redesigned by clustering analysis algorithm. Removing part of the pool layer and using convolution layer to raise or reduce the dimension of the feature to improve the ability of feature extraction and computing of the network. ResNet is designed as the feature extraction part, and the final average pool layer and the full connection layer are removed, and combines with the improved YOLO detection network to improve the degradation problem caused by the increasement of the network depth. In order to make the network learn object context information better, the ROUTE and REORG layers are used to fuse feature from different layers, and the feature map is reorganized. Through the comparison of experiments on commodity data sets, the network structure can effectively reduce the false detection rate and miss detection rate, improve the detection accuracy, positioning ability and recall rate of commodities, and have good real-time and generalization ability and strong practicability.