Yadong Wang, Jin Li, Ruocong Yang, Zexuan Wang, Yue Zhang
Object Recognition Algorithm for Complex Scenes Based on Improved YOLO v3
2021 IEEE 7th International Conference on Control Science and Systems Engineering (ICCSSE), 2021-07-30
DOI: 10.1109/ICCSSE52761.2021.9545150
Citations: 0
Abstract
YOLO v3 is widely used in industry because of its high detection accuracy and speed, but it outputs only the position coordinates of a bounding box (bbox) and cannot predict the bbox's localization uncertainty. To solve this problem, an improved YOLO v3 algorithm is proposed. The number of position parameters output by the network is increased, and the localization uncertainty of each bbox is predicted with Gaussian modeling, so that boxes with high uncertainty can be removed during detection. A new localization loss function is designed on the basis of the additional bbox coordinate outputs. The Batch Normalization layer and the Convolution layer are merged to reduce video memory usage and improve network performance. Experimental results show that the mAP50 of the improved YOLO v3 algorithm on the helmet-wearing test set is improved by 7.99%.
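The abstract does not give the form of the new localization loss or the rule used to discard uncertain boxes. The sketch below is therefore only an illustration of the general idea, assuming each bbox coordinate is modeled as a Gaussian with a predicted mean and standard deviation and trained with the standard Gaussian negative log-likelihood. The names `gaussian_nll`, `box_uncertainty`, `filter_boxes`, the averaging of the four standard deviations, and the threshold `max_uncertainty` are all hypothetical and are not taken from the paper.

```python
import math


def gaussian_nll(target, mean, sigma, eps=1e-9):
    """Negative log-likelihood of `target` under N(mean, sigma^2).

    This is the textbook Gaussian NLL, assumed here as a stand-in for
    the paper's localization loss, which the abstract does not specify.
    """
    var = sigma * sigma + eps  # eps guards against a zero variance
    return 0.5 * math.log(2.0 * math.pi * var) + (target - mean) ** 2 / (2.0 * var)


def box_uncertainty(sigmas):
    """Aggregate per-coordinate sigmas into one score (assumed: the mean)."""
    return sum(sigmas) / len(sigmas)


def filter_boxes(boxes, max_uncertainty=0.5):
    """Keep boxes whose aggregated uncertainty does not exceed the threshold.

    `max_uncertainty` is a hypothetical tuning parameter.
    """
    return [b for b in boxes if box_uncertainty(b["sigma"]) <= max_uncertainty]


# Two example detections: predicted (x, y, w, h) means and their sigmas.
boxes = [
    {"mean": [0.50, 0.40, 0.20, 0.30], "sigma": [0.1, 0.1, 0.2, 0.2]},
    {"mean": [0.10, 0.90, 0.60, 0.50], "sigma": [0.9, 0.8, 0.7, 0.6]},
]
kept = filter_boxes(boxes)
```

Under this loss, a prediction far from the target is penalized less when the network also reports a large sigma, which is what lets sigma act as a learned uncertainty estimate.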
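Merging a Batch Normalization layer into the preceding Convolution layer is commonly done at inference time by folding the normalization's scale and shift into the convolution's weights and bias. The sketch below shows that standard folding for a 1x1 convolution evaluated at a single pixel, written in plain Python for readability. It is an assumption that the paper uses this folding; the function names and the example values are hypothetical.

```python
import math


def conv1x1(x, weight, bias):
    """Apply a 1x1 convolution at one pixel; x holds the input channels."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def batch_norm(y, gamma, beta, mean, var, eps=1e-5):
    """Inference-mode batch normalization using stored running statistics."""
    return [g * (yi - m) / math.sqrt(v + eps) + bt
            for yi, g, bt, m, v in zip(y, gamma, beta, mean, var)]


def fold_bn(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Return conv weights and bias that reproduce conv followed by BN."""
    folded_w, folded_b = [], []
    for row, b, g, bt, m, v in zip(weight, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(v + eps)       # per-output-channel scale
        folded_w.append([w * scale for w in row])
        folded_b.append((b - m) * scale + bt)
    return folded_w, folded_b


# Hypothetical layer with 2 input channels and 2 output channels.
weight = [[1.0, 2.0], [0.5, -1.0]]
bias = [0.1, -0.2]
gamma, beta = [1.5, 0.8], [0.0, 0.3]
mean, var = [0.2, -0.1], [4.0, 0.25]
x = [1.0, -2.0]

separate = batch_norm(conv1x1(x, weight, bias), gamma, beta, mean, var)
fw, fb = fold_bn(weight, bias, gamma, beta, mean, var)
merged = conv1x1(x, fw, fb)
```

Here `separate` and `merged` agree up to floating-point rounding, while the merged layer needs no separate normalization pass or stored intermediate activation.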