{"title":"An Improved Faster R-CNN Algorithm for Pedestrian Detection","authors":"Zhaoyang Zhao, Jianwei Ma, Chao Ma, Yuzhu Wang","doi":"10.1109/ITME53901.2021.00026","DOIUrl":null,"url":null,"abstract":"Pedestrian detection is an important branch of computer vision and has been the focus of research due to its wide range of applications. Although commonly used object detection model Faster R-CNN has achieved good results. However, there are still some shortcomings in the specific task of detecting pedestrians. This paper made three improvements to the Faster R-CNN to better adapt it to the pedestrian detection task. First, we did a lot of experiments and finally chose MobileNetv2 as our backbone network. Second, we designed a multi-branch feature pyramid network (M-FPN), which is used to better integrate the model's shallow feature information with the deep feature information improved the model's ability to detect pedestrians. Finally, an attention region proposal network SE-RPN is used to improve the model's ability to focus on pedestrian features and suppress attention to background interference features. The experimental results show that the improvement strategy proposed in this paper has achieved better results. These strategies improve the average accuracy of Faster R-CNN on our self-built dataset by 6.14% and the detection speed by 27fps. The AP on Caltech dataset reaches 87.01%, and the detection speed can achieve 39.4fps.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"28 1","pages":"76-80"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pedestrian detection is an important branch of computer vision and has been the focus of research due to its wide range of applications. Although commonly used object detection model Faster R-CNN has achieved good results. However, there are still some shortcomings in the specific task of detecting pedestrians. This paper made three improvements to the Faster R-CNN to better adapt it to the pedestrian detection task. First, we did a lot of experiments and finally chose MobileNetv2 as our backbone network. Second, we designed a multi-branch feature pyramid network (M-FPN), which is used to better integrate the model's shallow feature information with the deep feature information improved the model's ability to detect pedestrians. Finally, an attention region proposal network SE-RPN is used to improve the model's ability to focus on pedestrian features and suppress attention to background interference features. The experimental results show that the improvement strategy proposed in this paper has achieved better results. These strategies improve the average accuracy of Faster R-CNN on our self-built dataset by 6.14% and the detection speed by 27fps. The AP on Caltech dataset reaches 87.01%, and the detection speed can achieve 39.4fps.