{"title":"基于多尺度CSPN和双重注意的行人检测研究","authors":"Xinxin Huang, Zhenyu Yin, Chao Fan","doi":"10.1109/ICTech55460.2022.00096","DOIUrl":null,"url":null,"abstract":"Pedestrian detection is a significant research direction in the computer vision, but the detection performance of existing pedestrian detection algorithms is inadequate. Therefore, this article proposes a novel algorithm to improve the anchor-free pedestrian detection algorithm. First, the multi-scale CSPN module is used to deepen the network depth, further extract semantic information on multiple scales, and improve detection performance. Moreover, the dual attention module based on feature fusion is used to effectively fuse features of different scales, assigning new weights to the fused features in the two dimensions of space and channel. Experiments show our method reduces MR−2 by 0.10%, 2.60% and 0.98% on the Reasonable, Heavy Occlusion and ALL of the Caltech pedestrian dataset, which is better than the existing algorithms.","PeriodicalId":290836,"journal":{"name":"2022 11th International Conference of Information and Communication Technology (ICTech))","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards Better Pedestrian Detection Using Multi-Scale CSPN and Dual Attention\",\"authors\":\"Xinxin Huang, Zhenyu Yin, Chao Fan\",\"doi\":\"10.1109/ICTech55460.2022.00096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pedestrian detection is a significant research direction in the computer vision, but the detection performance of existing pedestrian detection algorithms is inadequate. Therefore, this article proposes a novel algorithm to improve the anchor-free pedestrian detection algorithm. First, the multi-scale CSPN module is used to deepen the network depth, further extract semantic information on multiple scales, and improve detection performance. Moreover, the dual attention module based on feature fusion is used to effectively fuse features of different scales, assigning new weights to the fused features in the two dimensions of space and channel. Experiments show our method reduces MR−2 by 0.10%, 2.60% and 0.98% on the Reasonable, Heavy Occlusion and ALL of the Caltech pedestrian dataset, which is better than the existing algorithms.\",\"PeriodicalId\":290836,\"journal\":{\"name\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTech55460.2022.00096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference of Information and Communication Technology (ICTech))","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTech55460.2022.00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards Better Pedestrian Detection Using Multi-Scale CSPN and Dual Attention
Pedestrian detection is a significant research direction in the computer vision, but the detection performance of existing pedestrian detection algorithms is inadequate. Therefore, this article proposes a novel algorithm to improve the anchor-free pedestrian detection algorithm. First, the multi-scale CSPN module is used to deepen the network depth, further extract semantic information on multiple scales, and improve detection performance. Moreover, the dual attention module based on feature fusion is used to effectively fuse features of different scales, assigning new weights to the fused features in the two dimensions of space and channel. Experiments show our method reduces MR−2 by 0.10%, 2.60% and 0.98% on the Reasonable, Heavy Occlusion and ALL of the Caltech pedestrian dataset, which is better than the existing algorithms.