{"title":"基于球面CNN的全向图像目标检测","authors":"Xingxing Li, Yu Liu, Yumei Wang","doi":"10.1109/IC-NIDC54101.2021.9660451","DOIUrl":null,"url":null,"abstract":"Omnidirectional cameras are gaining popularity in VR/AR applications and autonomous driving due to their wide field of view. However, the images produced by the cameras have geometric distortions especially in the polar regions. This distortion poses a great challenge to computer vision tasks such as object detection. In this paper, we propose a CNN architecture called spherical CNN which is designed for omnidirectional images. According to the mapping relationship between the sphere and plane, our spherical CNN changes the size of convolution kernel and the locations of sampling points at different latitudes to adapt the image distortion. In order to verify the effectiveness of spherical CNN for the omnidirectional image object detection task, it is applied to detection network SSD(Single Shot MultiBox Detector). In our experiments, we achieve a 2% improvement on the mAP75 which represents the accuracy of detection. The experimental results verify that spherical CNN can improve the detection performance for omnidirectional images.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Object Detection in Omnidirectional Images Based on Spherical CNN\",\"authors\":\"Xingxing Li, Yu Liu, Yumei Wang\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660451\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Omnidirectional cameras are gaining popularity in VR/AR applications and autonomous driving due to their wide field of view. However, the images produced by the cameras have geometric distortions especially in the polar regions. This distortion poses a great challenge to computer vision tasks such as object detection. In this paper, we propose a CNN architecture called spherical CNN which is designed for omnidirectional images. According to the mapping relationship between the sphere and plane, our spherical CNN changes the size of convolution kernel and the locations of sampling points at different latitudes to adapt the image distortion. In order to verify the effectiveness of spherical CNN for the omnidirectional image object detection task, it is applied to detection network SSD(Single Shot MultiBox Detector). In our experiments, we achieve a 2% improvement on the mAP75 which represents the accuracy of detection. The experimental results verify that spherical CNN can improve the detection performance for omnidirectional images.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660451\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object Detection in Omnidirectional Images Based on Spherical CNN
Omnidirectional cameras are gaining popularity in VR/AR applications and autonomous driving due to their wide field of view. However, the images produced by the cameras have geometric distortions especially in the polar regions. This distortion poses a great challenge to computer vision tasks such as object detection. In this paper, we propose a CNN architecture called spherical CNN which is designed for omnidirectional images. According to the mapping relationship between the sphere and plane, our spherical CNN changes the size of convolution kernel and the locations of sampling points at different latitudes to adapt the image distortion. In order to verify the effectiveness of spherical CNN for the omnidirectional image object detection task, it is applied to detection network SSD(Single Shot MultiBox Detector). In our experiments, we achieve a 2% improvement on the mAP75 which represents the accuracy of detection. The experimental results verify that spherical CNN can improve the detection performance for omnidirectional images.