Simone Magistri, Francesco Sambo, Fabio Schoen, Douglas Coimbra de Andrade, Matteo Simoncini, Stefano Caprasecca, Luca Kubin, L. Bravi, L. Taccari
{"title":"基于行车记录仪图像的车辆视点估计的轻量级深度学习模型","authors":"Simone Magistri, Francesco Sambo, Fabio Schoen, Douglas Coimbra de Andrade, Matteo Simoncini, Stefano Caprasecca, Luca Kubin, L. Bravi, L. Taccari","doi":"10.1109/ITSC45102.2020.9294672","DOIUrl":null,"url":null,"abstract":"Vehicle viewpoint estimation from vehicle cameras is a crucial component of road scene understanding.In this paper, we propose a deep lightweight method to predict vehicle viewpoint from a single RGB dashcam image. To this aim, we customize and adapt state-of-the-art deep learning techniques for general object viewpoint estimation to the vehicle viewpoint estimation task. Furthermore, we define a novel objective function that takes into account errors at different granularity to improve neural network training. To keep the model lightweight and fast, we rely upon MobileNetV2 as backbone.Tested both on benchmark viewpoint estimation data (Pascal3D+) and on actual vehicle camera data (nuScenes), our method is shown to outperform the state of the art in vehicle viewpoint estimation, in terms of both accuracy and memory footprint.","PeriodicalId":394538,"journal":{"name":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Lightweight Deep Learning Model for Vehicle Viewpoint Estimation from Dashcam Images\",\"authors\":\"Simone Magistri, Francesco Sambo, Fabio Schoen, Douglas Coimbra de Andrade, Matteo Simoncini, Stefano Caprasecca, Luca Kubin, L. Bravi, L. Taccari\",\"doi\":\"10.1109/ITSC45102.2020.9294672\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vehicle viewpoint estimation from vehicle cameras is a crucial component of road scene understanding.In this paper, we propose a deep lightweight method to predict vehicle viewpoint from a single RGB dashcam image. To this aim, we customize and adapt state-of-the-art deep learning techniques for general object viewpoint estimation to the vehicle viewpoint estimation task. Furthermore, we define a novel objective function that takes into account errors at different granularity to improve neural network training. To keep the model lightweight and fast, we rely upon MobileNetV2 as backbone.Tested both on benchmark viewpoint estimation data (Pascal3D+) and on actual vehicle camera data (nuScenes), our method is shown to outperform the state of the art in vehicle viewpoint estimation, in terms of both accuracy and memory footprint.\",\"PeriodicalId\":394538,\"journal\":{\"name\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITSC45102.2020.9294672\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC45102.2020.9294672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Lightweight Deep Learning Model for Vehicle Viewpoint Estimation from Dashcam Images
Vehicle viewpoint estimation from vehicle cameras is a crucial component of road scene understanding.In this paper, we propose a deep lightweight method to predict vehicle viewpoint from a single RGB dashcam image. To this aim, we customize and adapt state-of-the-art deep learning techniques for general object viewpoint estimation to the vehicle viewpoint estimation task. Furthermore, we define a novel objective function that takes into account errors at different granularity to improve neural network training. To keep the model lightweight and fast, we rely upon MobileNetV2 as backbone.Tested both on benchmark viewpoint estimation data (Pascal3D+) and on actual vehicle camera data (nuScenes), our method is shown to outperform the state of the art in vehicle viewpoint estimation, in terms of both accuracy and memory footprint.