N. Stojanović, Vasilije Pantić, Vladan Damjanović, S. Vukmirovic
{"title":"3D Vehicle Pose Estimation from an Image Using Geometry","authors":"N. Stojanović, Vasilije Pantić, Vladan Damjanović, S. Vukmirovic","doi":"10.1109/INFOTEH53737.2022.9769998","DOIUrl":null,"url":null,"abstract":"3D pose estimation is a quite fascinating problem in the computer vision field, which aims to get the 3D orientation of the object based on the 2D image and can be challenging to find the solution. These solutions are mostly used in the autonomous industry to properly detect cars’ orientation on street scenes. Most solutions and publicly available datasets refer to only one axis orientation (axis parallel to the road), while two others are set to zero. In this paper, we propose a new way of representing a 3D object orientation that is not limited only to the autonomous industry. Our proposal uses Euler angle representation and two coordinate systems (one from the camera and one from the detected object) to define angle orientation for each axis in 3D space. Because this approach is supposed to be used in creating datasets and solutions in the deep learning field, there are some restrictions to be considered to properly train your solution. To better represent the proposal of 3D orientation and its purpose, it is necessary to visualize the bounding box around the detected object, which will follow its orientation. The visualization part is being done using computer-generated images (CGI), while the algorithmic part uses quaternions instead of Euler angles for the rotation of the 3D bounding box.","PeriodicalId":6839,"journal":{"name":"2022 21st International Symposium INFOTEH-JAHORINA (INFOTEH)","volume":"21 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st International Symposium INFOTEH-JAHORINA (INFOTEH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOTEH53737.2022.9769998","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
3D pose estimation is a quite fascinating problem in the computer vision field, which aims to get the 3D orientation of the object based on the 2D image and can be challenging to find the solution. These solutions are mostly used in the autonomous industry to properly detect cars’ orientation on street scenes. Most solutions and publicly available datasets refer to only one axis orientation (axis parallel to the road), while two others are set to zero. In this paper, we propose a new way of representing a 3D object orientation that is not limited only to the autonomous industry. Our proposal uses Euler angle representation and two coordinate systems (one from the camera and one from the detected object) to define angle orientation for each axis in 3D space. Because this approach is supposed to be used in creating datasets and solutions in the deep learning field, there are some restrictions to be considered to properly train your solution. To better represent the proposal of 3D orientation and its purpose, it is necessary to visualize the bounding box around the detected object, which will follow its orientation. The visualization part is being done using computer-generated images (CGI), while the algorithmic part uses quaternions instead of Euler angles for the rotation of the 3D bounding box.