{"title":"Visual Odometry integrated with Self-Supervised Monocular Depth Estimation","authors":"Xinyu Qi, Zhijun Fang, Shuqun Yang, Heng Zhou","doi":"10.1109/ICCEAI52939.2021.00090","DOIUrl":null,"url":null,"abstract":"To solve the problem of poor robustness of multiview geometric visual odometry in a dynamic environment, we propose a visual odometry method base on deep learning that combines depth estimation and geometric pose determination. First, we remove the time-consuming dense optical flow prediction network in GeoNet. We propose a method for image reconstruction at one frame interval to solve large displacement between moving objects. Second, we propose a way to find the best pixel to solve the problem of pixel occlusion in the image. Third, to further solve the uneven illumination of the image and the degradation of image quality, we propose an adaptive histogram equalization based on limited contrast to enhance the image. A large number of experimental demonstrations have been carried out on the KITTI public dataset. The experimental results show that our network reduces the time cost and complexity on the general evaluation index. It has a significant improvement compared with GeoNet and has achieved more accurate in-depth and position prediction results.","PeriodicalId":331409,"journal":{"name":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEAI52939.2021.00090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To solve the problem of poor robustness of multiview geometric visual odometry in a dynamic environment, we propose a visual odometry method base on deep learning that combines depth estimation and geometric pose determination. First, we remove the time-consuming dense optical flow prediction network in GeoNet. We propose a method for image reconstruction at one frame interval to solve large displacement between moving objects. Second, we propose a way to find the best pixel to solve the problem of pixel occlusion in the image. Third, to further solve the uneven illumination of the image and the degradation of image quality, we propose an adaptive histogram equalization based on limited contrast to enhance the image. A large number of experimental demonstrations have been carried out on the KITTI public dataset. The experimental results show that our network reduces the time cost and complexity on the general evaluation index. It has a significant improvement compared with GeoNet and has achieved more accurate in-depth and position prediction results.