Online indoor visual odometry with semantic assistance under implicit epipolar constraints

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Pub Date : 2024-11-08 DOI:10.1016/j.patcog.2024.111150

Yang Chen , Lin Zhang , Shengjie Zhao , Yicong Zhou

{"title":"Online indoor visual odometry with semantic assistance under implicit epipolar constraints","authors":"Yang Chen , Lin Zhang , Shengjie Zhao , Yicong Zhou","doi":"10.1016/j.patcog.2024.111150","DOIUrl":null,"url":null,"abstract":"<div><div>Among solutions to the tasks of indoor localization and reconstruction, compared with traditional SLAM (Simultaneous Localization And Mapping), learning-based VO (Visual Odometry) has gained more and more popularity due to its robustness and low cost. However, the performance of existing indoor deep VOs is still limited in comparison with their outdoor counterparts mainly owing to large areas of textureless regions and complex indoor motions containing much more rotations. In this paper, the above two challenges are carefully tackled with the proposed SEOVO (Semantic Epipolar-constrained Online VO). On the one hand, as far as we know, SEOVO is the first semantic-aided VO under an online adaptive framework, which adaptively reconstructs low-texture planes without any supervision. On the other hand, we introduce the epipolar geometric constraint in an implicit way for improving the accuracy of pose estimation without destroying the global scale consistency. The efficiency and efficacy of SEOVO have been corroborated by extensive experiments conducted on both public datasets and our collected video sequences.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"159 ","pages":"Article 111150"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320324009014","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Among solutions to the tasks of indoor localization and reconstruction, compared with traditional SLAM (Simultaneous Localization And Mapping), learning-based VO (Visual Odometry) has gained more and more popularity due to its robustness and low cost. However, the performance of existing indoor deep VOs is still limited in comparison with their outdoor counterparts mainly owing to large areas of textureless regions and complex indoor motions containing much more rotations. In this paper, the above two challenges are carefully tackled with the proposed SEOVO (Semantic Epipolar-constrained Online VO). On the one hand, as far as we know, SEOVO is the first semantic-aided VO under an online adaptive framework, which adaptively reconstructs low-texture planes without any supervision. On the other hand, we introduce the epipolar geometric constraint in an implicit way for improving the accuracy of pose estimation without destroying the global scale consistency. The efficiency and efficacy of SEOVO have been corroborated by extensive experiments conducted on both public datasets and our collected video sequences.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

隐式极点约束下的语义辅助在线室内视觉里程测量法

在室内定位和重建任务的解决方案中，与传统的 SLAM（同步定位和绘图）相比，基于学习的 VO（视觉轨迹测量）因其鲁棒性和低成本而越来越受欢迎。然而，与室外相比，现有的室内深度 VO 性能仍然有限，这主要是由于室内存在大面积的无纹理区域和包含更多旋转的复杂室内运动。本文提出的 SEOVO（Semantic Epipolar-constrained Online VO）可以很好地解决上述两个难题。一方面，据我们所知，SEOVO 是第一个在线自适应框架下的语义辅助 VO，它可以在没有任何监督的情况下自适应地重建低纹理平面。另一方面，我们以隐含的方式引入了外极点几何约束，在不破坏全局尺度一致性的前提下提高了姿态估计的准确性。在公共数据集和我们收集的视频序列上进行的大量实验证实了 SEOVO 的效率和功效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.

期刊最新文献

Editorial Board One-hot constrained symmetric nonnegative matrix factorization for image clustering Collaboratively enhanced and integrated detail-context information for low-light image enhancement A novel 6DoF pose estimation method using transformer fusion SS ViT: Observing pathologies of multi-layer perceptron weights and re-setting vision transformer