D. Yudin, Yaroslav K. Solomentsev, R. Musaev, A. Staroverov, A. Panov
{"title":"HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images","authors":"D. Yudin, Yaroslav K. Solomentsev, R. Musaev, A. Staroverov, A. Panov","doi":"10.48550/arXiv.2212.14649","DOIUrl":null,"url":null,"abstract":"We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment and loop detection in simultaneous localization and mapping. The loop detection sub-task is especially relevant when a robot with an on-board RGB-D camera can drive past the same place (``Point\") at different angles. The dataset is based on the popular Habitat simulator, in which it is possible to generate photorealistic indoor scenes using both own sensor data and open datasets, such as Matterport3D. To study the main stages of solving the place recognition problem on the HPointLoc dataset, we proposed a new modular approach named as PNTR. It first performs an image retrieval with the Patch-NetVLAD method, then extracts keypoints and matches them using R2D2, LoFTR or SuperPoint with SuperGlue, and finally performs a camera pose optimization step with TEASER++. Such a solution to the place recognition problem has not been previously studied in existing publications. The PNTR approach has shown the best quality metrics on the HPointLoc dataset and has a high potential for real use in localization systems for unmanned vehicles. The proposed dataset and framework are publicly available: https://github.com/metra4ok/HPointLoc.","PeriodicalId":281152,"journal":{"name":"International Conference on Neural Information Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Neural Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.14649","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment and loop detection in simultaneous localization and mapping. The loop detection sub-task is especially relevant when a robot with an on-board RGB-D camera can drive past the same place (``Point") at different angles. The dataset is based on the popular Habitat simulator, in which it is possible to generate photorealistic indoor scenes using both own sensor data and open datasets, such as Matterport3D. To study the main stages of solving the place recognition problem on the HPointLoc dataset, we proposed a new modular approach named as PNTR. It first performs an image retrieval with the Patch-NetVLAD method, then extracts keypoints and matches them using R2D2, LoFTR or SuperPoint with SuperGlue, and finally performs a camera pose optimization step with TEASER++. Such a solution to the place recognition problem has not been previously studied in existing publications. The PNTR approach has shown the best quality metrics on the HPointLoc dataset and has a high potential for real use in localization systems for unmanned vehicles. The proposed dataset and framework are publicly available: https://github.com/metra4ok/HPointLoc.
我们提出了一个名为HPointLoc的新数据集,专门用于探索室内环境中的视觉位置识别和同时定位和地图绘制中的环路检测能力。当带有车载RGB-D摄像头的机器人可以以不同角度经过同一地点(“点”)时,环路检测子任务尤为重要。该数据集基于流行的生境模拟器,其中可以使用自己的传感器数据和开放数据集(如Matterport3D)生成逼真的室内场景。为了研究在HPointLoc数据集上解决位置识别问题的主要阶段,我们提出了一种新的模块化方法PNTR。首先使用Patch-NetVLAD方法进行图像检索,然后使用R2D2, LoFTR或SuperPoint with SuperGlue提取关键点并进行匹配,最后使用TEASER++进行相机姿态优化步骤。这种位置识别问题的解决方案在现有出版物中尚未进行过研究。PNTR方法在HPointLoc数据集上显示出了最好的质量指标,并且在无人驾驶车辆的定位系统中具有很高的实际应用潜力。建议的数据集和框架是公开的:https://github.com/metra4ok/HPointLoc。