Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments

IF 8.4 1区工程技术 Q1 ENGINEERING, CIVIL IEEE Transactions on Intelligent Transportation Systems Pub Date : 2025-01-13 DOI:10.1109/TITS.2024.3520177

Zhuhua Hu;Wenlu Qi;Kunkun Ding;Hao Qi;Yaochi Zhao;Xuebo Zhang;Mingfeng Wang

{"title":"Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments","authors":"Zhuhua Hu;Wenlu Qi;Kunkun Ding;Hao Qi;Yaochi Zhao;Xuebo Zhang;Mingfeng Wang","doi":"10.1109/TITS.2024.3520177","DOIUrl":null,"url":null,"abstract":"VSLAM is one of the key technologies for indoor mobile robots, used to perceive the surrounding environment, achieve accurate positioning and mapping. However, traditional VSLAM algorithms based on the assumption of a static environment still face certain challenges. The movement, occlusion, and appearance changes of dynamic objects can lead to feature point-matching errors, making data association difficult and causing biases in motion estimation. In order to address this challenge, this paper proposes a dynamic feature point removal method and a closed-loop detection method for high dynamic scenes, aiming to effectively improve the robustness and positioning accuracy in dynamic environments. First, the YOLOv7-tiny object detection network and LK optical flow algorithm are combined to detect the dynamic area, and the adaptive threshold keyframe selection method is adopted to solve the problem of poor quality of keyframe caused by the existing heuristic threshold selection method. Then, this paper proposes a dynamic keyframe sequence creation method based on the angle difference between keyframes, which reduces the workload of loop back detection and accelerates the efficiency of loop back detection in the system. Next, the ParC_NetVLAD image matching algorithm is proposed. In this paper, ConvNeXt-Tiny network is used for feature extraction of images, and ParC-Net network and CBAM attention mechanism are added to the feature extraction network. Finally, NetVLAD is used to cluster the extracted local features to obtain global features that can represent images. Experiments are conducted on public TUM RGB-D datasets and in real-world situations. The proposed algorithm reduces the ATE (Absolute Trajectory Error) by 96.4% and the RPE (Relative Trajectory Error) by 82.8% on average in highly dynamic scenarios. In the Pittsburgh30k dataset, the average accuracy of loop closure detection has been improved by 2.6%.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 3","pages":"3101-3114"},"PeriodicalIF":8.4000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10838289/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

Abstract

VSLAM is one of the key technologies for indoor mobile robots, used to perceive the surrounding environment, achieve accurate positioning and mapping. However, traditional VSLAM algorithms based on the assumption of a static environment still face certain challenges. The movement, occlusion, and appearance changes of dynamic objects can lead to feature point-matching errors, making data association difficult and causing biases in motion estimation. In order to address this challenge, this paper proposes a dynamic feature point removal method and a closed-loop detection method for high dynamic scenes, aiming to effectively improve the robustness and positioning accuracy in dynamic environments. First, the YOLOv7-tiny object detection network and LK optical flow algorithm are combined to detect the dynamic area, and the adaptive threshold keyframe selection method is adopted to solve the problem of poor quality of keyframe caused by the existing heuristic threshold selection method. Then, this paper proposes a dynamic keyframe sequence creation method based on the angle difference between keyframes, which reduces the workload of loop back detection and accelerates the efficiency of loop back detection in the system. Next, the ParC_NetVLAD image matching algorithm is proposed. In this paper, ConvNeXt-Tiny network is used for feature extraction of images, and ParC-Net network and CBAM attention mechanism are added to the feature extraction network. Finally, NetVLAD is used to cluster the extracted local features to obtain global features that can represent images. Experiments are conducted on public TUM RGB-D datasets and in real-world situations. The proposed algorithm reduces the ATE (Absolute Trajectory Error) by 96.4% and the RPE (Relative Trajectory Error) by 82.8% on average in highly dynamic scenarios. In the Pittsburgh30k dataset, the average accuracy of loop closure detection has been improved by 2.6%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高动态室内环境下VSLAM的特征点和关键帧优化方法

VSLAM是室内移动机器人的关键技术之一，用于感知周围环境，实现精确定位和测绘。然而，传统的基于静态环境假设的VSLAM算法仍然面临着一定的挑战。动态对象的运动、遮挡和外观变化会导致特征点匹配错误，使数据关联困难，并导致运动估计偏差。针对这一挑战，本文提出了一种针对高动态场景的动态特征点去除方法和闭环检测方法，旨在有效提高动态环境下的鲁棒性和定位精度。首先，结合yolov7微小目标检测网络和LK光流算法对动态区域进行检测，并采用自适应阈值关键帧选择方法解决现有启发式阈值选择方法导致关键帧质量差的问题。然后，本文提出了一种基于关键帧之间角度差的动态关键帧序列创建方法，减少了回环检测的工作量，提高了系统的回环检测效率。其次，提出了ParC_NetVLAD图像匹配算法。本文采用ConvNeXt-Tiny网络对图像进行特征提取，并在特征提取网络中加入ParC-Net网络和CBAM注意机制。最后，利用NetVLAD对提取的局部特征进行聚类，得到能够代表图像的全局特征。实验是在公开的TUM RGB-D数据集和实际情况下进行的。在高动态场景下，该算法的绝对弹道误差（ATE）平均降低96.4%，相对弹道误差（RPE）平均降低82.8%。在匹兹堡30k数据集中，闭环检测的平均准确率提高了2.6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Intelligent Transportation Systems 工程技术-工程：电子与电气

CiteScore

14.80

自引率

12.90%

发文量

1872

审稿时长

7.5 months

期刊介绍： The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.