Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments

IF 8.4 1区 工程技术 Q1 ENGINEERING, CIVIL IEEE Transactions on Intelligent Transportation Systems Pub Date : 2025-01-13 DOI:10.1109/TITS.2024.3520177
Zhuhua Hu;Wenlu Qi;Kunkun Ding;Hao Qi;Yaochi Zhao;Xuebo Zhang;Mingfeng Wang
{"title":"Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments","authors":"Zhuhua Hu;Wenlu Qi;Kunkun Ding;Hao Qi;Yaochi Zhao;Xuebo Zhang;Mingfeng Wang","doi":"10.1109/TITS.2024.3520177","DOIUrl":null,"url":null,"abstract":"VSLAM is one of the key technologies for indoor mobile robots, used to perceive the surrounding environment, achieve accurate positioning and mapping. However, traditional VSLAM algorithms based on the assumption of a static environment still face certain challenges. The movement, occlusion, and appearance changes of dynamic objects can lead to feature point-matching errors, making data association difficult and causing biases in motion estimation. In order to address this challenge, this paper proposes a dynamic feature point removal method and a closed-loop detection method for high dynamic scenes, aiming to effectively improve the robustness and positioning accuracy in dynamic environments. First, the YOLOv7-tiny object detection network and LK optical flow algorithm are combined to detect the dynamic area, and the adaptive threshold keyframe selection method is adopted to solve the problem of poor quality of keyframe caused by the existing heuristic threshold selection method. Then, this paper proposes a dynamic keyframe sequence creation method based on the angle difference between keyframes, which reduces the workload of loop back detection and accelerates the efficiency of loop back detection in the system. Next, the ParC_NetVLAD image matching algorithm is proposed. In this paper, ConvNeXt-Tiny network is used for feature extraction of images, and ParC-Net network and CBAM attention mechanism are added to the feature extraction network. Finally, NetVLAD is used to cluster the extracted local features to obtain global features that can represent images. Experiments are conducted on public TUM RGB-D datasets and in real-world situations. The proposed algorithm reduces the ATE (Absolute Trajectory Error) by 96.4% and the RPE (Relative Trajectory Error) by 82.8% on average in highly dynamic scenarios. In the Pittsburgh30k dataset, the average accuracy of loop closure detection has been improved by 2.6%.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 3","pages":"3101-3114"},"PeriodicalIF":8.4000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10838289/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

VSLAM is one of the key technologies for indoor mobile robots, used to perceive the surrounding environment, achieve accurate positioning and mapping. However, traditional VSLAM algorithms based on the assumption of a static environment still face certain challenges. The movement, occlusion, and appearance changes of dynamic objects can lead to feature point-matching errors, making data association difficult and causing biases in motion estimation. In order to address this challenge, this paper proposes a dynamic feature point removal method and a closed-loop detection method for high dynamic scenes, aiming to effectively improve the robustness and positioning accuracy in dynamic environments. First, the YOLOv7-tiny object detection network and LK optical flow algorithm are combined to detect the dynamic area, and the adaptive threshold keyframe selection method is adopted to solve the problem of poor quality of keyframe caused by the existing heuristic threshold selection method. Then, this paper proposes a dynamic keyframe sequence creation method based on the angle difference between keyframes, which reduces the workload of loop back detection and accelerates the efficiency of loop back detection in the system. Next, the ParC_NetVLAD image matching algorithm is proposed. In this paper, ConvNeXt-Tiny network is used for feature extraction of images, and ParC-Net network and CBAM attention mechanism are added to the feature extraction network. Finally, NetVLAD is used to cluster the extracted local features to obtain global features that can represent images. Experiments are conducted on public TUM RGB-D datasets and in real-world situations. The proposed algorithm reduces the ATE (Absolute Trajectory Error) by 96.4% and the RPE (Relative Trajectory Error) by 82.8% on average in highly dynamic scenarios. In the Pittsburgh30k dataset, the average accuracy of loop closure detection has been improved by 2.6%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高动态室内环境下VSLAM的特征点和关键帧优化方法
VSLAM是室内移动机器人的关键技术之一,用于感知周围环境,实现精确定位和测绘。然而,传统的基于静态环境假设的VSLAM算法仍然面临着一定的挑战。动态对象的运动、遮挡和外观变化会导致特征点匹配错误,使数据关联困难,并导致运动估计偏差。针对这一挑战,本文提出了一种针对高动态场景的动态特征点去除方法和闭环检测方法,旨在有效提高动态环境下的鲁棒性和定位精度。首先,结合yolov7微小目标检测网络和LK光流算法对动态区域进行检测,并采用自适应阈值关键帧选择方法解决现有启发式阈值选择方法导致关键帧质量差的问题。然后,本文提出了一种基于关键帧之间角度差的动态关键帧序列创建方法,减少了回环检测的工作量,提高了系统的回环检测效率。其次,提出了ParC_NetVLAD图像匹配算法。本文采用ConvNeXt-Tiny网络对图像进行特征提取,并在特征提取网络中加入ParC-Net网络和CBAM注意机制。最后,利用NetVLAD对提取的局部特征进行聚类,得到能够代表图像的全局特征。实验是在公开的TUM RGB-D数据集和实际情况下进行的。在高动态场景下,该算法的绝对弹道误差(ATE)平均降低96.4%,相对弹道误差(RPE)平均降低82.8%。在匹兹堡30k数据集中,闭环检测的平均准确率提高了2.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Intelligent Transportation Systems
IEEE Transactions on Intelligent Transportation Systems 工程技术-工程:电子与电气
CiteScore
14.80
自引率
12.90%
发文量
1872
审稿时长
7.5 months
期刊介绍: The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.
期刊最新文献
2025 Index IEEE Transactions on Intelligent Transportation Systems IEEE Intelligent Transportation Systems Society Information IEEE Intelligent Transportation Systems Society Information Wireless Channel as a Sensor: An Anti-Electromagnetic Interference Vehicle Detection Method Based on Wireless Sensing Technology Bicycle Travel Time Estimation via Dual Graph-Based Neural Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1