{"title":"Camera-LiDAR Fusion With Latent Correlation for Cross-Scene Place Recognition","authors":"Yan Pan;Jiapeng Xie;Jiajie Wu;Bo Zhou","doi":"10.1109/TIE.2024.3440470","DOIUrl":null,"url":null,"abstract":"As a crucial issue for outdoor mobile robot navigation, place recognition (PR) remains challenging in long-term cross-scene applications, necessitating enhanced robustness of PR algorithms. In this article, a novel multimodal PR approach is proposed, which deeply fuses the camera and LiDAR to effectively compensate for their respective limitations. The introduced place descriptor consists of three branches: image-based, point cloud-based, and fusion-based. Specifically, the fusion-based branch employs a dual-stage pipeline, leveraging the latent correlation between the two modalities for information interaction and conducting channel-level fusion. Moreover, implicit alignment in the fusion branch, coupled with information supplementation from the two single-modal branches, better addresses the information loss caused by limited overlap in their field-of-view (FoV). In extensive experiments on Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI), North Campus Long-Term (NCLT), USVInland datasets, and the real-world campus test, the proposed method stands out in precision-recall curves with an average maximum F1 score of 0.949, demonstrating superior robustness and generalization compared with other state-of-the-art methods.","PeriodicalId":13402,"journal":{"name":"IEEE Transactions on Industrial Electronics","volume":"72 3","pages":"2801-2809"},"PeriodicalIF":7.2000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10665753/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
As a crucial issue for outdoor mobile robot navigation, place recognition (PR) remains challenging in long-term cross-scene applications, necessitating enhanced robustness of PR algorithms. In this article, a novel multimodal PR approach is proposed, which deeply fuses the camera and LiDAR to effectively compensate for their respective limitations. The introduced place descriptor consists of three branches: image-based, point cloud-based, and fusion-based. Specifically, the fusion-based branch employs a dual-stage pipeline, leveraging the latent correlation between the two modalities for information interaction and conducting channel-level fusion. Moreover, implicit alignment in the fusion branch, coupled with information supplementation from the two single-modal branches, better addresses the information loss caused by limited overlap in their field-of-view (FoV). In extensive experiments on Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI), North Campus Long-Term (NCLT), USVInland datasets, and the real-world campus test, the proposed method stands out in precision-recall curves with an average maximum F1 score of 0.949, demonstrating superior robustness and generalization compared with other state-of-the-art methods.
位置识别作为户外移动机器人导航的关键问题,在长期的跨场景应用中仍然具有挑战性,需要增强位置识别算法的鲁棒性。本文提出了一种新的多模态PR方法,该方法将相机和激光雷达深度融合,有效地弥补了它们各自的局限性。所介绍的位置描述符包括三个分支:基于图像的、基于点云的和基于融合的。具体而言,基于融合的分支采用双级管道,利用两种模式之间的潜在相关性进行信息交互并进行通道级融合。此外,融合分支中的隐式对齐,加上两个单模态分支的信息补充,更好地解决了视场(FoV)有限重叠造成的信息丢失问题。在Karlsruhe Institute of Technology and Toyota Institute (KITTI)、North Campus long (NCLT)、USVInland数据集和真实校园测试上进行的大量实验中,该方法在precision-recall曲线上表现突出,平均最高F1得分为0.949,与其他先进方法相比具有更强的鲁棒性和泛化性。
期刊介绍:
Journal Name: IEEE Transactions on Industrial Electronics
Publication Frequency: Monthly
Scope:
The scope of IEEE Transactions on Industrial Electronics encompasses the following areas:
Applications of electronics, controls, and communications in industrial and manufacturing systems and processes.
Power electronics and drive control techniques.
System control and signal processing.
Fault detection and diagnosis.
Power systems.
Instrumentation, measurement, and testing.
Modeling and simulation.
Motion control.
Robotics.
Sensors and actuators.
Implementation of neural networks, fuzzy logic, and artificial intelligence in industrial systems.
Factory automation.
Communication and computer networks.