Minjie Bao;Junguo Fan;Zhendong Fan;Runze Xu;Ke Wang;Chaonan Mu;Ruifeng Li;Hewen Zhou;Peng Kang
{"title":"FROMFusion: Fast and Robust On-Manifold Dense Reconstruction for Low-Cost Wheeled Robots","authors":"Minjie Bao;Junguo Fan;Zhendong Fan;Runze Xu;Ke Wang;Chaonan Mu;Ruifeng Li;Hewen Zhou;Peng Kang","doi":"10.1109/TIM.2024.3481537","DOIUrl":null,"url":null,"abstract":"Existing voxel-hashing (VH)-based dense reconstruction methods have shown impressive results on datasets collected by hand-held cameras. Large-scale scenes are maintained with a truncated signed distance function (TSDF) volumetric representation. However, practically deploying such methods on low-cost embedded mobile robots remains challenging due to heavy computational burdens and various camera perception degeneration cases. In this work, we propose FROMFusion, a fast and robust on-manifold dense reconstruction framework based on multisensor fusion, which systematically solves how to align the point cloud with the hashed TSDF volume (HTV). Its purely geometric nature ensures the robustness to image motion blur and poor lighting conditions. To reduce memory overhead, we propose a spherical-coordinate-based HTV segmentation algorithm. To surmount missing geometric features, camera occlusion, and over range, a loosely coupled LiDAR-wheel-inertial odometry (LWIO) is applied for trustworthy initial guesses in camera pose optimization. At its core is a two-stage depth-to-HTV matching algorithm, which includes a coarse voxel-level pose ergodic search and a fine subvoxel-level Gauss-Newton (GN) solver with Anderson acceleration (AA) strategy for faster convergence. We evenly distribute heavy computational workloads to heterogeneous computing systems. Extensive field experiments on a low-cost wheeled robot cleaner demonstrate our method models continuous surfaces of large-scale scenes with high quality in both geometry and texture, outperforming current state-of-the-art methods in robustness to camera perception degeneration cases by a significant margin. The frame rate of online embedded implementation can reach up to 47.21 Hz maximum.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"73 ","pages":"1-18"},"PeriodicalIF":5.6000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10720077/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Existing voxel-hashing (VH)-based dense reconstruction methods have shown impressive results on datasets collected by hand-held cameras. Large-scale scenes are maintained with a truncated signed distance function (TSDF) volumetric representation. However, practically deploying such methods on low-cost embedded mobile robots remains challenging due to heavy computational burdens and various camera perception degeneration cases. In this work, we propose FROMFusion, a fast and robust on-manifold dense reconstruction framework based on multisensor fusion, which systematically solves how to align the point cloud with the hashed TSDF volume (HTV). Its purely geometric nature ensures the robustness to image motion blur and poor lighting conditions. To reduce memory overhead, we propose a spherical-coordinate-based HTV segmentation algorithm. To surmount missing geometric features, camera occlusion, and over range, a loosely coupled LiDAR-wheel-inertial odometry (LWIO) is applied for trustworthy initial guesses in camera pose optimization. At its core is a two-stage depth-to-HTV matching algorithm, which includes a coarse voxel-level pose ergodic search and a fine subvoxel-level Gauss-Newton (GN) solver with Anderson acceleration (AA) strategy for faster convergence. We evenly distribute heavy computational workloads to heterogeneous computing systems. Extensive field experiments on a low-cost wheeled robot cleaner demonstrate our method models continuous surfaces of large-scale scenes with high quality in both geometry and texture, outperforming current state-of-the-art methods in robustness to camera perception degeneration cases by a significant margin. The frame rate of online embedded implementation can reach up to 47.21 Hz maximum.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.