{"title":"A Real-Time and Lightweight Monocular 3-D Object Detector on CPU-Based Edge Devices for UGV's Indoor SLAM Systems","authors":"Bowen Chen;Yan Zhuang;Sen Wang","doi":"10.1109/TMECH.2024.3484431","DOIUrl":null,"url":null,"abstract":"In this article, we introduce RLMono3d, a real-time monocular 3-D object detector designed to run efficiently on edge devices with just one CPU. This approach is specifically applied to the indoor simultaneous localization and mapping (SLAM) system on the unmanned ground vehicle (UGV) platform. The spatial projection model for 3-D cuboids is proposed using projective geometry to determine the topological relationship between the cuboid projection and the bounding box (Bbox). Utilizing the theory above, we propose a real-time lightweight monocular 3-D object detector that is designed to be efficient and lightweight by using 2-D object and line segment detectors to generate 3-D object Bboxes. Furthermore, this approach is integrated into the SLAM system to create a complete semantic SLAM system, which is a typical application of our approach. Experimental results on multiple indoor benchmarking datasets show that the proposed monocular 3-D object detection algorithm outperforms benchmark methods. To validate our approach on the UGVs, we also conducted experiments using the quadruped robot platform. The results demonstrate that our approach can operate on edge devices of UGV platforms and accurately detect 3-D objects in real time.","PeriodicalId":13372,"journal":{"name":"IEEE/ASME Transactions on Mechatronics","volume":"30 5","pages":"3792-3802"},"PeriodicalIF":7.3000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE/ASME Transactions on Mechatronics","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10754906/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In this article, we introduce RLMono3d, a real-time monocular 3-D object detector designed to run efficiently on edge devices with just one CPU. This approach is specifically applied to the indoor simultaneous localization and mapping (SLAM) system on the unmanned ground vehicle (UGV) platform. The spatial projection model for 3-D cuboids is proposed using projective geometry to determine the topological relationship between the cuboid projection and the bounding box (Bbox). Utilizing the theory above, we propose a real-time lightweight monocular 3-D object detector that is designed to be efficient and lightweight by using 2-D object and line segment detectors to generate 3-D object Bboxes. Furthermore, this approach is integrated into the SLAM system to create a complete semantic SLAM system, which is a typical application of our approach. Experimental results on multiple indoor benchmarking datasets show that the proposed monocular 3-D object detection algorithm outperforms benchmark methods. To validate our approach on the UGVs, we also conducted experiments using the quadruped robot platform. The results demonstrate that our approach can operate on edge devices of UGV platforms and accurately detect 3-D objects in real time.
期刊介绍:
IEEE/ASME Transactions on Mechatronics publishes high quality technical papers on technological advances in mechatronics. A primary purpose of the IEEE/ASME Transactions on Mechatronics is to have an archival publication which encompasses both theory and practice. Papers published in the IEEE/ASME Transactions on Mechatronics disclose significant new knowledge needed to implement intelligent mechatronics systems, from analysis and design through simulation and hardware and software implementation. The Transactions also contains a letters section dedicated to rapid publication of short correspondence items concerning new research results.