Haosen Wang, Tiankai Chen, Xiaohang Ji, Feng Qian, Yue Ma, Shifeng Wang
{"title":"LiDAR-camera-system-based unsupervised and weakly supervised 3D object detection.","authors":"Haosen Wang, Tiankai Chen, Xiaohang Ji, Feng Qian, Yue Ma, Shifeng Wang","doi":"10.1364/JOSAA.494980","DOIUrl":null,"url":null,"abstract":"<p><p>LiDAR camera systems are now becoming an important part of autonomous driving 3D object detection. Due to limitations in time and resources, only a few critical frames of the synchronized camera data and acquired LiDAR points may be annotated. However, there is still a large amount of unannotated data in practical applications. Therefore, we propose a LiDAR-camera-system-based unsupervised and weakly supervised (LCUW) network as a novel 3D object-detection method. When unannotated data are put into the network, we propose an independent learning mode, which is an unsupervised data preprocessing module. Meanwhile, for detection tasks with high accuracy requirements, we propose an Accompany Construction mode, which is a weakly supervised data preprocessing module that requires only a small amount of annotated data. Then, we generate high-quality training data from the remaining unlabeled data. We also propose a full aggregation bridge block in the feature-extraction part, which uses a stepwise fusion and deepening representation strategy to improve the accuracy. Our comparative, ablation, and runtime test experiments show that the proposed method performs well while advancing the application of LiDAR camera systems.</p>","PeriodicalId":17382,"journal":{"name":"Journal of The Optical Society of America A-optics Image Science and Vision","volume":"40 10","pages":"1849-1860"},"PeriodicalIF":1.4000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Optical Society of America A-optics Image Science and Vision","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1364/JOSAA.494980","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
LiDAR camera systems are now becoming an important part of autonomous driving 3D object detection. Due to limitations in time and resources, only a few critical frames of the synchronized camera data and acquired LiDAR points may be annotated. However, there is still a large amount of unannotated data in practical applications. Therefore, we propose a LiDAR-camera-system-based unsupervised and weakly supervised (LCUW) network as a novel 3D object-detection method. When unannotated data are put into the network, we propose an independent learning mode, which is an unsupervised data preprocessing module. Meanwhile, for detection tasks with high accuracy requirements, we propose an Accompany Construction mode, which is a weakly supervised data preprocessing module that requires only a small amount of annotated data. Then, we generate high-quality training data from the remaining unlabeled data. We also propose a full aggregation bridge block in the feature-extraction part, which uses a stepwise fusion and deepening representation strategy to improve the accuracy. Our comparative, ablation, and runtime test experiments show that the proposed method performs well while advancing the application of LiDAR camera systems.
期刊介绍:
The Journal of the Optical Society of America A (JOSA A) is devoted to developments in any field of classical optics, image science, and vision. JOSA A includes original peer-reviewed papers on such topics as:
* Atmospheric optics
* Clinical vision
* Coherence and Statistical Optics
* Color
* Diffraction and gratings
* Image processing
* Machine vision
* Physiological optics
* Polarization
* Scattering
* Signal processing
* Thin films
* Visual optics
Also: j opt soc am a.