LiDAR-camera-system-based unsupervised and weakly supervised 3D object detection.

IF 1.4 3区物理与天体物理 Q3 OPTICS Journal of The Optical Society of America A-optics Image Science and Vision Pub Date : 2023-10-01 DOI:10.1364/JOSAA.494980

Haosen Wang, Tiankai Chen, Xiaohang Ji, Feng Qian, Yue Ma, Shifeng Wang

{"title":"LiDAR-camera-system-based unsupervised and weakly supervised 3D object detection.","authors":"Haosen Wang, Tiankai Chen, Xiaohang Ji, Feng Qian, Yue Ma, Shifeng Wang","doi":"10.1364/JOSAA.494980","DOIUrl":null,"url":null,"abstract":"<p><p>LiDAR camera systems are now becoming an important part of autonomous driving 3D object detection. Due to limitations in time and resources, only a few critical frames of the synchronized camera data and acquired LiDAR points may be annotated. However, there is still a large amount of unannotated data in practical applications. Therefore, we propose a LiDAR-camera-system-based unsupervised and weakly supervised (LCUW) network as a novel 3D object-detection method. When unannotated data are put into the network, we propose an independent learning mode, which is an unsupervised data preprocessing module. Meanwhile, for detection tasks with high accuracy requirements, we propose an Accompany Construction mode, which is a weakly supervised data preprocessing module that requires only a small amount of annotated data. Then, we generate high-quality training data from the remaining unlabeled data. We also propose a full aggregation bridge block in the feature-extraction part, which uses a stepwise fusion and deepening representation strategy to improve the accuracy. Our comparative, ablation, and runtime test experiments show that the proposed method performs well while advancing the application of LiDAR camera systems.</p>","PeriodicalId":17382,"journal":{"name":"Journal of The Optical Society of America A-optics Image Science and Vision","volume":"40 10","pages":"1849-1860"},"PeriodicalIF":1.4000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Optical Society of America A-optics Image Science and Vision","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1364/JOSAA.494980","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

Abstract

LiDAR camera systems are now becoming an important part of autonomous driving 3D object detection. Due to limitations in time and resources, only a few critical frames of the synchronized camera data and acquired LiDAR points may be annotated. However, there is still a large amount of unannotated data in practical applications. Therefore, we propose a LiDAR-camera-system-based unsupervised and weakly supervised (LCUW) network as a novel 3D object-detection method. When unannotated data are put into the network, we propose an independent learning mode, which is an unsupervised data preprocessing module. Meanwhile, for detection tasks with high accuracy requirements, we propose an Accompany Construction mode, which is a weakly supervised data preprocessing module that requires only a small amount of annotated data. Then, we generate high-quality training data from the remaining unlabeled data. We also propose a full aggregation bridge block in the feature-extraction part, which uses a stepwise fusion and deepening representation strategy to improve the accuracy. Our comparative, ablation, and runtime test experiments show that the proposed method performs well while advancing the application of LiDAR camera systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于LiDAR相机系统的无监督和弱监督三维物体检测。

激光雷达相机系统正成为自动驾驶三维物体检测的重要组成部分。由于时间和资源的限制，只有同步相机数据和获取的激光雷达点的几个关键帧可以被注释。然而，在实际应用中仍然存在大量未注释的数据。因此，我们提出了一种基于LiDAR相机系统的无监督和弱监督（LCUW）网络作为一种新的三维目标检测方法。当未标记的数据被放入网络中时，我们提出了一种独立的学习模式，即无监督的数据预处理模块。同时，对于精度要求高的检测任务，我们提出了一种伴随构建模式，这是一种弱监督的数据预处理模块，只需要少量的注释数据。然后，我们从剩余的未标记数据中生成高质量的训练数据。我们还在特征提取部分提出了一种全聚合桥接块，该桥接块使用逐步融合和深化表示策略来提高精度。我们的对比、烧蚀和运行时测试实验表明，该方法在推进激光雷达相机系统应用的同时表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of The Optical Society of America A-optics Image Science and Vision 物理-光学

CiteScore

3.40

自引率

10.50%

发文量

417

审稿时长

3 months

期刊介绍： The Journal of the Optical Society of America A (JOSA A) is devoted to developments in any field of classical optics, image science, and vision. JOSA A includes original peer-reviewed papers on such topics as: * Atmospheric optics * Clinical vision * Coherence and Statistical Optics * Color * Diffraction and gratings * Image processing * Machine vision * Physiological optics * Polarization * Scattering * Signal processing * Thin films * Visual optics Also: j opt soc am a.