{"title":"A Novel Network Architecture and Training Strategies for Camera-Radar 3D Detection","authors":"Sin-Ye Jhong, Hsin-Chun Lin, Xu-Xiang Weng, Ting-Feng Xie, Han-Wei Lin, Yung-Yao Chen","doi":"10.1109/ICCE-Taiwan58799.2023.10226927","DOIUrl":null,"url":null,"abstract":"Intelligent vehicles rely on millimeter-wave radar and machine vision to perceive their surroundings. However, the considerable differences in the features of radar point clouds and those of image pixels make it difficult for models to perform effective fusion. Moreover, high-frequency noise in images can impede the extraction of meaningful features. This paper proposes a novel 3D object detection method that combines millimeter-wave radar and RGB camera data. Our approach includes a gaussian filter for preprocessing, a hierarchical model architecture for fusing radar and image information, and a training stabilization strategy. We evaluated our method using the challenging NuScenes and Taiwan street databases and found that it outperformed the popular CenterFusion model in terms of detection performance. In addition, our method is applicable to a variety of scenarios in Taiwan.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Intelligent vehicles rely on millimeter-wave radar and machine vision to perceive their surroundings. However, the considerable differences in the features of radar point clouds and those of image pixels make it difficult for models to perform effective fusion. Moreover, high-frequency noise in images can impede the extraction of meaningful features. This paper proposes a novel 3D object detection method that combines millimeter-wave radar and RGB camera data. Our approach includes a gaussian filter for preprocessing, a hierarchical model architecture for fusing radar and image information, and a training stabilization strategy. We evaluated our method using the challenging NuScenes and Taiwan street databases and found that it outperformed the popular CenterFusion model in terms of detection performance. In addition, our method is applicable to a variety of scenarios in Taiwan.