3D Object Detection Based on Multi-view Adaptive Fusion
Yong Zhang, Huan Wu
2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC), published 2022-04-14
DOI: https://doi.org/10.1109/ipec54454.2022.9777488
Citations: 2
Abstract
To address the difficulty of fusing multi-view features effectively, this paper proposes a 3D object detection framework based on adaptive multi-view feature fusion, with contributions in two areas: deep feature fusion and loss function design. The framework combines the bird's-eye view and the cylindrical view, and fuses their features adaptively, taking into account both the interaction between the views and the contribution of each view's features to the detection task. Two auxiliary tasks, foreground classification and center regression, encourage the network to learn structural information and local features. In addition, the loss computation is optimized to improve bounding-box regression. Experiments on the KITTI dataset show that the method outperforms all single-stage fusion methods, surpasses most two-stage fusion methods, and achieves a good balance between speed and accuracy on the KITTI benchmark.
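The abstract does not specify how the adaptive fusion is computed. As a rough illustration only, the sketch below shows one common way to weight two views per point: each view's feature vector is scored, the two scores are normalized with a softmax, and the features are blended with the resulting weights. The function name `adaptive_fuse`, the linear scoring vectors `score_w_bev` and `score_w_cyl`, and the softmax weighting are all assumptions made for this example and are not taken from the paper.

```python
import math


def dot(u, v):
    """Inner product of two equal-length lists of floats."""
    return sum(a * b for a, b in zip(u, v))


def adaptive_fuse(bev_feat, cyl_feat, score_w_bev, score_w_cyl):
    """Blend one point's bird's-eye-view and cylindrical-view features.

    Hypothetical scheme (not the paper's): each view gets a scalar score
    from a linear projection, and a softmax over the two scores gives the
    per-view fusion weights.
    """
    s_bev = dot(score_w_bev, bev_feat)
    s_cyl = dot(score_w_cyl, cyl_feat)

    # Subtract the max score before exponentiating for numerical stability.
    m = max(s_bev, s_cyl)
    e_bev = math.exp(s_bev - m)
    e_cyl = math.exp(s_cyl - m)

    a_bev = e_bev / (e_bev + e_cyl)
    a_cyl = 1.0 - a_bev

    # Weighted sum of the two feature vectors, element by element.
    fused = [a_bev * b + a_cyl * c for b, c in zip(bev_feat, cyl_feat)]
    return fused, (a_bev, a_cyl)
```

With equal scores the two views contribute equally and the result is the element-wise mean; as one view's score grows, the fused feature approaches that view's feature. In a trained network the scoring vectors would be learned parameters rather than fixed values.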