{"title":"融合二维目标检测和自监督单目深度估计的快速三维重建","authors":"Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li","doi":"10.1109/ICTech55460.2022.00104","DOIUrl":null,"url":null,"abstract":"In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.","PeriodicalId":290836,"journal":{"name":"2022 11th International Conference of Information and Communication Technology (ICTech))","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation\",\"authors\":\"Chao Fan, Zhenyu Yin, Xinxin Huang, Mingshi Li, Xiaohui Wang, Hui Li\",\"doi\":\"10.1109/ICTech55460.2022.00104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.\",\"PeriodicalId\":290836,\"journal\":{\"name\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTech55460.2022.00104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference of Information and Communication Technology (ICTech))","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTech55460.2022.00104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Faster 3D Reconstruction by Fusing 2D Object Detection and Self-Supervised Monocular Depth Estimation
In recent years, real-time 3D reconstruction has gained popularity among many researchers due to its increasing applications. 3D reconstruction methods based on depth estimation have a large number of invalid areas for reconstruction in autonomous driving. To further improve the real-time performance of 3D reconstruction, we propose a novel approach to reduce the consumption of computational resources by extracting significant regions of depth maps. First, binary masks that extract significant regions are generated by a 2D object detection model. Then, we design the TRDE module to extract target regions in generated depth maps. Finally, qualitative and quantitative results on KITTI dataset demonstrate that our approach can perform depth maps optimization and reduce computational resources consumed during the 3D reconstruction. As a result, our method achieves faster 3D reconstruction by fusing 2D object detection and self-supervised monocular depth estimation.