基于c -双注意网络的多视点三维图像重建

Int. J. Wavelets Multiresolution Inf. Process. Pub Date : 2022-12-13 DOI:10.1142/s0219691322500448

T. U. Kamble, S. Mahajan

{"title":"基于c -双注意网络的多视点三维图像重建","authors":"T. U. Kamble, S. Mahajan","doi":"10.1142/s0219691322500448","DOIUrl":null,"url":null,"abstract":"3D image reconstruction using multi-view imaging is widely utilized in several application domains: construction field, disaster management, urban planning, etc. The 3D reconstruction from the multi-view image is still challenging due to the high freedom and inaccurate reconstruction. This research introduces the hybrid deep learning technique for reconstructing the 3D image, in which the C-dual attention layer is proposed for generating the feature map to support the image reconstruction. The proposed 3D image reconstruction uses the encoder–decoder–refiner which is utilized for reconstruction. Initially, the features are extracted from the AlexNet and ResNet-50 features automatically. Then, the proposed C-dual attention layer is utilized for generating the inter-channel and inter-spatial relationship among the features to obtain enhanced reconstruction accuracy. The inter-channel relationship is evaluated using the channel attention layer, and the inter-spatial relationship is evaluated using the spatial attention layer of the encoder module. Here, the features generated by the spatial attention layer are combined to form the feature map in a 2D map. The proposed C-dual attention encoder provides enhanced features that help to acquire enhanced 3D image reconstruction. The proposed method is evaluated based on loss, IoU_3D, and IoU_2D, and acquired the values of 0.0721, 1.25 and 1.37, respectively.","PeriodicalId":158567,"journal":{"name":"Int. J. Wavelets Multiresolution Inf. Process.","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"3D Image reconstruction using C-dual attention network from multi-view images\",\"authors\":\"T. U. Kamble, S. Mahajan\",\"doi\":\"10.1142/s0219691322500448\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"3D image reconstruction using multi-view imaging is widely utilized in several application domains: construction field, disaster management, urban planning, etc. The 3D reconstruction from the multi-view image is still challenging due to the high freedom and inaccurate reconstruction. This research introduces the hybrid deep learning technique for reconstructing the 3D image, in which the C-dual attention layer is proposed for generating the feature map to support the image reconstruction. The proposed 3D image reconstruction uses the encoder–decoder–refiner which is utilized for reconstruction. Initially, the features are extracted from the AlexNet and ResNet-50 features automatically. Then, the proposed C-dual attention layer is utilized for generating the inter-channel and inter-spatial relationship among the features to obtain enhanced reconstruction accuracy. The inter-channel relationship is evaluated using the channel attention layer, and the inter-spatial relationship is evaluated using the spatial attention layer of the encoder module. Here, the features generated by the spatial attention layer are combined to form the feature map in a 2D map. The proposed C-dual attention encoder provides enhanced features that help to acquire enhanced 3D image reconstruction. The proposed method is evaluated based on loss, IoU_3D, and IoU_2D, and acquired the values of 0.0721, 1.25 and 1.37, respectively.\",\"PeriodicalId\":158567,\"journal\":{\"name\":\"Int. J. Wavelets Multiresolution Inf. Process.\",\"volume\":\"174 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Wavelets Multiresolution Inf. Process.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s0219691322500448\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Wavelets Multiresolution Inf. Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219691322500448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

基于多视点成像的三维图像重建在建筑、灾害管理、城市规划等多个应用领域得到了广泛的应用。多视点图像的三维重建由于自由度高、重建精度不高，仍然是一个挑战。本研究引入了用于三维图像重建的混合深度学习技术，其中提出了C-dual注意层来生成特征映射以支持图像重建。所提出的三维图像重建使用了用于重建的编码器-解码器-细化器。最初，这些特征是自动从AlexNet和ResNet-50特征中提取的。然后，利用所提出的c -双注意层生成特征之间的通道间和空间间关系，以提高重建精度。使用信道注意层评估信道间关系，使用编码器模块的空间注意层评估空间间关系。在这里，将空间注意层生成的特征组合在一起，形成二维地图中的特征图。提出的c -双注意力编码器提供了增强的功能，有助于获得增强的3D图像重建。基于loss、IoU_3D和IoU_2D对该方法进行了评价，得到的结果分别为0.0721、1.25和1.37。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

3D Image reconstruction using C-dual attention network from multi-view images

3D image reconstruction using multi-view imaging is widely utilized in several application domains: construction field, disaster management, urban planning, etc. The 3D reconstruction from the multi-view image is still challenging due to the high freedom and inaccurate reconstruction. This research introduces the hybrid deep learning technique for reconstructing the 3D image, in which the C-dual attention layer is proposed for generating the feature map to support the image reconstruction. The proposed 3D image reconstruction uses the encoder–decoder–refiner which is utilized for reconstruction. Initially, the features are extracted from the AlexNet and ResNet-50 features automatically. Then, the proposed C-dual attention layer is utilized for generating the inter-channel and inter-spatial relationship among the features to obtain enhanced reconstruction accuracy. The inter-channel relationship is evaluated using the channel attention layer, and the inter-spatial relationship is evaluated using the spatial attention layer of the encoder module. Here, the features generated by the spatial attention layer are combined to form the feature map in a 2D map. The proposed C-dual attention encoder provides enhanced features that help to acquire enhanced 3D image reconstruction. The proposed method is evaluated based on loss, IoU_3D, and IoU_2D, and acquired the values of 0.0721, 1.25 and 1.37, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Int. J. Wavelets Multiresolution Inf. Process.

自引率

0.00%

发文量