基于深度神经网络的光场图像全视点深度恢复

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) Pub Date : 2021-11-17 DOI:10.1109/IC-NIDC54101.2021.9660403

Fan Zhang, Xueming Li, Qiang Fu

{"title":"基于深度神经网络的光场图像全视点深度恢复","authors":"Fan Zhang, Xueming Li, Qiang Fu","doi":"10.1109/IC-NIDC54101.2021.9660403","DOIUrl":null,"url":null,"abstract":"Recovery depth from a lenslet light field image can facilitate lots of applications including super-resolution and 3D reconstruction. However, current works mainly focus on the central sub-aperture image but pay little attention to full-viewpoint light field images. In this paper, we propose a deep learning-based method to recovery full-viewpoint depth by estimating the disparity map from the given light field image. We employ the ResNet to extract multi-dimensional features from the given light field image which is encoded as a 3D epipolar plane image, establish dense connections to enable the neural network to calculate the cost volume from extracted features, and use an AutoEncoder to convert the cost volume to a disparity map of the given light field image. We show several experimental results and two comparisons with the related works to demonstrate the effect and performance of our method.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Full-Viewpoint Depth Recovery of Light Field Image via Deep Neural Network\",\"authors\":\"Fan Zhang, Xueming Li, Qiang Fu\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660403\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recovery depth from a lenslet light field image can facilitate lots of applications including super-resolution and 3D reconstruction. However, current works mainly focus on the central sub-aperture image but pay little attention to full-viewpoint light field images. In this paper, we propose a deep learning-based method to recovery full-viewpoint depth by estimating the disparity map from the given light field image. We employ the ResNet to extract multi-dimensional features from the given light field image which is encoded as a 3D epipolar plane image, establish dense connections to enable the neural network to calculate the cost volume from extracted features, and use an AutoEncoder to convert the cost volume to a disparity map of the given light field image. We show several experimental results and two comparisons with the related works to demonstrate the effect and performance of our method.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660403\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

透镜光场图像的恢复深度可以促进超分辨率和三维重建等许多应用。然而，目前的工作主要集中在中心子孔径图像上，而对全视点光场图像的关注较少。本文提出了一种基于深度学习的方法，通过估计给定光场图像的视差图来恢复全视点深度。我们使用ResNet从给定的光场图像中提取多维特征，并将其编码为三维极平面图像，建立密集连接使神经网络能够从提取的特征中计算代价体积，并使用AutoEncoder将代价体积转换为给定光场图像的视差图。我们给出了几个实验结果，并与相关工作进行了两次比较，以证明我们的方法的效果和性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Full-Viewpoint Depth Recovery of Light Field Image via Deep Neural Network

Recovery depth from a lenslet light field image can facilitate lots of applications including super-resolution and 3D reconstruction. However, current works mainly focus on the central sub-aperture image but pay little attention to full-viewpoint light field images. In this paper, we propose a deep learning-based method to recovery full-viewpoint depth by estimating the disparity map from the given light field image. We employ the ResNet to extract multi-dimensional features from the given light field image which is encoded as a 3D epipolar plane image, establish dense connections to enable the neural network to calculate the cost volume from extracted features, and use an AutoEncoder to convert the cost volume to a disparity map of the given light field image. We show several experimental results and two comparisons with the related works to demonstrate the effect and performance of our method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)

自引率

0.00%

发文量