LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution

Shiyu Feng, Yun Zhang, Linwei Zhu, Sam Kwong
{"title":"LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution","authors":"Shiyu Feng, Yun Zhang, Linwei Zhu, Sam Kwong","doi":"arxiv-2409.11711","DOIUrl":null,"url":null,"abstract":"Light-Field (LF) image is emerging 4D data of light rays that is capable of\nrealistically presenting spatial and angular information of 3D scene. However,\nthe large data volume of LF images becomes the most challenging issue in\nreal-time processing, transmission, and storage. In this paper, we propose an\nend-to-end deep LF Image Compression method Using Disentangled Representation\nand Asymmetrical Strip Convolution (LFIC-DRASC) to improve coding efficiency.\nFirstly, we formulate the LF image compression problem as learning a\ndisentangled LF representation network and an image encoding-decoding network.\nSecondly, we propose two novel feature extractors that leverage the structural\nprior of LF data by integrating features across different dimensions.\nMeanwhile, disentangled LF representation network is proposed to enhance the LF\nfeature disentangling and decoupling. Thirdly, we propose the LFIC-DRASC for LF\nimage compression, where two Asymmetrical Strip Convolution (ASC) operators,\ni.e. horizontal and vertical, are proposed to capture long-range correlation in\nLF feature space. These two ASC operators can be combined with the square\nconvolution to further decouple LF features, which enhances the model ability\nin representing intricate spatial relationships. Experimental results\ndemonstrate that the proposed LFIC-DRASC achieves an average of 20.5\\% bit rate\nreductions comparing with the state-of-the-art methods.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11711","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Light-Field (LF) image is emerging 4D data of light rays that is capable of realistically presenting spatial and angular information of 3D scene. However, the large data volume of LF images becomes the most challenging issue in real-time processing, transmission, and storage. In this paper, we propose an end-to-end deep LF Image Compression method Using Disentangled Representation and Asymmetrical Strip Convolution (LFIC-DRASC) to improve coding efficiency. Firstly, we formulate the LF image compression problem as learning a disentangled LF representation network and an image encoding-decoding network. Secondly, we propose two novel feature extractors that leverage the structural prior of LF data by integrating features across different dimensions. Meanwhile, disentangled LF representation network is proposed to enhance the LF feature disentangling and decoupling. Thirdly, we propose the LFIC-DRASC for LF image compression, where two Asymmetrical Strip Convolution (ASC) operators, i.e. horizontal and vertical, are proposed to capture long-range correlation in LF feature space. These two ASC operators can be combined with the square convolution to further decouple LF features, which enhances the model ability in representing intricate spatial relationships. Experimental results demonstrate that the proposed LFIC-DRASC achieves an average of 20.5\% bit rate reductions comparing with the state-of-the-art methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LFIC-DRASC:利用分离表示和非对称条带卷积进行深度光场图像压缩
光场(LF)图像是新兴的光射线四维数据,能够真实呈现三维场景的空间和角度信息。然而,光场图像数据量大,成为实时处理、传输和存储中最具挑战性的问题。本文提出了一种端到端的深度低频图像压缩方法--使用非平行表示和非对称条带卷积(LFIC-DRASC)来提高编码效率。其次,我们提出了两个新颖的特征提取器,通过整合不同维度的特征来利用低频数据的结构先验性。第三,我们提出了用于低频图像压缩的 LFIC-DRASC,其中提出了两个非对称带卷积(ASC)算子,即水平和垂直算子,以捕捉低频特征空间中的长程相关性。这两个 ASC 算子可与平方卷积相结合,进一步解耦低频特征,从而增强了模型表现复杂空间关系的能力。实验结果表明,与最先进的方法相比,所提出的 LFIC-DRASC 平均降低了 20.5% 的比特率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
multiPI-TransBTS: A Multi-Path Learning Framework for Brain Tumor Image Segmentation Based on Multi-Physical Information Autopet III challenge: Incorporating anatomical knowledge into nnUNet for lesion segmentation in PET/CT Denoising diffusion models for high-resolution microscopy image restoration Tumor aware recurrent inter-patient deformable image registration of computed tomography scans with lung cancer Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to Fine-tune Vision Foundation Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1