Redundant frame structure using M-frame for interactive light field streaming

2016 IEEE International Conference on Image Processing (ICIP) Pub Date : 2016-09-01 DOI:10.1109/ICIP.2016.7532582

B. Motz, Gene Cheung, Antonio Ortega

{"title":"Redundant frame structure using M-frame for interactive light field streaming","authors":"B. Motz, Gene Cheung, Antonio Ortega","doi":"10.1109/ICIP.2016.7532582","DOIUrl":null,"url":null,"abstract":"A light field (LF) is a 2D array of closely spaced viewpoint images of a static 3D scene. In an interactive LF streaming (ILFS) scenario, a user successively requests desired neighboring viewpoints for observation, and in response the server must transmit pre-encoded data for correct decoding of the requested viewpoint images. Designing frame structures for ILFS is challenging, since at encoding time it is not known what navigation path a user will take, making differential coding very difficult to employ. In this paper, leveraging on a recent work on the merge operator - a new distributed source coding technique that efficiently merges differences among a set of side information (SI) frames into an identical reconstruction - we design redundant frame structures that facilitate ILFS, trading off expected transmission cost with total storage size. Specifically, we first propose a new view interaction model that captures view navigation tendencies of typical users. Assuming a flexible one-frame buffer at the decoder, we then derive a set of recursive equations that compute the expected transmission cost for a navigation lifetime of T views, given the proposed interaction model and a pre-encoded frame structure. Finally, we propose an algorithm that greedily builds a redundant frame structure, minimizing a weighted sum of expected transmission cost and total storage size. Experimental results show that our proposed algorithm generates frame structures with better transmission / storage tradeoffs than competing schemes.","PeriodicalId":6521,"journal":{"name":"2016 IEEE International Conference on Image Processing (ICIP)","volume":"40 1","pages":"1369-1373"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Image Processing (ICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIP.2016.7532582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

A light field (LF) is a 2D array of closely spaced viewpoint images of a static 3D scene. In an interactive LF streaming (ILFS) scenario, a user successively requests desired neighboring viewpoints for observation, and in response the server must transmit pre-encoded data for correct decoding of the requested viewpoint images. Designing frame structures for ILFS is challenging, since at encoding time it is not known what navigation path a user will take, making differential coding very difficult to employ. In this paper, leveraging on a recent work on the merge operator - a new distributed source coding technique that efficiently merges differences among a set of side information (SI) frames into an identical reconstruction - we design redundant frame structures that facilitate ILFS, trading off expected transmission cost with total storage size. Specifically, we first propose a new view interaction model that captures view navigation tendencies of typical users. Assuming a flexible one-frame buffer at the decoder, we then derive a set of recursive equations that compute the expected transmission cost for a navigation lifetime of T views, given the proposed interaction model and a pre-encoded frame structure. Finally, we propose an algorithm that greedily builds a redundant frame structure, minimizing a weighted sum of expected transmission cost and total storage size. Experimental results show that our proposed algorithm generates frame structures with better transmission / storage tradeoffs than competing schemes.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

冗余框架结构，采用m -框架进行交互光场流

光场(LF)是静态3D场景中紧密间隔的视点图像的二维阵列。在交互式LF流(ILFS)场景中，用户连续请求所需的邻近视点进行观察，作为响应，服务器必须传输预编码的数据以正确解码所请求的视点图像。为ILFS设计框架结构是具有挑战性的，因为在编码时不知道用户将采取什么导航路径，使得差分编码非常难以使用。在本文中，利用最近对合并算子的研究——一种新的分布式源编码技术，有效地将一组侧信息(SI)帧之间的差异合并到一个相同的重构中——我们设计了冗余的帧结构，促进了ILFS，权衡了预期的传输成本和总存储大小。具体而言，我们首先提出了一种新的视图交互模型，该模型捕捉了典型用户的视图导航倾向。假设解码器有一个灵活的一帧缓冲区，然后我们推导出一组递归方程，计算T视图导航生命周期的预期传输成本，给定所提出的交互模型和预编码的帧结构。最后，我们提出了一种贪婪地构建冗余帧结构的算法，最小化期望传输成本和总存储大小的加权总和。实验结果表明，该算法生成的帧结构比竞争方案具有更好的传输/存储权衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE International Conference on Image Processing (ICIP)

自引率

0.00%

发文量

期刊最新文献

Content-adaptive pyramid representation for 3D object classification Automating the measurement of physiological parameters: A case study in the image analysis of cilia motion Horizon based orientation estimation for planetary surface navigation Softcast with per-carrier power-constrained channels Speeding-up a convolutional neural network by connecting an SVM network