Deep Reinforcement Learning-based Rate Adaptation for Adaptive 360-Degree Video Streaming

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2019-05-12 DOI:10.1109/ICASSP.2019.8683779

Nuowen Kan, Junni Zou, Kexin Tang, Chenglin Li, Ning Liu, H. Xiong

引用次数: 28

Abstract

In this paper, we propose a deep reinforcement learning (DRL)-based rate adaptation algorithm for adaptive 360-degree video streaming, which is able to maximize the quality of experience of viewers by adapting the transmitted video quality to the time-varying network conditions. Specifically, to reduce the possible switching latency of the field of view (FoV), we design a new QoE metric by introducing a penalty term for the large buffer occupancy. A scalable FoV method is further proposed to alleviate the combinatorial explosion of the action space in the DRL formulation. Then, we model the rate adaptation logic as a Markov decision process and employ the DRL-based algorithm to dynamically learn the optimal video transmission rate. Simulation results show the superior performance of the proposed algorithm compared to the existing algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度强化学习的自适应360度视频流速率自适应

在本文中，我们提出了一种基于深度强化学习(DRL)的自适应360度视频流的速率自适应算法，该算法通过使传输的视频质量适应时变的网络条件，从而最大限度地提高观众的体验质量。具体来说，为了减少视场切换的延迟，我们设计了一个新的QoE指标，引入了对大缓冲区占用的惩罚项。进一步提出了一种可扩展的视场方法，以缓解DRL公式中动作空间的组合爆炸。然后，我们将速率自适应逻辑建模为马尔可夫决策过程，并采用基于drl的算法动态学习最优视频传输速率。仿真结果表明，该算法的性能优于现有算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

自引率

0.00%

发文量

期刊最新文献

Universal Acoustic Modeling Using Neural Mixture Models Speech Landmark Bigrams for Depression Detection from Naturalistic Smartphone Speech Robust M-estimation Based Matrix Completion When Can a System of Subnetworks Be Registered Uniquely? Learning Search Path for Region-level Image Matching