Zhengming Zhang, Yaru Zheng, Chunguo Li, Yongming Huang, Luxi Yang
{"title":"Cache-Enabled Adaptive Bit Rate Streaming via Deep Self-Transfer Reinforcement Learning","authors":"Zhengming Zhang, Yaru Zheng, Chunguo Li, Yongming Huang, Luxi Yang","doi":"10.1109/WCSP.2018.8555916","DOIUrl":null,"url":null,"abstract":"Caching and rate allocation are two promising approaches to support video streaming over wireless networks. However, existing rate allocation designs do not fully exploit the advantages of the two approaches. This paper investigates the problem of cache-enabled video rate allocation. We establish a mathematical model for this problem, and point out that it is difficult to solve it with traditional dynamic programming. Then we propose a deep reinforcement learning approach to solve it. Firstly, we model the problem as a Markov decision problem. Then we present a deep Q-learning algorithm with a special knowledge transfer process to find out an effective allocation policy. Finally, numerical results are given to demonstrate that the proposed solution can effectively maintain high-quality of service. We also investigate the impact of critical parameters on the performance of our algorithm.","PeriodicalId":423073,"journal":{"name":"2018 10th International Conference on Wireless Communications and Signal Processing (WCSP)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Wireless Communications and Signal Processing (WCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2018.8555916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Caching and rate allocation are two promising approaches to support video streaming over wireless networks. However, existing rate allocation designs do not fully exploit the advantages of the two approaches. This paper investigates the problem of cache-enabled video rate allocation. We establish a mathematical model for this problem, and point out that it is difficult to solve it with traditional dynamic programming. Then we propose a deep reinforcement learning approach to solve it. Firstly, we model the problem as a Markov decision problem. Then we present a deep Q-learning algorithm with a special knowledge transfer process to find out an effective allocation policy. Finally, numerical results are given to demonstrate that the proposed solution can effectively maintain high-quality of service. We also investigate the impact of critical parameters on the performance of our algorithm.