Di Wang, Qianqian Liu, Jie Tian, Yuan Zhi, Jingping Qiao, Ji Bian
{"title":"基于d2d的无人机中继网络中缓存的深度强化学习","authors":"Di Wang, Qianqian Liu, Jie Tian, Yuan Zhi, Jingping Qiao, Ji Bian","doi":"10.1109/iccc52777.2021.9580299","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicle (UAV)-relaying can forward files for user devices, but also faces the challenge of the traffic blockage of wireless backhaul. In this paper, we propose a novel caching strategy to pre-cache some popular files at both UAV and user devices to reduce duplicate transmissions in device-to-device (D2D)-enabled UAV-relaying networks. Considering the quality of experience (QoE) of the requesting users, we formulate a file access delay minimization problem by optimizing the cache placement. Due to the dynamics of the environment and the complexity of the formulated problem, we propose a deep deterministic policy gradient (DDPG)-based cache placement optimizing algorithm to decide which files to be cached and where to be cached. In addition, we also analyze theoretically the complexity of the proposed algorithm. Numerical results show our proposed scheme outperforms other baselines.","PeriodicalId":425118,"journal":{"name":"2021 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Deep Reinforcement Learning for Caching in D2D-Enabled UAV-Relaying Networks\",\"authors\":\"Di Wang, Qianqian Liu, Jie Tian, Yuan Zhi, Jingping Qiao, Ji Bian\",\"doi\":\"10.1109/iccc52777.2021.9580299\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Unmanned aerial vehicle (UAV)-relaying can forward files for user devices, but also faces the challenge of the traffic blockage of wireless backhaul. In this paper, we propose a novel caching strategy to pre-cache some popular files at both UAV and user devices to reduce duplicate transmissions in device-to-device (D2D)-enabled UAV-relaying networks. Considering the quality of experience (QoE) of the requesting users, we formulate a file access delay minimization problem by optimizing the cache placement. Due to the dynamics of the environment and the complexity of the formulated problem, we propose a deep deterministic policy gradient (DDPG)-based cache placement optimizing algorithm to decide which files to be cached and where to be cached. In addition, we also analyze theoretically the complexity of the proposed algorithm. Numerical results show our proposed scheme outperforms other baselines.\",\"PeriodicalId\":425118,\"journal\":{\"name\":\"2021 IEEE/CIC International Conference on Communications in China (ICCC)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/CIC International Conference on Communications in China (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iccc52777.2021.9580299\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iccc52777.2021.9580299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning for Caching in D2D-Enabled UAV-Relaying Networks
Unmanned aerial vehicle (UAV)-relaying can forward files for user devices, but also faces the challenge of the traffic blockage of wireless backhaul. In this paper, we propose a novel caching strategy to pre-cache some popular files at both UAV and user devices to reduce duplicate transmissions in device-to-device (D2D)-enabled UAV-relaying networks. Considering the quality of experience (QoE) of the requesting users, we formulate a file access delay minimization problem by optimizing the cache placement. Due to the dynamics of the environment and the complexity of the formulated problem, we propose a deep deterministic policy gradient (DDPG)-based cache placement optimizing algorithm to decide which files to be cached and where to be cached. In addition, we also analyze theoretically the complexity of the proposed algorithm. Numerical results show our proposed scheme outperforms other baselines.