{"title":"Dynamic Resource Allocation for MmWave UAV Communications: A Deep Reinforcement Learning Approach","authors":"Yangyang Wang, Yawen Chen, Zhaoming Lu, X. Wen","doi":"10.1109/ICCCWorkshops55477.2022.9896667","DOIUrl":null,"url":null,"abstract":"Millimeter wave (mmWave) enabled unmanned aerial vehicle (UAV) communications featured by high flexibility and data rate, are widely regarded as an essential element of 6G networks. This paper focuses on the dynamic resource allocation of mmWave UAV communication systems. This problem as a joint optimization of the 3D UAV trajectory, beamwidth and power allocation, with the objective of maximizing normalized spectral efficiency (NSE). Considering that this problem is non-convex and can not be solved directly with the traditional methods, we propose to decouple it into two tractable sub-problems. Moreover, we propose two deep deterministic policy gradient (DDPG)-based algorithms to effectively find the optimal solution in continuous space. Simulation results show that the proposed DDPG-based algorithms can significantly improve throughput.","PeriodicalId":148869,"journal":{"name":"2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCWorkshops55477.2022.9896667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Millimeter wave (mmWave) enabled unmanned aerial vehicle (UAV) communications featured by high flexibility and data rate, are widely regarded as an essential element of 6G networks. This paper focuses on the dynamic resource allocation of mmWave UAV communication systems. This problem as a joint optimization of the 3D UAV trajectory, beamwidth and power allocation, with the objective of maximizing normalized spectral efficiency (NSE). Considering that this problem is non-convex and can not be solved directly with the traditional methods, we propose to decouple it into two tractable sub-problems. Moreover, we propose two deep deterministic policy gradient (DDPG)-based algorithms to effectively find the optimal solution in continuous space. Simulation results show that the proposed DDPG-based algorithms can significantly improve throughput.