Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du
{"title":"通过深度强化学习对 RIS 辅助无人机通信系统的 3D 轨迹和相移进行联合优化","authors":"Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du","doi":"10.1016/j.phycom.2024.102456","DOIUrl":null,"url":null,"abstract":"<div><p>Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.</p></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"66 ","pages":"Article 102456"},"PeriodicalIF":2.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems\",\"authors\":\"Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du\",\"doi\":\"10.1016/j.phycom.2024.102456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.</p></div>\",\"PeriodicalId\":48707,\"journal\":{\"name\":\"Physical Communication\",\"volume\":\"66 \",\"pages\":\"Article 102456\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical Communication\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1874490724001745\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1874490724001745","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems
Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.
期刊介绍:
PHYCOM: Physical Communication is an international and archival journal providing complete coverage of all topics of interest to those involved in all aspects of physical layer communications. Theoretical research contributions presenting new techniques, concepts or analyses, applied contributions reporting on experiences and experiments, and tutorials are published.
Topics of interest include but are not limited to:
Physical layer issues of Wireless Local Area Networks, WiMAX, Wireless Mesh Networks, Sensor and Ad Hoc Networks, PCS Systems; Radio access protocols and algorithms for the physical layer; Spread Spectrum Communications; Channel Modeling; Detection and Estimation; Modulation and Coding; Multiplexing and Carrier Techniques; Broadband Wireless Communications; Wireless Personal Communications; Multi-user Detection; Signal Separation and Interference rejection: Multimedia Communications over Wireless; DSP Applications to Wireless Systems; Experimental and Prototype Results; Multiple Access Techniques; Space-time Processing; Synchronization Techniques; Error Control Techniques; Cryptography; Software Radios; Tracking; Resource Allocation and Inference Management; Multi-rate and Multi-carrier Communications; Cross layer Design and Optimization; Propagation and Channel Characterization; OFDM Systems; MIMO Systems; Ultra-Wideband Communications; Cognitive Radio System Architectures; Platforms and Hardware Implementations for the Support of Cognitive, Radio Systems; Cognitive Radio Resource Management and Dynamic Spectrum Sharing.