通过深度强化学习对 RIS 辅助无人机通信系统的 3D 轨迹和相移进行联合优化

IF 2 4区计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Physical Communication Pub Date : 2024-07-26 DOI:10.1016/j.phycom.2024.102456

Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du

{"title":"通过深度强化学习对 RIS 辅助无人机通信系统的 3D 轨迹和相移进行联合优化","authors":"Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du","doi":"10.1016/j.phycom.2024.102456","DOIUrl":null,"url":null,"abstract":"<div><p>Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.</p></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"66 ","pages":"Article 102456"},"PeriodicalIF":2.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems\",\"authors\":\"Runzhi Tang, Junxuan Wang, Fan Jiang, Xuewei Zhang, Jianbo Du\",\"doi\":\"10.1016/j.phycom.2024.102456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.</p></div>\",\"PeriodicalId\":48707,\"journal\":{\"name\":\"Physical Communication\",\"volume\":\"66 \",\"pages\":\"Article 102456\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical Communication\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1874490724001745\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1874490724001745","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

无人飞行器（UAV）可作为空中基站部署，为用户设备（UE）提供通信服务。然而，在城市环境中，无人飞行器和 UE 之间的链路可能会经常被障碍物阻挡，从而对 UE 的服务质量（QoS）造成严重不利影响。此外，由于无人机的能量有限，通过频繁调整无人机位置来重新建立视距（LoS）链路并不总是可行的。在这种情况下，可重构智能表面（RIS）被用来通过向 UE 反射入射信号来增强 UAV-UE 链路的传输范围。本文研究了 RIS 辅助无人机通信系统，目标是通过联合优化无人机的轨迹和 RIS 的相移，最大限度地提高无人机的能效。所提出的优化问题是非凸问题，在多项式时间内求解具有挑战性。因此，我们提出了一种有效的基于深度强化学习（DRL）的解决方案，名为 Dueling DQN-PER，它将 Dueling DQN 算法与优先经验重放（PER）技术相结合。为了确保所有 UE 之间的公平性，我们设计了一个服务公平性指数，并在设计该算法时将其集成到奖励函数中。数值结果表明1）与基准方案相比，所提出的 Dueling DQN-PER 算法能够提高系统能效，并具有更好的训练性能；2）通过设计服务公平性指数，在提高系统能效性能的同时确保了所有 UE 之间的公平性；3）与没有 RIS 的系统相比，有 RIS 辅助的无人机通信系统能显著提高能效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems

Unmanned aerial vehicle (UAV) can be deployed as aerial base station to provide communication services for the user equipments (UEs). However, in urban environments, the links between UAV and UEs might be frequently blocked by obstacles, leading to severely adverse effects on the quality of service (QoS) of UEs. Additionally, due to the limited energy of the UAV, it might not always be feasible to re-establish the line-of-sight (LoS) links by frequently adjusting the positions of the UAV. In this context, the reconfigurable intelligent surface (RIS) is utilized to enhance the transmission range of UAV-UE links by reflecting incident signals to UEs. In this paper, we investigate the RIS-assisted UAV communication systems with the goal of maximizing the energy efficiency of the UAV through a joint optimization of the UAV’s trajectory and the RIS’s phase shift. The formulated optimization problem is non-convex, and challenging to solve in a polynomial time. Therefore, an effective deep reinforcement learning (DRL)-based solution, named Dueling DQN-PER is proposed, which combines the Dueling DQN algorithm with the prioritized experience replay (PER) technique. To ensure the fairness among all UEs, we design a service fairness index, and integrate it into the reward function when designing the proposed algorithm. Numerical results demonstrate that: 1) the proposed Dueling DQN-PER algorithm is capable of improving the system energy efficiency and has a better training performance than benchmark schemes; 2) by devising the service fairness index, the fairness among all UEs is ensured while enhancing the system performance in energy efficiency; 3) the RIS-assisted UAV communication systems benefit from significant energy efficiency gain over the systems without RIS.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Physical Communication ENGINEERING, ELECTRICAL & ELECTRONICTELECO-TELECOMMUNICATIONS

CiteScore

5.00

自引率

9.10%

发文量

212

审稿时长

55 days

期刊介绍： PHYCOM: Physical Communication is an international and archival journal providing complete coverage of all topics of interest to those involved in all aspects of physical layer communications. Theoretical research contributions presenting new techniques, concepts or analyses, applied contributions reporting on experiences and experiments, and tutorials are published. Topics of interest include but are not limited to: Physical layer issues of Wireless Local Area Networks, WiMAX, Wireless Mesh Networks, Sensor and Ad Hoc Networks, PCS Systems; Radio access protocols and algorithms for the physical layer; Spread Spectrum Communications; Channel Modeling; Detection and Estimation; Modulation and Coding; Multiplexing and Carrier Techniques; Broadband Wireless Communications; Wireless Personal Communications; Multi-user Detection; Signal Separation and Interference rejection: Multimedia Communications over Wireless; DSP Applications to Wireless Systems; Experimental and Prototype Results; Multiple Access Techniques; Space-time Processing; Synchronization Techniques; Error Control Techniques; Cryptography; Software Radios; Tracking; Resource Allocation and Inference Management; Multi-rate and Multi-carrier Communications; Cross layer Design and Optimization; Propagation and Channel Characterization; OFDM Systems; MIMO Systems; Ultra-Wideband Communications; Cognitive Radio System Architectures; Platforms and Hardware Implementations for the Support of Cognitive, Radio Systems; Cognitive Radio Resource Management and Dynamic Spectrum Sharing.