Energy-Efficient Trajectory Planning With Joint Device Selection and Power Splitting for mmWaves-Enabled UAV-NOMA Networks

IEEE Transactions on Machine Learning in Communications and Networking Pub Date : 2024-03-02 DOI:10.1109/TMLCN.2024.3396438

Ahmad Gendia;Osamu Muta;Sherief Hashima;Kohei Hatano

{"title":"Energy-Efficient Trajectory Planning With Joint Device Selection and Power Splitting for mmWaves-Enabled UAV-NOMA Networks","authors":"Ahmad Gendia;Osamu Muta;Sherief Hashima;Kohei Hatano","doi":"10.1109/TMLCN.2024.3396438","DOIUrl":null,"url":null,"abstract":"This paper proposes two energy-efficient reinforcement learning (RL)-based algorithms for millimeter wave (mmWave)-enabled unmanned aerial vehicle (UAV) communications toward beyond-5G (B5G). This can be especially useful in ad-hoc communication scenarios within a neighborhood with main-network connectivity problems such as in areas affected by natural disasters. To improve the system’s overall sum-rate performance, the UAV-operated mobile base station (UAV-MBS) can harness non-orthogonal multiple access (NOMA) as an efficient protocol to grant ground devices access to fast downlink connections. Dynamic selection of suitable hovering spots within the target zone where the battery-constrained UAV needs to be positioned as well as calibrated NOMA power control with proper device pairing are critical for optimized performance. We propose cost-subsidized multiarmed bandit (CS-MAB) and double deep Q-network (DDQN)-based solutions to jointly address the problems of dynamic UAV path design, device pairing, and power splitting for downlink data transmission in NOMA-based systems. To verify that the proposed RL-based solutions support high sum-rates, numerical simulations are presented. In addition, exhaustive and random search benchmarks are provided as baselines for the achievable upper and lower sum-rate levels, respectively. The proposed DDQN agent achieves 96% of the sum-rate provided by the optimal exhaustive scanning whereas CS-MAB reaches 91.5%. By contrast, a conventional channel state sorting pairing (CSSP) solver achieves about 89.3%.","PeriodicalId":100641,"journal":{"name":"IEEE Transactions on Machine Learning in Communications and Networking","volume":"2 ","pages":"617-632"},"PeriodicalIF":0.0000,"publicationDate":"2024-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10517756","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Machine Learning in Communications and Networking","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10517756/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper proposes two energy-efficient reinforcement learning (RL)-based algorithms for millimeter wave (mmWave)-enabled unmanned aerial vehicle (UAV) communications toward beyond-5G (B5G). This can be especially useful in ad-hoc communication scenarios within a neighborhood with main-network connectivity problems such as in areas affected by natural disasters. To improve the system’s overall sum-rate performance, the UAV-operated mobile base station (UAV-MBS) can harness non-orthogonal multiple access (NOMA) as an efficient protocol to grant ground devices access to fast downlink connections. Dynamic selection of suitable hovering spots within the target zone where the battery-constrained UAV needs to be positioned as well as calibrated NOMA power control with proper device pairing are critical for optimized performance. We propose cost-subsidized multiarmed bandit (CS-MAB) and double deep Q-network (DDQN)-based solutions to jointly address the problems of dynamic UAV path design, device pairing, and power splitting for downlink data transmission in NOMA-based systems. To verify that the proposed RL-based solutions support high sum-rates, numerical simulations are presented. In addition, exhaustive and random search benchmarks are provided as baselines for the achievable upper and lower sum-rate levels, respectively. The proposed DDQN agent achieves 96% of the sum-rate provided by the optimal exhaustive scanning whereas CS-MAB reaches 91.5%. By contrast, a conventional channel state sorting pairing (CSSP) solver achieves about 89.3%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用联合设备选择和功率分配为毫米波无人机-NOMA 网络制定高能效轨迹规划

本文提出了两种基于强化学习（RL）的高能效算法，用于支持毫米波（mmWave）的无人机（UAV）通信，以实现超越 5G （B5G）。这对于存在主网络连接问题的邻近地区（如受自然灾害影响的地区）的临时通信场景尤其有用。为了提高系统的整体总速率性能，无人机移动基站（UAV-MBS）可以利用非正交多址接入（NOMA）作为一种高效协议，让地面设备接入快速下行链路连接。在电池有限的无人机需要定位的目标区域内动态选择合适的悬停点，以及校准的 NOMA 功率控制和适当的设备配对对于优化性能至关重要。我们提出了基于成本补贴的多臂匪盗（CS-MAB）和双深Q网络（DDQN）的解决方案，以共同解决无人机动态路径设计、设备配对以及基于NOMA系统的下行数据传输功率分配等问题。为了验证所提出的基于 RL 的解决方案是否支持高和速率，本文进行了数值模拟。此外，还提供了穷举搜索和随机搜索基准，分别作为可实现的上限和下限总和速率水平的基准。提议的 DDQN 代理达到了最优穷举扫描所提供的总和速率的 96%，而 CS-MAB 则达到了 91.5%。相比之下，传统的信道状态排序配对（CSSP）求解器可达到约 89.3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Machine Learning in Communications and Networking

自引率

0.00%

发文量