{"title":"A Centralized Multi-Agent DRL-Based Trajectory Control Strategy for Unmanned Aerial Vehicle-Enabled Wireless Communications","authors":"Getaneh Berie Tarekegn;Rong-Terng Juang;Belayneh Abebe Tesfaw;Hsin-Piao Lin;Huan-Chia Hsu;Robel Berie Tarekegn;Li-Chia Tai","doi":"10.1109/OJVT.2024.3451143","DOIUrl":null,"url":null,"abstract":"Unmanned aerial vehicles (UAVs) are becoming increasingly popular as mobile base stations due to their flexible deployment and low-cost features, particularly for emergency communications, traffic offloading, and terrestrial communications infrastructure failures. This paper presents an autonomous trajectory control method for multiple UAVs equipped with base stations for UAV-enabled wireless communications. The objective of this work is to address the optimization challenge of maximizing both communication coverage and network throughput for ground users. The proposed multi-aerial base station trajectory control (MATC) scheme employs a two-stage learning approach. Initially, we developed a long short-term memory-based link quality estimation model to assess each user's link quality over time. The trajectory of the aerial base stations is then continuously adjusted through a centralized multi-agent deep reinforcement learning algorithm to optimize communication performance. We evaluated our proposed system using real channel measurement data, i.e., amplitude and phase signal information. Notably, the proposed approach operates solely on received signals from users, without requiring knowledge of their specific locations. The proposed MATC strategy achieves 97.41% communication coverage while maintaining satisfactory system throughput performance. Numerical results demonstrate that the proposed method significantly enhances both communication coverage and network throughput in comparison to the base line algorithms.","PeriodicalId":34270,"journal":{"name":"IEEE Open Journal of Vehicular Technology","volume":null,"pages":null},"PeriodicalIF":5.3000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10654501","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Vehicular Technology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10654501/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Unmanned aerial vehicles (UAVs) are becoming increasingly popular as mobile base stations due to their flexible deployment and low-cost features, particularly for emergency communications, traffic offloading, and terrestrial communications infrastructure failures. This paper presents an autonomous trajectory control method for multiple UAVs equipped with base stations for UAV-enabled wireless communications. The objective of this work is to address the optimization challenge of maximizing both communication coverage and network throughput for ground users. The proposed multi-aerial base station trajectory control (MATC) scheme employs a two-stage learning approach. Initially, we developed a long short-term memory-based link quality estimation model to assess each user's link quality over time. The trajectory of the aerial base stations is then continuously adjusted through a centralized multi-agent deep reinforcement learning algorithm to optimize communication performance. We evaluated our proposed system using real channel measurement data, i.e., amplitude and phase signal information. Notably, the proposed approach operates solely on received signals from users, without requiring knowledge of their specific locations. The proposed MATC strategy achieves 97.41% communication coverage while maintaining satisfactory system throughput performance. Numerical results demonstrate that the proposed method significantly enhances both communication coverage and network throughput in comparison to the base line algorithms.