Yuhan Liu, Chaowei Wang, Danhao Deng, Yuan Yao, Ji Wang, Weidong Wang
{"title":"A DRL Resource Allocation for Downlink NOMA Multi-beam Satellite Communications","authors":"Yuhan Liu, Chaowei Wang, Danhao Deng, Yuan Yao, Ji Wang, Weidong Wang","doi":"10.1109/BMSB58369.2023.10211246","DOIUrl":null,"url":null,"abstract":"With the development of communication technology and the increasing demand, there are more expectations for the research of multi-beam satellite communication system, and the efficient management and allocation of resources on the planet and the improvement of system performance are becoming more and more important. In this paper, the downlink Non-Orthogonal Multiple Access (NOMA) multi-beam satellite is used as the system model, and a greedy principle-based joint user grouping and subchannel joint access algorithm is proposed to solve the problems of user grouping and bandwidth allocation. On this basis, a power resource allocation algorithm based on deep reinforcement learning is proposed. By optimizing the reachability and rate of the system and user fairness under the constraints of the total transmission power and the user's minimum transmission rate, the optimization problem is modeled as Markov decision process, using the Proximal Policy Optimization (PPO) algorithm to achieve optimal allocation. The validity and superiority of the algorithm are verified by simulation, which is of great significance to the research of multi-beam satellite resource allocation.","PeriodicalId":13080,"journal":{"name":"IEEE international Symposium on Broadband Multimedia Systems and Broadcasting","volume":"25 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE international Symposium on Broadband Multimedia Systems and Broadcasting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMSB58369.2023.10211246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of communication technology and the increasing demand, there are more expectations for the research of multi-beam satellite communication system, and the efficient management and allocation of resources on the planet and the improvement of system performance are becoming more and more important. In this paper, the downlink Non-Orthogonal Multiple Access (NOMA) multi-beam satellite is used as the system model, and a greedy principle-based joint user grouping and subchannel joint access algorithm is proposed to solve the problems of user grouping and bandwidth allocation. On this basis, a power resource allocation algorithm based on deep reinforcement learning is proposed. By optimizing the reachability and rate of the system and user fairness under the constraints of the total transmission power and the user's minimum transmission rate, the optimization problem is modeled as Markov decision process, using the Proximal Policy Optimization (PPO) algorithm to achieve optimal allocation. The validity and superiority of the algorithm are verified by simulation, which is of great significance to the research of multi-beam satellite resource allocation.