{"title":"Deep Reinforcement Learning for Dynamic Bandwidth Allocation in Multi-Beam Satellite Systems","authors":"Shijun Ma, Xin Hu, Xianglai Liao, Weidong Wang","doi":"10.1109/ICCCS52626.2021.9449160","DOIUrl":null,"url":null,"abstract":"Future multi-beam satellite (MBS) network is an essential part of the air-space-ground integrated network, which is the future blueprint of 6G. As the MBS network scales up, how to allocation scarce bandwidth spectrum resources efficiently and dynamically while ensuring the Quality of Service (QoS) of the users has become a great challenge. In this paper, we designed a dynamic bandwidth allocation framework using Proximal Policy Optimization (DBA-PPO) to meet the time-varying traffic demand, maximize utilization and guarantee the QoS of the users in the MBS system. The experimental results show that the proposed bandwidth allocation algorithm can be flexible to achieve the desired effectiveness with low complexity and is more cost-effective for the large scale MBS communications scenario.","PeriodicalId":376290,"journal":{"name":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS52626.2021.9449160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Future multi-beam satellite (MBS) network is an essential part of the air-space-ground integrated network, which is the future blueprint of 6G. As the MBS network scales up, how to allocation scarce bandwidth spectrum resources efficiently and dynamically while ensuring the Quality of Service (QoS) of the users has become a great challenge. In this paper, we designed a dynamic bandwidth allocation framework using Proximal Policy Optimization (DBA-PPO) to meet the time-varying traffic demand, maximize utilization and guarantee the QoS of the users in the MBS system. The experimental results show that the proposed bandwidth allocation algorithm can be flexible to achieve the desired effectiveness with low complexity and is more cost-effective for the large scale MBS communications scenario.