{"title":"Power Allocation based on Q-Learning for NOMA Visible Light Communication Networks","authors":"Yefei Tian, Yufei Luo, A. Dang","doi":"10.18178/wcse.2020.06.056","DOIUrl":null,"url":null,"abstract":"Non-orthogonal multiple access (NOMA) has been proposed to enhance system capacity for visible light communication (VLC) systems. However, the effective power allocation strategy is one of critical problems that needs to be solved in NOMA. In this paper, a new method for multi-user downlink power allocation in VLC NOMA based on reinforcement learning is proposed. This method utilizes distributed multi-agent Q-learning algorithm with low complexity to maximize sum throughput of the multiuser VLC downlink system which is subject to both user fairness and quality of service (QoS). The numerical results show that a large sum logarithmic user rate can be obtained with higher probability compared with other conventional power allocation algorithms.","PeriodicalId":292895,"journal":{"name":"Proceedings of 2020 the 10th International Workshop on Computer Science and Engineering","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2020 the 10th International Workshop on Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18178/wcse.2020.06.056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Non-orthogonal multiple access (NOMA) has been proposed to enhance system capacity for visible light communication (VLC) systems. However, the effective power allocation strategy is one of critical problems that needs to be solved in NOMA. In this paper, a new method for multi-user downlink power allocation in VLC NOMA based on reinforcement learning is proposed. This method utilizes distributed multi-agent Q-learning algorithm with low complexity to maximize sum throughput of the multiuser VLC downlink system which is subject to both user fairness and quality of service (QoS). The numerical results show that a large sum logarithmic user rate can be obtained with higher probability compared with other conventional power allocation algorithms.