Chunmei Xu, Shengheng Liu, Cheng Zhang, Yongming Huang, Luxi Yang
{"title":"基于多智能体强化学习的毫米波网络联合用户调度和波束选择","authors":"Chunmei Xu, Shengheng Liu, Cheng Zhang, Yongming Huang, Luxi Yang","doi":"10.1109/SAM48682.2020.9104386","DOIUrl":null,"url":null,"abstract":"In this paper, we consider a multi-cell downlink mmWave communication network and investigate an efficient transmission scheme for all base stations. Since the beams are highly directed with respected to the user equipments, user scheduling and beam selection strategy should be jointly considered. The objective is to develop the joint user scheduling and beam selection strategy that minimizes the long-term average delay cost while satisfying the instantaneous quality of service constraint of each user. To achieve the long-term performance, a distributed algorithm is proposed to develop the joint strategy based on multi-agent reinforcement learning. Simulation results validate the effectiveness of the proposed intelligent distributed method.","PeriodicalId":6753,"journal":{"name":"2020 IEEE 11th Sensor Array and Multichannel Signal Processing Workshop (SAM)","volume":"34 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Joint User Scheduling and Beam Selection in mmWave Networks Based on Multi-Agent Reinforcement Learning\",\"authors\":\"Chunmei Xu, Shengheng Liu, Cheng Zhang, Yongming Huang, Luxi Yang\",\"doi\":\"10.1109/SAM48682.2020.9104386\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we consider a multi-cell downlink mmWave communication network and investigate an efficient transmission scheme for all base stations. Since the beams are highly directed with respected to the user equipments, user scheduling and beam selection strategy should be jointly considered. The objective is to develop the joint user scheduling and beam selection strategy that minimizes the long-term average delay cost while satisfying the instantaneous quality of service constraint of each user. To achieve the long-term performance, a distributed algorithm is proposed to develop the joint strategy based on multi-agent reinforcement learning. Simulation results validate the effectiveness of the proposed intelligent distributed method.\",\"PeriodicalId\":6753,\"journal\":{\"name\":\"2020 IEEE 11th Sensor Array and Multichannel Signal Processing Workshop (SAM)\",\"volume\":\"34 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 11th Sensor Array and Multichannel Signal Processing Workshop (SAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAM48682.2020.9104386\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 11th Sensor Array and Multichannel Signal Processing Workshop (SAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAM48682.2020.9104386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Joint User Scheduling and Beam Selection in mmWave Networks Based on Multi-Agent Reinforcement Learning
In this paper, we consider a multi-cell downlink mmWave communication network and investigate an efficient transmission scheme for all base stations. Since the beams are highly directed with respected to the user equipments, user scheduling and beam selection strategy should be jointly considered. The objective is to develop the joint user scheduling and beam selection strategy that minimizes the long-term average delay cost while satisfying the instantaneous quality of service constraint of each user. To achieve the long-term performance, a distributed algorithm is proposed to develop the joint strategy based on multi-agent reinforcement learning. Simulation results validate the effectiveness of the proposed intelligent distributed method.