Reinforcement Learning Method for Beam Management in Millimeter-Wave Networks

2019 UK/ China Emerging Technologies (UCET) Pub Date : 2019-08-01 DOI:10.1109/UCET.2019.8881841

Ruiyu Wang, Oluwakayode Onireti, Lei Zhang, M. Imran, Guangmei Ren, Jing Qiu, Tingjian Tian

{"title":"Reinforcement Learning Method for Beam Management in Millimeter-Wave Networks","authors":"Ruiyu Wang, Oluwakayode Onireti, Lei Zhang, M. Imran, Guangmei Ren, Jing Qiu, Tingjian Tian","doi":"10.1109/UCET.2019.8881841","DOIUrl":null,"url":null,"abstract":"With the rapid growth of mobile data demand, the fifth generation (5G) mobile network must exploit the large amount of spectrum in the millimeter wave (mmWave) band to increase the network capacity. Due to the limitation of propagation distance, line-of-sight (LOS) link is highly desirable for mmWave systems. However, LOS channel is not feasible all the time and mmWave is also impacted significantly by the surrounding environment. The LOS signal can be easily blocked by surrounding buildings. Based on this issue, in this paper, we propose to use reinforcement learning to manage the non line of sight (NLOS) scenario. Specifically, we build a model simulating blocked LOS signal for the user equipment (UE) with only NLOS channel available for the UE. Q-Learning is used to select the NLOS beam that meets the UE's quality of service requirements. Simulation results show that Q-Learning can be used to manage the beam selection. In particular, at initial training stage the Q-Learning explores in the environment. However, with the training process, Q-Learning learns from experience and the received power increases significantly and converges to an excellent level.","PeriodicalId":169373,"journal":{"name":"2019 UK/ China Emerging Technologies (UCET)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 UK/ China Emerging Technologies (UCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UCET.2019.8881841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

With the rapid growth of mobile data demand, the fifth generation (5G) mobile network must exploit the large amount of spectrum in the millimeter wave (mmWave) band to increase the network capacity. Due to the limitation of propagation distance, line-of-sight (LOS) link is highly desirable for mmWave systems. However, LOS channel is not feasible all the time and mmWave is also impacted significantly by the surrounding environment. The LOS signal can be easily blocked by surrounding buildings. Based on this issue, in this paper, we propose to use reinforcement learning to manage the non line of sight (NLOS) scenario. Specifically, we build a model simulating blocked LOS signal for the user equipment (UE) with only NLOS channel available for the UE. Q-Learning is used to select the NLOS beam that meets the UE's quality of service requirements. Simulation results show that Q-Learning can be used to manage the beam selection. In particular, at initial training stage the Q-Learning explores in the environment. However, with the training process, Q-Learning learns from experience and the received power increases significantly and converges to an excellent level.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

毫米波网络波束管理的强化学习方法

随着移动数据需求的快速增长，第五代(5G)移动网络必须利用毫米波(mmWave)频段的大量频谱来增加网络容量。由于受传播距离的限制，毫米波系统非常需要视距链路。然而，LOS信道并非一直可行，毫米波也受到周围环境的显著影响。LOS信号很容易被周围的建筑物阻挡。基于这个问题，本文提出使用强化学习来管理非视线(NLOS)场景。具体来说，我们为用户设备(UE)建立了一个模型，模拟阻塞的LOS信号，其中只有NLOS通道可用于UE。Q-Learning用于选择满足终端业务质量要求的NLOS波束。仿真结果表明，Q-Learning可以用于波束选择管理。特别是在初始训练阶段，Q-Learning在环境中进行探索。但是随着训练的进行，Q-Learning从经验中学习，接收到的功率显著增加，并收敛到一个优秀的水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 UK/ China Emerging Technologies (UCET)

自引率

0.00%

发文量