On the Training of Reinforcement Learning-based Algorithms in 5G and Beyond Radio Access Networks

2022 IEEE 8th International Conference on Network Softwarization (NetSoft) Pub Date : 2022-06-27 DOI:10.1109/NetSoft54395.2022.9844032

Irene Vilà Muñoz, J. Pérez-Romero, O. Sallent

{"title":"On the Training of Reinforcement Learning-based Algorithms in 5G and Beyond Radio Access Networks","authors":"Irene Vilà Muñoz, J. Pérez-Romero, O. Sallent","doi":"10.1109/NetSoft54395.2022.9844032","DOIUrl":null,"url":null,"abstract":"Reinforcement Learning (RL)-based algorithmic solutions have been profusely proposed in recent years for addressing multiple problems in the Radio Access Network (RAN). However, how RL algorithms have to be trained for a successful exploitation has not received sufficient attention. To address this limitation, which is particularly relevant given the peculiarities of wireless communications, this paper proposes a functional framework for training RL strategies in the RAN. The framework is aligned with the O-RAN Alliance machine learning workflow and introduces specific functionalities for RL, such as the way of specifying the training datasets, the mechanisms to monitor the performance of the trained policies during inference in the real network, and the capability to conduct a retraining if necessary. The proposed framework is illustrated with a relevant use case in 5G, namely RAN slicing, by considering a Deep Q-Network algorithm for capacity sharing. Finally, insights on other possible applicability examples of the proposed framework are provided.","PeriodicalId":125799,"journal":{"name":"2022 IEEE 8th International Conference on Network Softwarization (NetSoft)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Network Softwarization (NetSoft)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NetSoft54395.2022.9844032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Reinforcement Learning (RL)-based algorithmic solutions have been profusely proposed in recent years for addressing multiple problems in the Radio Access Network (RAN). However, how RL algorithms have to be trained for a successful exploitation has not received sufficient attention. To address this limitation, which is particularly relevant given the peculiarities of wireless communications, this paper proposes a functional framework for training RL strategies in the RAN. The framework is aligned with the O-RAN Alliance machine learning workflow and introduces specific functionalities for RL, such as the way of specifying the training datasets, the mechanisms to monitor the performance of the trained policies during inference in the real network, and the capability to conduct a retraining if necessary. The proposed framework is illustrated with a relevant use case in 5G, namely RAN slicing, by considering a Deep Q-Network algorithm for capacity sharing. Finally, insights on other possible applicability examples of the proposed framework are provided.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

5G及以上无线接入网络中基于强化学习的算法训练研究

近年来，基于强化学习(RL)的算法解决方案被大量提出，用于解决无线接入网(RAN)中的多种问题。然而，如何训练强化学习算法才能成功利用还没有得到足够的重视。考虑到无线通信的特殊性，为了解决这一限制，本文提出了一个在RAN中训练强化学习策略的功能框架。该框架与O-RAN联盟机器学习工作流程保持一致，并为强化学习引入了特定的功能，例如指定训练数据集的方式，在真实网络中进行推理期间监控训练策略性能的机制，以及在必要时进行再训练的能力。通过考虑用于容量共享的Deep Q-Network算法，用5G中的相关用例(即RAN切片)说明了所提出的框架。最后，对所提出的框架的其他可能的适用性示例提供了见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE 8th International Conference on Network Softwarization (NetSoft)

自引率

0.00%

发文量