Federated Policy Distillation for Digital Twin-Enabled Intelligent Resource Trading in 5G Network Slicing

IF 5.4 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Network and Service Management Pub Date : 2024-10-09 DOI:10.1109/TNSM.2024.3476480

Daniel Ayepah-Mensah;Guolin Sun;Gordon Owusu Boateng;Guisong Liu

{"title":"Federated Policy Distillation for Digital Twin-Enabled Intelligent Resource Trading in 5G Network Slicing","authors":"Daniel Ayepah-Mensah;Guolin Sun;Gordon Owusu Boateng;Guisong Liu","doi":"10.1109/TNSM.2024.3476480","DOIUrl":null,"url":null,"abstract":"Resource sharing in radio access networks (RAN) can be conceptualized as a resource trading process between infrastructure providers (InPs) and multiple mobile virtual network operators (MVNO), where InPs lease essential network resources, such as spectrum and infrastructure, to MVNOs. Given the dynamic nature of RANs, deep reinforcement learning (DRL) is a more suitable approach to decision-making and resource optimization that ensures adaptive and efficient resource allocation strategies. In RAN slicing, DRL struggles due to imbalanced data distribution and reliance on high-quality training data. In addition, the trade-off between the global solution and individual agent goals can lead to oscillatory behavior, preventing convergence to an optimal solution. Therefore, we propose a collaborative intelligent resource trading framework with a graph-based digital twin (DT) for multiple InPs and MVNOs based on Federated DRL. First, we present a customized mutual policy distillation scheme for resource trading, where complex MVNO teacher policies are distilled into InP student models and vice versa. This mutual distillation encourages collaboration to achieve personalized resource trading decisions that reach the optimal local and global solution. Second, the DT uses a graph-based model to capture the dynamic interactions between InPs and MVNOs to improve resource-trade decisions. DT can accurately predict resource prices and demand from MVNO to provide high-quality training data. In addition, DT identifies the underlying patterns and trends through advanced analytics, enabling proactive resource allocation and pricing strategies. The simulation results and analysis confirm the effectiveness and robustness of the proposed framework to an unbalanced data distribution.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"22 1","pages":"361-379"},"PeriodicalIF":5.4000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Network and Service Management","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10711938/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Resource sharing in radio access networks (RAN) can be conceptualized as a resource trading process between infrastructure providers (InPs) and multiple mobile virtual network operators (MVNO), where InPs lease essential network resources, such as spectrum and infrastructure, to MVNOs. Given the dynamic nature of RANs, deep reinforcement learning (DRL) is a more suitable approach to decision-making and resource optimization that ensures adaptive and efficient resource allocation strategies. In RAN slicing, DRL struggles due to imbalanced data distribution and reliance on high-quality training data. In addition, the trade-off between the global solution and individual agent goals can lead to oscillatory behavior, preventing convergence to an optimal solution. Therefore, we propose a collaborative intelligent resource trading framework with a graph-based digital twin (DT) for multiple InPs and MVNOs based on Federated DRL. First, we present a customized mutual policy distillation scheme for resource trading, where complex MVNO teacher policies are distilled into InP student models and vice versa. This mutual distillation encourages collaboration to achieve personalized resource trading decisions that reach the optimal local and global solution. Second, the DT uses a graph-based model to capture the dynamic interactions between InPs and MVNOs to improve resource-trade decisions. DT can accurately predict resource prices and demand from MVNO to provide high-quality training data. In addition, DT identifies the underlying patterns and trends through advanced analytics, enabling proactive resource allocation and pricing strategies. The simulation results and analysis confirm the effectiveness and robustness of the proposed framework to an unbalanced data distribution.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

5G网络切片中数字双机智能资源交易的联邦策略提炼

无线接入网（RAN）中的资源共享可以被定义为基础设施提供商（inp）和多个移动虚拟网络运营商（MVNO）之间的资源交易过程，其中inp向MVNO租赁必要的网络资源，如频谱和基础设施。考虑到ran的动态特性，深度强化学习（DRL）是一种更适合于决策和资源优化的方法，可以确保自适应和有效的资源分配策略。在RAN切片中，由于数据分布不平衡和对高质量训练数据的依赖，DRL遇到了困难。此外，全局解决方案和个体代理目标之间的权衡可能导致振荡行为，阻止收敛到最优解决方案。因此，我们提出了一种基于联邦DRL的基于图形的数字孪生（DT）的协作智能资源交易框架，用于多个inp和mvno。首先，我们提出了一个定制的资源交易相互策略蒸馏方案，其中复杂的MVNO教师策略被提炼成InP学生模型，反之亦然。这种相互升华鼓励协作，以实现个性化的资源交易决策，从而达到最佳的本地和全球解决方案。其次，DT使用基于图的模型来捕获inp和mvno之间的动态交互，以改进资源交易决策。DT可以从MVNO中准确预测资源价格和需求，提供高质量的培训数据。此外，DT通过高级分析识别潜在的模式和趋势，从而实现主动的资源分配和定价策略。仿真结果和分析验证了该框架对不平衡数据分布的有效性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Network and Service Management Computer Science-Computer Networks and Communications

CiteScore

9.30

自引率

15.10%

发文量

325

期刊介绍： IEEE Transactions on Network and Service Management will publish (online only) peerreviewed archival quality papers that advance the state-of-the-art and practical applications of network and service management. Theoretical research contributions (presenting new concepts and techniques) and applied contributions (reporting on experiences and experiments with actual systems) will be encouraged. These transactions will focus on the key technical issues related to: Management Models, Architectures and Frameworks; Service Provisioning, Reliability and Quality Assurance; Management Functions; Enabling Technologies; Information and Communication Models; Policies; Applications and Case Studies; Emerging Technologies and Standards.