{"title":"Digital Twin-Accelerated Online Deep Reinforcement Learning for Admission Control in Sliced Communication Networks","authors":"Zhenyu Tao;Wei Xu;Xiaohu You","doi":"10.1109/TCOMM.2024.3476430","DOIUrl":null,"url":null,"abstract":"The proliferation of diverse wireless services has led to emerging technologies for network slicing. Admission control plays a crucial role in achieving service-oriented goals in sliced communication networks through selective acceptance of service requests. To enhance the performance of admission control in intricate contemporary communication networks, deep reinforcement learning (DRL) has been widely adopted to enhance the effectiveness and flexibility of intelligent admission control. However, due to the simulation-to-reality gap, DRL models trained in a simulation environment can face considerable performance degradation when transferred to the deployment environment. Although online DRL tries to avoid such gaps, its expensive trial-and-error cost poses economic and safety concerns for network operators. We propose a cooperative framework integrating digital twin (DT) and online DRL to address this issue. Specifically, a behavior cloning-based DT is established to parameterize a default admission control policy in real networks, and a DT-accelerated online DRL strategy is then developed for further policy optimization. The DT is constructed as a neural network, featuring a customized output layer to address extensive action spaces in queuing systems. Extensive simulations show that the proposed DRL solution facilitates the stability of the online DRL and accelerates the convergence, yielding a resource utilization improvement of up to 26.39% compared to the state-of-the-art DRL model, while maintaining consistent performance with the online DRL method in terms of long-term revenues. Meanwhile, the proposed solution is versatile and adaptable to various DRL-based network optimization tasks.","PeriodicalId":13041,"journal":{"name":"IEEE Transactions on Communications","volume":"73 4","pages":"2490-2504"},"PeriodicalIF":8.3000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10711852/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The proliferation of diverse wireless services has led to emerging technologies for network slicing. Admission control plays a crucial role in achieving service-oriented goals in sliced communication networks through selective acceptance of service requests. To enhance the performance of admission control in intricate contemporary communication networks, deep reinforcement learning (DRL) has been widely adopted to enhance the effectiveness and flexibility of intelligent admission control. However, due to the simulation-to-reality gap, DRL models trained in a simulation environment can face considerable performance degradation when transferred to the deployment environment. Although online DRL tries to avoid such gaps, its expensive trial-and-error cost poses economic and safety concerns for network operators. We propose a cooperative framework integrating digital twin (DT) and online DRL to address this issue. Specifically, a behavior cloning-based DT is established to parameterize a default admission control policy in real networks, and a DT-accelerated online DRL strategy is then developed for further policy optimization. The DT is constructed as a neural network, featuring a customized output layer to address extensive action spaces in queuing systems. Extensive simulations show that the proposed DRL solution facilitates the stability of the online DRL and accelerates the convergence, yielding a resource utilization improvement of up to 26.39% compared to the state-of-the-art DRL model, while maintaining consistent performance with the online DRL method in terms of long-term revenues. Meanwhile, the proposed solution is versatile and adaptable to various DRL-based network optimization tasks.
期刊介绍:
The IEEE Transactions on Communications is dedicated to publishing high-quality manuscripts that showcase advancements in the state-of-the-art of telecommunications. Our scope encompasses all aspects of telecommunications, including telephone, telegraphy, facsimile, and television, facilitated by electromagnetic propagation methods such as radio, wire, aerial, underground, coaxial, and submarine cables, as well as waveguides, communication satellites, and lasers. We cover telecommunications in various settings, including marine, aeronautical, space, and fixed station services, addressing topics such as repeaters, radio relaying, signal storage, regeneration, error detection and correction, multiplexing, carrier techniques, communication switching systems, data communications, and communication theory. Join us in advancing the field of telecommunications through groundbreaking research and innovation.