Nguyen Thi Thanh Van;Nguyen Le Tuan;Nguyen Cong Luong;Tien Hoa Nguyen;Shaohan Feng;Shimin Gong;Dusit Niyato;Dong In Kim
{"title":"Network Access Selection for URLLC and eMBB Applications in Sub-6 GHz-mmWave-THz Networks: Game Theory Versus Multi-Agent Reinforcement Learning","authors":"Nguyen Thi Thanh Van;Nguyen Le Tuan;Nguyen Cong Luong;Tien Hoa Nguyen;Shaohan Feng;Shimin Gong;Dusit Niyato;Dong In Kim","doi":"10.1109/TCOMM.2024.3524944","DOIUrl":null,"url":null,"abstract":"We investigate a heterogeneous network (HetNet) including sub-6GHz base stations (BSs), mmWave BSs, and THz BSs to support enhanced mobile broadband (eMBB) users and ultra-reliable low-latency communication (URLLC) users. We particularly investigate a user-centric network in which the users locally and dynamically select and switch among BSs over time to achieve their highest utility. Two types of users have different Quality of Service (QoS) requirements. Thus, we design two types of utility functions specifically for the eMBB users and URLLC users. Then, to model the dynamic selection behavior of the users, we propose to use a fractional game with the power-law memory. The fractional game allows the eMBB users and the URLLC users to incorporate their past strategies into their current selection, thus improving their utility. Furthermore, we consider the case that the BSs communicate the system state with each other, and we model the network selection of the users as a multi-agent problem. Then, we propose to use a multi-agent deep reinforcement learning (MADRL) algorithm that enables the URLLC users and eMBB users to make their network selection decision online to achieve their long-term utility. Various simulation results are provided to demonstrate the scalability and effectiveness of the proposed approaches. Particularly, compared with the classical game, the fractional game is able to achieve a higher utility but incurs a higher network adaptation cost. Moreover, the different types of URLLC users (in terms of latency and reliability requirements) and the number of URLLC users in the network significantly affect the total utility and the network selection strategies of the eMBB users. Importantly, given the full observations, the MADRL outperforms both classical and fractional games in terms of total network utility.","PeriodicalId":13041,"journal":{"name":"IEEE Transactions on Communications","volume":"73 8","pages":"6126-6141"},"PeriodicalIF":8.3000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10819441/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
We investigate a heterogeneous network (HetNet) including sub-6GHz base stations (BSs), mmWave BSs, and THz BSs to support enhanced mobile broadband (eMBB) users and ultra-reliable low-latency communication (URLLC) users. We particularly investigate a user-centric network in which the users locally and dynamically select and switch among BSs over time to achieve their highest utility. Two types of users have different Quality of Service (QoS) requirements. Thus, we design two types of utility functions specifically for the eMBB users and URLLC users. Then, to model the dynamic selection behavior of the users, we propose to use a fractional game with the power-law memory. The fractional game allows the eMBB users and the URLLC users to incorporate their past strategies into their current selection, thus improving their utility. Furthermore, we consider the case that the BSs communicate the system state with each other, and we model the network selection of the users as a multi-agent problem. Then, we propose to use a multi-agent deep reinforcement learning (MADRL) algorithm that enables the URLLC users and eMBB users to make their network selection decision online to achieve their long-term utility. Various simulation results are provided to demonstrate the scalability and effectiveness of the proposed approaches. Particularly, compared with the classical game, the fractional game is able to achieve a higher utility but incurs a higher network adaptation cost. Moreover, the different types of URLLC users (in terms of latency and reliability requirements) and the number of URLLC users in the network significantly affect the total utility and the network selection strategies of the eMBB users. Importantly, given the full observations, the MADRL outperforms both classical and fractional games in terms of total network utility.
期刊介绍:
The IEEE Transactions on Communications is dedicated to publishing high-quality manuscripts that showcase advancements in the state-of-the-art of telecommunications. Our scope encompasses all aspects of telecommunications, including telephone, telegraphy, facsimile, and television, facilitated by electromagnetic propagation methods such as radio, wire, aerial, underground, coaxial, and submarine cables, as well as waveguides, communication satellites, and lasers. We cover telecommunications in various settings, including marine, aeronautical, space, and fixed station services, addressing topics such as repeaters, radio relaying, signal storage, regeneration, error detection and correction, multiplexing, carrier techniques, communication switching systems, data communications, and communication theory. Join us in advancing the field of telecommunications through groundbreaking research and innovation.