Distributed Multi-Agent Reinforcement Learning for Heterogeneous NOMA-ALOHA Systems

IF 7 1区计算机科学 Q1 TELECOMMUNICATIONS IEEE Transactions on Cognitive Communications and Networking Pub Date : 2024-10-07 DOI:10.1109/TCCN.2024.3474709

Xueyu Wu;Youngwook Ko;Andy M. Tyrrell

{"title":"Distributed Multi-Agent Reinforcement Learning for Heterogeneous NOMA-ALOHA Systems","authors":"Xueyu Wu;Youngwook Ko;Andy M. Tyrrell","doi":"10.1109/TCCN.2024.3474709","DOIUrl":null,"url":null,"abstract":"With ever-growing machine type users in the 6G wireless ecosystems, uncontrolled multiple access control (MAC) is vital to alleviate random collision and fading in their transmission. In this paper, 2-steps random access method is applied for a learning-aided non-orthogonal random access (NORA) system. Specifically, each user independently selects a slot and a power level for uplink packet transmission without any information about other users’ selection and channel state information (CSI); and the base station (BS) performs successive interference cancellation (SIC) to decode packets from multiple users with the use of power differences on the same slot. To design a model-free multiple access under growing complexity and CSI uncertainty, the joint slot and power level selecting problem is modelled as a Markov decision process (MDP) where actions are slot-power pairs. Multi-state Q-Learning algorithms and a confidence-aided Q-Learning method are tailored for the NORA system to solve the MDP under heterogeneous environments. Simulation results show that the three proposed algorithms help the distributed users to find their strategies for slot and power level selections, improving system throughput and fairness simultaneously. The proposed algorithms are particularly shown to make superior performance compared to the benchmarks in high congestion traffics scenarios. This is crucial for achieving massive connectivity in 6G ecosystems, which requires intelligent random access designs to accommodate the growing number of machine type users in diverse conditions.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"11 3","pages":"1902-1912"},"PeriodicalIF":7.0000,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10706088/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

With ever-growing machine type users in the 6G wireless ecosystems, uncontrolled multiple access control (MAC) is vital to alleviate random collision and fading in their transmission. In this paper, 2-steps random access method is applied for a learning-aided non-orthogonal random access (NORA) system. Specifically, each user independently selects a slot and a power level for uplink packet transmission without any information about other users’ selection and channel state information (CSI); and the base station (BS) performs successive interference cancellation (SIC) to decode packets from multiple users with the use of power differences on the same slot. To design a model-free multiple access under growing complexity and CSI uncertainty, the joint slot and power level selecting problem is modelled as a Markov decision process (MDP) where actions are slot-power pairs. Multi-state Q-Learning algorithms and a confidence-aided Q-Learning method are tailored for the NORA system to solve the MDP under heterogeneous environments. Simulation results show that the three proposed algorithms help the distributed users to find their strategies for slot and power level selections, improving system throughput and fairness simultaneously. The proposed algorithms are particularly shown to make superior performance compared to the benchmarks in high congestion traffics scenarios. This is crucial for achieving massive connectivity in 6G ecosystems, which requires intelligent random access designs to accommodate the growing number of machine type users in diverse conditions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

异构 NOMA-ALOHA 系统的分布式多代理强化学习

随着6G无线生态系统中机器类型用户的不断增长，不受控制的多址控制（MAC）对于减轻其传输中的随机碰撞和衰落至关重要。本文将两步随机存取方法应用于学习辅助非正交随机存取系统。具体而言，每个用户独立地选择用于上行分组传输的插槽和功率电平，而不需要任何关于其他用户选择和信道状态信息（CSI）的信息；基站（BS）执行连续干扰消除（SIC），利用同一槽上的功率差异对来自多个用户的数据包进行解码。为了设计一种日益复杂和CSI不确定性条件下的无模型多址接入，将节点槽和功率选择问题建模为一个马尔可夫决策过程（MDP），其中动作是槽-功率对。为解决异构环境下的MDP问题，为NORA系统量身定制了多状态Q-Learning算法和置信度辅助Q-Learning方法。仿真结果表明，这三种算法帮助分布式用户找到自己的插槽和功率选择策略，同时提高了系统吞吐量和公平性。在高拥塞流量场景下，与基准测试相比，所提出的算法具有更好的性能。这对于在6G生态系统中实现大规模连接至关重要，这需要智能随机访问设计，以适应不同条件下越来越多的机器类型用户。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Cognitive Communications and Networking Computer Science-Artificial Intelligence

CiteScore

15.50

自引率

7.00%

发文量

108

期刊介绍： The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.