Distributed Multi-Agent Reinforcement Learning for Heterogeneous NOMA-ALOHA Systems

IF 7 1区 计算机科学 Q1 TELECOMMUNICATIONS IEEE Transactions on Cognitive Communications and Networking Pub Date : 2024-10-07 DOI:10.1109/TCCN.2024.3474709
Xueyu Wu;Youngwook Ko;Andy M. Tyrrell
{"title":"Distributed Multi-Agent Reinforcement Learning for Heterogeneous NOMA-ALOHA Systems","authors":"Xueyu Wu;Youngwook Ko;Andy M. Tyrrell","doi":"10.1109/TCCN.2024.3474709","DOIUrl":null,"url":null,"abstract":"With ever-growing machine type users in the 6G wireless ecosystems, uncontrolled multiple access control (MAC) is vital to alleviate random collision and fading in their transmission. In this paper, 2-steps random access method is applied for a learning-aided non-orthogonal random access (NORA) system. Specifically, each user independently selects a slot and a power level for uplink packet transmission without any information about other users’ selection and channel state information (CSI); and the base station (BS) performs successive interference cancellation (SIC) to decode packets from multiple users with the use of power differences on the same slot. To design a model-free multiple access under growing complexity and CSI uncertainty, the joint slot and power level selecting problem is modelled as a Markov decision process (MDP) where actions are slot-power pairs. Multi-state Q-Learning algorithms and a confidence-aided Q-Learning method are tailored for the NORA system to solve the MDP under heterogeneous environments. Simulation results show that the three proposed algorithms help the distributed users to find their strategies for slot and power level selections, improving system throughput and fairness simultaneously. The proposed algorithms are particularly shown to make superior performance compared to the benchmarks in high congestion traffics scenarios. This is crucial for achieving massive connectivity in 6G ecosystems, which requires intelligent random access designs to accommodate the growing number of machine type users in diverse conditions.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"11 3","pages":"1902-1912"},"PeriodicalIF":7.0000,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10706088/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

With ever-growing machine type users in the 6G wireless ecosystems, uncontrolled multiple access control (MAC) is vital to alleviate random collision and fading in their transmission. In this paper, 2-steps random access method is applied for a learning-aided non-orthogonal random access (NORA) system. Specifically, each user independently selects a slot and a power level for uplink packet transmission without any information about other users’ selection and channel state information (CSI); and the base station (BS) performs successive interference cancellation (SIC) to decode packets from multiple users with the use of power differences on the same slot. To design a model-free multiple access under growing complexity and CSI uncertainty, the joint slot and power level selecting problem is modelled as a Markov decision process (MDP) where actions are slot-power pairs. Multi-state Q-Learning algorithms and a confidence-aided Q-Learning method are tailored for the NORA system to solve the MDP under heterogeneous environments. Simulation results show that the three proposed algorithms help the distributed users to find their strategies for slot and power level selections, improving system throughput and fairness simultaneously. The proposed algorithms are particularly shown to make superior performance compared to the benchmarks in high congestion traffics scenarios. This is crucial for achieving massive connectivity in 6G ecosystems, which requires intelligent random access designs to accommodate the growing number of machine type users in diverse conditions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
异构 NOMA-ALOHA 系统的分布式多代理强化学习
随着6G无线生态系统中机器类型用户的不断增长,不受控制的多址控制(MAC)对于减轻其传输中的随机碰撞和衰落至关重要。本文将两步随机存取方法应用于学习辅助非正交随机存取系统。具体而言,每个用户独立地选择用于上行分组传输的插槽和功率电平,而不需要任何关于其他用户选择和信道状态信息(CSI)的信息;基站(BS)执行连续干扰消除(SIC),利用同一槽上的功率差异对来自多个用户的数据包进行解码。为了设计一种日益复杂和CSI不确定性条件下的无模型多址接入,将节点槽和功率选择问题建模为一个马尔可夫决策过程(MDP),其中动作是槽-功率对。为解决异构环境下的MDP问题,为NORA系统量身定制了多状态Q-Learning算法和置信度辅助Q-Learning方法。仿真结果表明,这三种算法帮助分布式用户找到自己的插槽和功率选择策略,同时提高了系统吞吐量和公平性。在高拥塞流量场景下,与基准测试相比,所提出的算法具有更好的性能。这对于在6G生态系统中实现大规模连接至关重要,这需要智能随机访问设计,以适应不同条件下越来越多的机器类型用户。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Cognitive Communications and Networking
IEEE Transactions on Cognitive Communications and Networking Computer Science-Artificial Intelligence
CiteScore
15.50
自引率
7.00%
发文量
108
期刊介绍: The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.
期刊最新文献
IEEE Communications Society Coverage Optimization in RIS-enabled Satellite-Terrestrial Networks: A Digital Twin-based Spatial-Temporal Approach Confidence-guided Prototypical Contrastive Domain Adaptation for Cross-domain Automatic Modulation Classification Curated Collaborative AI Edge with Network Data Analytics for B5G/6G Radio Access Networks Convolutional Autoencoder-Enhanced Semantic Communication in Optical Fiber Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1