Actor–Critic Reinforcement Learning for Throughput-Optimized Power Allocation in Energy Harvesting NOMA Relay-Assisted Networks

IF 6.3 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Open Journal of the Communications Society Pub Date : 2024-12-09 DOI:10.1109/OJCOMS.2024.3514785
Faeik T. Al Rabee;Ala'Eddin Masadeh;Sharief Abdel-Razeq;Haythem Bany Salameh
{"title":"Actor–Critic Reinforcement Learning for Throughput-Optimized Power Allocation in Energy Harvesting NOMA Relay-Assisted Networks","authors":"Faeik T. Al Rabee;Ala'Eddin Masadeh;Sharief Abdel-Razeq;Haythem Bany Salameh","doi":"10.1109/OJCOMS.2024.3514785","DOIUrl":null,"url":null,"abstract":"In fifth-generation (5G) and beyond (B5G) communication systems, the growing number of connected devices and the increased traffic on the network lead to substantial energy consumption, which requires energy-efficient and high-speed communication solutions. Integrating non-orthogonal multiple access (NOMA), energy harvesting (EH), and millimeter wave (mmWave) technologies has emerged as a powerful approach for achieving massive connectivity and energy-efficient communication paradigms. NOMA-based relay-assisted mmWave networks offer high directivity and enhanced data throughput. However, their design faces significant challenges, such as blockage, limited range, Line-of-Sight (LOS) constraints, and uncertainties in channel gain. Integrating EH and NOMA brings design constraints, namely the uncertainty and dynamic nature of EH sources, that complicate energy management and NOMA’s power multiplexing challenges in optimizing power allocation. These factors require optimizing power and resources to ensure seamless connectivity and energy efficiency. Traditional optimization methods face challenges due to uncertainties in channel gains, EH, and blockages. Although reinforcement learning (RL) is typically used to manage uncertain environments, conventional RL algorithms cannot handle such environments with infinite state and action spaces. To address these challenges, this paper proposes a novel power-allocation framework that integrates an EH-capable source node, a relay, and multiple power-domain NOMA-based users. The proposed framework has two phases. During the first phase, the energy-harvesting source communicates with the relay to maximize the data rate while learning an optimal power allocation policy using an actor-critic approach. This method adapts to the uncertain EH process and varying channel conditions while addressing the limitations associated with infinite state and action spaces inherent in traditional RL for optimal power allocation. The second phase consists of a NOMA-based power allocation mechanism that assigns different powers to the users, such that the data received at the relay are transmitted to its designated users. As it turns out, this problem is non-convex. Hence, we use the sequential convex approximation method to solve this problem. Simulation results demonstrate that the proposed framework significantly outperforms traditional power allocation frameworks in data rate maximization and energy efficiency.","PeriodicalId":33803,"journal":{"name":"IEEE Open Journal of the Communications Society","volume":"5 ","pages":"7941-7953"},"PeriodicalIF":6.3000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10787238","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Communications Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10787238/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

In fifth-generation (5G) and beyond (B5G) communication systems, the growing number of connected devices and the increased traffic on the network lead to substantial energy consumption, which requires energy-efficient and high-speed communication solutions. Integrating non-orthogonal multiple access (NOMA), energy harvesting (EH), and millimeter wave (mmWave) technologies has emerged as a powerful approach for achieving massive connectivity and energy-efficient communication paradigms. NOMA-based relay-assisted mmWave networks offer high directivity and enhanced data throughput. However, their design faces significant challenges, such as blockage, limited range, Line-of-Sight (LOS) constraints, and uncertainties in channel gain. Integrating EH and NOMA brings design constraints, namely the uncertainty and dynamic nature of EH sources, that complicate energy management and NOMA’s power multiplexing challenges in optimizing power allocation. These factors require optimizing power and resources to ensure seamless connectivity and energy efficiency. Traditional optimization methods face challenges due to uncertainties in channel gains, EH, and blockages. Although reinforcement learning (RL) is typically used to manage uncertain environments, conventional RL algorithms cannot handle such environments with infinite state and action spaces. To address these challenges, this paper proposes a novel power-allocation framework that integrates an EH-capable source node, a relay, and multiple power-domain NOMA-based users. The proposed framework has two phases. During the first phase, the energy-harvesting source communicates with the relay to maximize the data rate while learning an optimal power allocation policy using an actor-critic approach. This method adapts to the uncertain EH process and varying channel conditions while addressing the limitations associated with infinite state and action spaces inherent in traditional RL for optimal power allocation. The second phase consists of a NOMA-based power allocation mechanism that assigns different powers to the users, such that the data received at the relay are transmitted to its designated users. As it turns out, this problem is non-convex. Hence, we use the sequential convex approximation method to solve this problem. Simulation results demonstrate that the proposed framework significantly outperforms traditional power allocation frameworks in data rate maximization and energy efficiency.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
能量收集NOMA中继辅助网络中吞吐量优化功率分配的Actor-Critic强化学习
在第五代(5G)及以上(B5G)通信系统中,连接设备数量的增加和网络流量的增加导致了大量的能源消耗,这需要节能和高速的通信解决方案。集成非正交多址(NOMA)、能量收集(EH)和毫米波(mmWave)技术已成为实现大规模连接和节能通信范例的有力方法。基于noma的中继辅助毫米波网络提供高指向性和增强的数据吞吐量。然而,它们的设计面临着重大挑战,例如阻塞、有限范围、视距(LOS)限制以及信道增益的不确定性。集成EH和NOMA带来了设计约束,即EH源的不确定性和动态性,使能量管理和NOMA在优化功率分配方面的功率复用挑战复杂化。这些因素需要优化电源和资源,以确保无缝连接和能源效率。由于通道增益、EH和阻塞的不确定性,传统的优化方法面临挑战。虽然强化学习(RL)通常用于管理不确定环境,但传统的强化学习算法无法处理具有无限状态和动作空间的环境。为了解决这些挑战,本文提出了一种新的功率分配框架,该框架集成了支持eh的源节点、中继和多个基于功率域noma的用户。拟议的框架分为两个阶段。在第一阶段,能量收集源与中继通信以最大化数据速率,同时使用参与者批评方法学习最佳功率分配策略。该方法适应了不确定的EH过程和变化的信道条件,同时解决了传统RL固有的无限状态和动作空间的局限性,实现了最优功率分配。第二阶段由基于noma的功率分配机制组成,该机制将不同的功率分配给用户,从而将中继接收到的数据传输给其指定的用户。事实证明,这个问题是非凸的。因此,我们使用序贯凸逼近法来解决这个问题。仿真结果表明,该框架在数据速率最大化和能效方面明显优于传统的功率分配框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
13.70
自引率
3.80%
发文量
94
审稿时长
10 weeks
期刊介绍: The IEEE Open Journal of the Communications Society (OJ-COMS) is an open access, all-electronic journal that publishes original high-quality manuscripts on advances in the state of the art of telecommunications systems and networks. The papers in IEEE OJ-COMS are included in Scopus. Submissions reporting new theoretical findings (including novel methods, concepts, and studies) and practical contributions (including experiments and development of prototypes) are welcome. Additionally, survey and tutorial articles are considered. The IEEE OJCOMS received its debut impact factor of 7.9 according to the Journal Citation Reports (JCR) 2023. The IEEE Open Journal of the Communications Society covers science, technology, applications and standards for information organization, collection and transfer using electronic, optical and wireless channels and networks. Some specific areas covered include: Systems and network architecture, control and management Protocols, software, and middleware Quality of service, reliability, and security Modulation, detection, coding, and signaling Switching and routing Mobile and portable communications Terminals and other end-user devices Networks for content distribution and distributed computing Communications-based distributed resources control.
期刊最新文献
Link Scheduling in Satellite Networks via Machine Learning Over Riemannian Manifolds Harnessing Meta-Reinforcement Learning for Enhanced Tracking in Geofencing Systems Deep Reinforcement Learning-Based Anti-Jamming Approach for Fast Frequency Hopping Systems 5G Networks Security Mitigation Model: An ANN-ISM Hybrid Approach Enhanced Lightweight Quantum Key Distribution Protocol for Improved Efficiency and Security
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1