Multi-agent deep reinforcement learning-based joint channel selection and power control method

IF 4.9 3区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Computers & Electrical Engineering Pub Date : 2025-04-01 Epub Date: 2025-02-12 DOI:10.1016/j.compeleceng.2025.110147
Weiwei Bai , Guoqiang Zheng , Weibing Xia , Yu Mu , Yujun Xue
{"title":"Multi-agent deep reinforcement learning-based joint channel selection and power control method","authors":"Weiwei Bai ,&nbsp;Guoqiang Zheng ,&nbsp;Weibing Xia ,&nbsp;Yu Mu ,&nbsp;Yujun Xue","doi":"10.1016/j.compeleceng.2025.110147","DOIUrl":null,"url":null,"abstract":"<div><div>Aiming at the problem of system performance degradation caused by dynamic spectrum access in underlay mode within cognitive radio networks, we propose a multi-agent deep reinforcement learning-based joint channel selection and power control (MA-JCSPC) method. This method formulates the spectrum access problem in underlay mode as an optimization problem of joint channel selection and power control, and transforms this optimization problem into a multi-agent Markov decision process. By designing a multi-agent deep reinforcement learning framework with centralized training and decentralized execution, the channel selection and power control strategies for secondary users are optimized. In this process, a nonlinear reward function is designed by introducing a penalty term, and a novel initial action selection strategy based on a action guidance term is employed to solve the sparse rewards and ineffective exploration problems. The simulation results demonstrate that the MA-JCSPC method surpasses the compared methods in convergence, resource allocation rationality, and throughput. Compared to the centralized deep reinforcement learning (C-DRL) method, the proposed method achieves an average improvement of 6.7% and 9.1% in the sum throughput of secondary users, respectively, under variations in the throughput requirements of the primary user and the number of secondary users.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"123 ","pages":"Article 110147"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625000904","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Aiming at the problem of system performance degradation caused by dynamic spectrum access in underlay mode within cognitive radio networks, we propose a multi-agent deep reinforcement learning-based joint channel selection and power control (MA-JCSPC) method. This method formulates the spectrum access problem in underlay mode as an optimization problem of joint channel selection and power control, and transforms this optimization problem into a multi-agent Markov decision process. By designing a multi-agent deep reinforcement learning framework with centralized training and decentralized execution, the channel selection and power control strategies for secondary users are optimized. In this process, a nonlinear reward function is designed by introducing a penalty term, and a novel initial action selection strategy based on a action guidance term is employed to solve the sparse rewards and ineffective exploration problems. The simulation results demonstrate that the MA-JCSPC method surpasses the compared methods in convergence, resource allocation rationality, and throughput. Compared to the centralized deep reinforcement learning (C-DRL) method, the proposed method achieves an average improvement of 6.7% and 9.1% in the sum throughput of secondary users, respectively, under variations in the throughput requirements of the primary user and the number of secondary users.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多智能体深度强化学习的联合通道选择与功率控制方法
针对认知无线网络底层模式下动态频谱接入导致系统性能下降的问题,提出了一种基于多智能体深度强化学习的联合信道选择与功率控制(MA-JCSPC)方法。该方法将底层模式下的频谱接入问题表述为联合信道选择和功率控制的优化问题,并将该优化问题转化为多智能体马尔可夫决策过程。通过设计集中训练、分散执行的多智能体深度强化学习框架,优化二次用户的信道选择和功率控制策略。在此过程中,通过引入惩罚项来设计非线性奖励函数,并采用基于行动指导项的新颖初始行动选择策略来解决奖励稀疏和无效探索问题。仿真结果表明,MA-JCSPC方法在收敛性、资源分配合理性和吞吐量等方面都优于比较方法。与集中式深度强化学习(C-DRL)方法相比,在主用户吞吐量需求和从用户数量变化的情况下,该方法在二级用户总吞吐量上平均分别提高了6.7%和9.1%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Electrical Engineering
Computers & Electrical Engineering 工程技术-工程:电子与电气
CiteScore
9.20
自引率
7.00%
发文量
661
审稿时长
47 days
期刊介绍: The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency. Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.
期刊最新文献
EVerGen: Optimal path planning for electric vehicle using modified genetic algorithm in internet of vehicular things SRAM PUF-based logic locking framework for IoT authentication and IP protection Neuroauthnet: A brainwave-based authentication framework using BCI and deep learning for privacy-preserving identity verification A novel hybrid gated recurrent ensemble model for stage-based clinical diagnosis of neurological disorders Chaotic atom search optimized fractional-order proportional-integral-derivative control for energy-efficient electric vehicles regenerative braking
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1