Joint Beamforming in RIS-Assisted Multi-User Transmission Design: A Model-Driven Deep Reinforcement Learning Framework

IF 8.3 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Communications Pub Date : 2024-11-05 DOI:10.1109/TCOMM.2024.3492065
Weijie Jin;Jing Zhang;Chao-Kai Wen;Shi Jin;Fu-Chun Zheng
{"title":"Joint Beamforming in RIS-Assisted Multi-User Transmission Design: A Model-Driven Deep Reinforcement Learning Framework","authors":"Weijie Jin;Jing Zhang;Chao-Kai Wen;Shi Jin;Fu-Chun Zheng","doi":"10.1109/TCOMM.2024.3492065","DOIUrl":null,"url":null,"abstract":"The deployment of multiple reconfigurable intelligent surfaces (RIS) is a promising strategy to enhance wireless system performance. However, joint beamforming in multi-RIS assisted systems faces significant challenges due to the increased number of optimization variables, non-convex objective functions, and constraints. In this study, we propose an algorithm based on weighted minimum mean square error optimization and the successive convex approximation algorithm, maximizing the weighted sum rate in a double-RIS assisted downlink multi-user multiple-input single-output system. We also present a general framework for model-driven deep learning that addresses the limitations of existing methods, which often lack flexibility to different channels and suffer from a large training burden due to the high-dimensional action space of deep reinforcement learning (DRL). Initially, we configure the step size in the proposed algorithm as trainable, accelerating convergence. Then, a recurrent neural network generates the step size for iterations, allowing dynamic iteration extension in varying environmental conditions. We enhance the neural network’s self-adaptability by introducing a model-driven DRL algorithm, integrating expert knowledge into the DRL actor network’s design. Simulation results demonstrate up to 30% performance improvement over traditional algorithms, achieved by our model-driven framework. The proposed model-driven DRL shows higher capacity for dynamic extension and rapid adaptation to new environments.","PeriodicalId":13041,"journal":{"name":"IEEE Transactions on Communications","volume":"73 5","pages":"3184-3198"},"PeriodicalIF":8.3000,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10744542/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The deployment of multiple reconfigurable intelligent surfaces (RIS) is a promising strategy to enhance wireless system performance. However, joint beamforming in multi-RIS assisted systems faces significant challenges due to the increased number of optimization variables, non-convex objective functions, and constraints. In this study, we propose an algorithm based on weighted minimum mean square error optimization and the successive convex approximation algorithm, maximizing the weighted sum rate in a double-RIS assisted downlink multi-user multiple-input single-output system. We also present a general framework for model-driven deep learning that addresses the limitations of existing methods, which often lack flexibility to different channels and suffer from a large training burden due to the high-dimensional action space of deep reinforcement learning (DRL). Initially, we configure the step size in the proposed algorithm as trainable, accelerating convergence. Then, a recurrent neural network generates the step size for iterations, allowing dynamic iteration extension in varying environmental conditions. We enhance the neural network’s self-adaptability by introducing a model-driven DRL algorithm, integrating expert knowledge into the DRL actor network’s design. Simulation results demonstrate up to 30% performance improvement over traditional algorithms, achieved by our model-driven framework. The proposed model-driven DRL shows higher capacity for dynamic extension and rapid adaptation to new environments.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
RIS 辅助多用户传输设计中的联合波束成形:模型驱动的深度强化学习框架
部署多个可重构智能表面(RIS)是提高无线系统性能的一种很有前途的策略。然而,由于优化变量、非凸目标函数和约束的增加,多ris辅助系统中的联合波束形成面临着重大挑战。在本研究中,我们提出了一种基于加权最小均方误差优化和连续凸逼近算法的算法,以最大化双ris辅助下行多用户多输入单输出系统的加权和率。我们还提出了一个模型驱动深度学习的通用框架,该框架解决了现有方法的局限性,这些方法通常缺乏对不同通道的灵活性,并且由于深度强化学习(DRL)的高维动作空间而遭受巨大的训练负担。最初,我们将算法中的步长配置为可训练的,从而加速收敛。然后,循环神经网络生成迭代的步长,允许在不同的环境条件下动态扩展迭代。通过引入模型驱动的DRL算法,将专家知识集成到DRL行动者网络的设计中,增强了神经网络的自适应能力。仿真结果表明,我们的模型驱动框架比传统算法的性能提高了30%。模型驱动的DRL具有较强的动态扩展能力和快速适应能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Communications
IEEE Transactions on Communications 工程技术-电信学
CiteScore
16.10
自引率
8.40%
发文量
528
审稿时长
4.1 months
期刊介绍: The IEEE Transactions on Communications is dedicated to publishing high-quality manuscripts that showcase advancements in the state-of-the-art of telecommunications. Our scope encompasses all aspects of telecommunications, including telephone, telegraphy, facsimile, and television, facilitated by electromagnetic propagation methods such as radio, wire, aerial, underground, coaxial, and submarine cables, as well as waveguides, communication satellites, and lasers. We cover telecommunications in various settings, including marine, aeronautical, space, and fixed station services, addressing topics such as repeaters, radio relaying, signal storage, regeneration, error detection and correction, multiplexing, carrier techniques, communication switching systems, data communications, and communication theory. Join us in advancing the field of telecommunications through groundbreaking research and innovation.
期刊最新文献
A Novel Cross-Entropy Receiver for Random Time-Hopping Covert Satellite Systems Rydberg-Atom-Based Superdirective Receivers: Array Modeling and Performance Analysis Resource Allocation in RIS-assisted Integrated Sensing, Communication, and Computation Network Coarse-to-Fine Doppler Compensation in 6G VLEO: Dual-Head Temporal Convolutional Network Acquisition with CP-Aided Tracking Accelerating Convergence in Ultra-Dense 6G Sensor Networks via Dual Shannon Entropy Control
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1