{"title":"Model-Driven Bayesian Reinforcement Learning for IRS-Assisted Massive MIMO-OFDM Channel Feedback, Beamforming, and IRS Control","authors":"Yuanyuan Bi;Vincent K. N. Lau;Danny H. K. Tsang","doi":"10.1109/TWC.2024.3522098","DOIUrl":null,"url":null,"abstract":"In Intelligent Reflecting Surface (IRS)-assisted massive Multiple-Input Multiple-Output (MIMO) systems, the downlink channel state information (CSI) needs to be fed back to the base station (BS) and utilized to perform the beamforming and IRS control for high spectral efficiency performance. However, the intricate nature of these systems, characterized by a vast number of antennas, subcarriers, and IRS elements, exacerbates the CSI feedback overhead and complicates the optimization of beamforming and IRS parameters, potentially compromising spectral efficiency. Addressing these challenges, this paper introduces a Bayesian Reinforcement Learning (BRL)-based approach, named IRS-CSI-BRL, for efficient CSI feedback, beamforming, and IRS control. Firstly, the IRS-CSI-BRL approach utilizes the equivalent CSI for optimization, aligning with current channel estimation protocols without necessitating extensive modifications. Secondly, it employs a practical IRS control model that optimizes the effective capacitance of IRS control circuits rather than IRS reflection coefficients, accurately reflecting the IRS’s frequency-responsive behavior to enhance system performance. Additionally, we advocate bypassing the reconstruction of the CSI at the BS to eliminate information irrelevant to beamforming and IRS control, thereby boosting feedback efficiency. Another distinctive feature of the proposed scheme is that its output format is probability distributions, which enables the incorporation of model-assisted knowledge about the latent space and boosts the algorithm’s robustness. Simulation results demonstrate that the proposed IRS-CSI-BRL scheme significantly outperforms start-of-the-art solutions in feedback overhead reduction and system data rate enhancement while maintaining exceptional robustness. Furthermore, this approach maintains flexibility, allowing for the incorporation of an additional training loss function for full CSI reconstruction if needed.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 3","pages":"2514-2529"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10820055/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In Intelligent Reflecting Surface (IRS)-assisted massive Multiple-Input Multiple-Output (MIMO) systems, the downlink channel state information (CSI) needs to be fed back to the base station (BS) and utilized to perform the beamforming and IRS control for high spectral efficiency performance. However, the intricate nature of these systems, characterized by a vast number of antennas, subcarriers, and IRS elements, exacerbates the CSI feedback overhead and complicates the optimization of beamforming and IRS parameters, potentially compromising spectral efficiency. Addressing these challenges, this paper introduces a Bayesian Reinforcement Learning (BRL)-based approach, named IRS-CSI-BRL, for efficient CSI feedback, beamforming, and IRS control. Firstly, the IRS-CSI-BRL approach utilizes the equivalent CSI for optimization, aligning with current channel estimation protocols without necessitating extensive modifications. Secondly, it employs a practical IRS control model that optimizes the effective capacitance of IRS control circuits rather than IRS reflection coefficients, accurately reflecting the IRS’s frequency-responsive behavior to enhance system performance. Additionally, we advocate bypassing the reconstruction of the CSI at the BS to eliminate information irrelevant to beamforming and IRS control, thereby boosting feedback efficiency. Another distinctive feature of the proposed scheme is that its output format is probability distributions, which enables the incorporation of model-assisted knowledge about the latent space and boosts the algorithm’s robustness. Simulation results demonstrate that the proposed IRS-CSI-BRL scheme significantly outperforms start-of-the-art solutions in feedback overhead reduction and system data rate enhancement while maintaining exceptional robustness. Furthermore, this approach maintains flexibility, allowing for the incorporation of an additional training loss function for full CSI reconstruction if needed.
期刊介绍:
The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols.
The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies.
Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.