Power allocation using spatio-temporal graph neural networks and reinforcement learning

IF 2.1 4区 计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Wireless Networks Pub Date : 2024-07-17 DOI:10.1007/s11276-024-03814-1
Saeed Jamshidiha, Vahid Pourahmadi, Abbas Mohammadi, Mehdi Bennis
{"title":"Power allocation using spatio-temporal graph neural networks and reinforcement learning","authors":"Saeed Jamshidiha, Vahid Pourahmadi, Abbas Mohammadi, Mehdi Bennis","doi":"10.1007/s11276-024-03814-1","DOIUrl":null,"url":null,"abstract":"<p>This research investigates power allocation in wireless device-to-device (D2D) networks using spatio-temporal graph neural networks (STGNNs). Specifically, we address the challenge of sum-rate maximization in D2D networks by formulating it as a reinforcement learning problem. In our approach, STGNNs act as agents, generating optimal power allocations to maximize the reward function, which is the overall sum-rate of the network. Our study operates under the realistic assumption of delayed, local channel state information (CSI). Various user mobility patterns, including constant positions, velocities, and accelerations are simulated. The robustness of our proposed method is evaluated against delayed and noisy CSI, which are crucial factors in real-world scenarios. Furthermore, the fairness of our approach is compared to the well-established load-spillage algorithm, which is guaranteed to converge to the globally optimal solution of the alpha-fair utility maximization problem. Finally, the convergence behavior of our method is analyzed in comparison to the policy gradient approach. Our empirical results demonstrate that the proposed STGNN significantly outperforms both the WMMSE benchmark and memoryless graph neural networks (GNNs) across all simulated scenarios, and converges to the globally optimal solution of the load-spillage algorithm, with lower computational complexity. Specifically, it achieves a remarkable performance gap of over 400% compared to the WMMSE algorithm and approximately 10% improvement over the memoryless GNN. These findings underscore the efficacy of STGNNs in addressing power allocation challenges in wireless D2D networks.</p>","PeriodicalId":23750,"journal":{"name":"Wireless Networks","volume":"50 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wireless Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11276-024-03814-1","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This research investigates power allocation in wireless device-to-device (D2D) networks using spatio-temporal graph neural networks (STGNNs). Specifically, we address the challenge of sum-rate maximization in D2D networks by formulating it as a reinforcement learning problem. In our approach, STGNNs act as agents, generating optimal power allocations to maximize the reward function, which is the overall sum-rate of the network. Our study operates under the realistic assumption of delayed, local channel state information (CSI). Various user mobility patterns, including constant positions, velocities, and accelerations are simulated. The robustness of our proposed method is evaluated against delayed and noisy CSI, which are crucial factors in real-world scenarios. Furthermore, the fairness of our approach is compared to the well-established load-spillage algorithm, which is guaranteed to converge to the globally optimal solution of the alpha-fair utility maximization problem. Finally, the convergence behavior of our method is analyzed in comparison to the policy gradient approach. Our empirical results demonstrate that the proposed STGNN significantly outperforms both the WMMSE benchmark and memoryless graph neural networks (GNNs) across all simulated scenarios, and converges to the globally optimal solution of the load-spillage algorithm, with lower computational complexity. Specifically, it achieves a remarkable performance gap of over 400% compared to the WMMSE algorithm and approximately 10% improvement over the memoryless GNN. These findings underscore the efficacy of STGNNs in addressing power allocation challenges in wireless D2D networks.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用时空图神经网络和强化学习进行功率分配
本研究利用时空图神经网络(STGNN)研究无线设备对设备(D2D)网络中的功率分配。具体来说,我们将 D2D 网络中的总速率最大化问题表述为强化学习问题,从而解决了这一难题。在我们的方法中,STGNNs 作为代理,生成最优功率分配,以最大化奖励函数,即网络的总和速率。我们的研究是在延迟本地信道状态信息(CSI)的现实假设下进行的。我们模拟了各种用户移动模式,包括恒定位置、速度和加速度。评估了我们提出的方法对延迟和嘈杂 CSI 的鲁棒性,这些都是真实世界场景中的关键因素。此外,还将我们方法的公平性与成熟的负载溢出算法进行了比较,后者可保证收敛到阿尔法公平效用最大化问题的全局最优解。最后,将我们的方法与策略梯度方法进行了收敛行为分析。我们的实证结果表明,在所有模拟场景中,所提出的 STGNN 都明显优于 WMMSE 基准和无记忆图神经网络 (GNN),并能以更低的计算复杂度收敛到负载溢出算法的全局最优解。具体地说,与 WMMSE 算法相比,它实现了超过 400% 的显著性能差距,与无记忆 GNN 相比则提高了约 10%。这些发现强调了 STGNN 在解决无线 D2D 网络中功率分配难题方面的功效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Wireless Networks
Wireless Networks 工程技术-电信学
CiteScore
7.70
自引率
3.30%
发文量
314
审稿时长
5.5 months
期刊介绍: The wireless communication revolution is bringing fundamental changes to data networking, telecommunication, and is making integrated networks a reality. By freeing the user from the cord, personal communications networks, wireless LAN''s, mobile radio networks and cellular systems, harbor the promise of fully distributed mobile computing and communications, any time, anywhere. Focusing on the networking and user aspects of the field, Wireless Networks provides a global forum for archival value contributions documenting these fast growing areas of interest. The journal publishes refereed articles dealing with research, experience and management issues of wireless networks. Its aim is to allow the reader to benefit from experience, problems and solutions described.
期刊最新文献
An EEG signal-based music treatment system for autistic children using edge computing devices A DV-Hop localization algorithm corrected based on multi-strategy sparrow algorithm in sea-surface wireless sensor networks Multi-Layer Collaborative Federated Learning architecture for 6G Open RAN Cloud-edge collaboration-based task offloading strategy in railway IoT for intelligent detection Exploiting data transmission for route discoveries in mobile ad hoc networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1