Multi - Agent Deep Deterministic Policy Gradient Based Satellite Spectrum/Code Resource Scheduling with Multi-constraint

Zixian Chen, Xiang Chen, Yuhan Dong, Sihui Zheng
{"title":"Multi - Agent Deep Deterministic Policy Gradient Based Satellite Spectrum/Code Resource Scheduling with Multi-constraint","authors":"Zixian Chen, Xiang Chen, Yuhan Dong, Sihui Zheng","doi":"10.1109/ICCCWorkshops55477.2022.9896716","DOIUrl":null,"url":null,"abstract":"For multi-user satellite Internet of Things (IoT) systems operating at lower signal-to-noise ratio, spread spectrum techniques are usually used to combat narrowband interference. In addition, the communication performance in the spread spectrum system depends on the anti-jamming ability of the spreading codes (SCs). Therefore, how to design the SCs distributed scheduling strategies under multi-users requirements and resource constraints has become a crucial problem for satellite IoT systems. In this paper, the number of collisions and the amount of transmitted data are introduced as gauges to measure the distributed scheduling performance of the satellite multi-user systems. Specifically, terminal gateways (TGs) must efficiently and effectively select limited available SCs according to their state at each communication time slot independently. The SCs distributed scheduling problem is formulated as a Markov Decision Process (MDP) along with the observed environments composed of resource status and TGs status. Then a deep rein-forcement learning scheduling algorithm is devised by combining the A2C framework and the idea of multi-user. Simulation results show that the proposed algorithm can achieve much better performance than traditional algorithms in reducing scheduling conflicts and improving communication efficiency. Finally, we draw some conclusions.","PeriodicalId":148869,"journal":{"name":"2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCWorkshops55477.2022.9896716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

For multi-user satellite Internet of Things (IoT) systems operating at lower signal-to-noise ratio, spread spectrum techniques are usually used to combat narrowband interference. In addition, the communication performance in the spread spectrum system depends on the anti-jamming ability of the spreading codes (SCs). Therefore, how to design the SCs distributed scheduling strategies under multi-users requirements and resource constraints has become a crucial problem for satellite IoT systems. In this paper, the number of collisions and the amount of transmitted data are introduced as gauges to measure the distributed scheduling performance of the satellite multi-user systems. Specifically, terminal gateways (TGs) must efficiently and effectively select limited available SCs according to their state at each communication time slot independently. The SCs distributed scheduling problem is formulated as a Markov Decision Process (MDP) along with the observed environments composed of resource status and TGs status. Then a deep rein-forcement learning scheduling algorithm is devised by combining the A2C framework and the idea of multi-user. Simulation results show that the proposed algorithm can achieve much better performance than traditional algorithms in reducing scheduling conflicts and improving communication efficiency. Finally, we draw some conclusions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多智能体深度确定性策略梯度的多约束卫星频谱/码资源调度
对于低信噪比的多用户卫星物联网(IoT)系统,通常采用扩频技术来对抗窄带干扰。此外,扩频系统的通信性能取决于扩频码的抗干扰能力。因此,如何在多用户需求和资源约束下设计卫星物联网分布式调度策略成为卫星物联网系统的关键问题。本文引入碰撞次数和传输数据量作为衡量卫星多用户系统分布式调度性能的指标。具体来说,终端网关(TGs)必须根据每个通信时隙的状态独立高效地选择有限的可用sc。将SCs分布式调度问题与观察到的由资源状态和tg状态组成的环境一起表述为马尔可夫决策过程(MDP)。然后结合A2C框架和多用户思想,设计了一种深度强化学习调度算法。仿真结果表明,该算法在减少调度冲突和提高通信效率方面比传统算法有更好的性能。最后,我们得出一些结论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Data Importance-Assisted Multi-User Scheduling in MIMO Edge Learning Systems Artificial Intelligence Service by Satellite Networks based on Ensemble Learning with Cloud-Edge-End Integration CRS interference handling on NR and LTE overlapping spectrum: Analysis on performance and standard impact Energy Harvesting-Based UAV-Assisted Vehicular Edge Computing: A Deep Reinforcement Learning Approach How Can Reconfigurable Intelligent Surfaces Drive 5G-Advanced Wireless Networks: A Standardization Perspective
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1