Deep Reinforcement Learning-Based Routing for Space-Terrestrial Networks

Kai-Chu Tsai, Ting-Jui Yao, Pingmu Huang, Cheng-Sen Huang, Zhu Han, Li-Chun Wang
{"title":"Deep Reinforcement Learning-Based Routing for Space-Terrestrial Networks","authors":"Kai-Chu Tsai, Ting-Jui Yao, Pingmu Huang, Cheng-Sen Huang, Zhu Han, Li-Chun Wang","doi":"10.1109/VTC2022-Fall57202.2022.10013028","DOIUrl":null,"url":null,"abstract":"Satellite communication is one primary structure of the future wireless systems. Many corporations and academic institutions are committed to this research, e.g., StarLink. Their aim is to construct a constellation of satellite networks covering the whole world. Thanks to this design, our mobile devices can connect to the Internet anywhere in the world without being restricted by the coverage area of base stations. However, given that the constellation of satellites around the Earth is a system that varies in time, algorithms must adapt to the dynamic topology. To solve the problem, in this paper we investigate how to send data from the base station to the target satellite with minimal delay. We systematically formulate the transmission limitations, uplink, and downlink transmission delay. With well defined limits and delays on space-terrestrial transmissions, we can set rewards and punishments by mathematical analysis to simulate the agent in deep reinforcement learning (DRL), which can explore all choices in the system and optimize the routing algorithm. Finally, the simulation results corroborate a fully functional space-terrestrial network constellation to simulate the actual situation.","PeriodicalId":326047,"journal":{"name":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VTC2022-Fall57202.2022.10013028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Satellite communication is one primary structure of the future wireless systems. Many corporations and academic institutions are committed to this research, e.g., StarLink. Their aim is to construct a constellation of satellite networks covering the whole world. Thanks to this design, our mobile devices can connect to the Internet anywhere in the world without being restricted by the coverage area of base stations. However, given that the constellation of satellites around the Earth is a system that varies in time, algorithms must adapt to the dynamic topology. To solve the problem, in this paper we investigate how to send data from the base station to the target satellite with minimal delay. We systematically formulate the transmission limitations, uplink, and downlink transmission delay. With well defined limits and delays on space-terrestrial transmissions, we can set rewards and punishments by mathematical analysis to simulate the agent in deep reinforcement learning (DRL), which can explore all choices in the system and optimize the routing algorithm. Finally, the simulation results corroborate a fully functional space-terrestrial network constellation to simulate the actual situation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的地空网络路由
卫星通信是未来无线系统的主要结构之一。许多公司和学术机构都致力于这项研究,例如StarLink。他们的目标是建立一个覆盖全球的卫星网络星座。由于这种设计,我们的移动设备可以连接到世界上任何地方的互联网,而不受基站覆盖区域的限制。然而,考虑到地球周围的卫星星座是一个随时间变化的系统,算法必须适应动态拓扑结构。为了解决这一问题,本文研究了如何以最小的延迟从基站向目标卫星发送数据。系统地制定了传输限制、上行链路和下行链路的传输延迟。在明确了空间-地面传输的限制和延迟的情况下,我们可以通过数学分析来设置奖惩,以模拟深度强化学习(DRL)中的智能体,它可以探索系统中的所有选择并优化路由算法。最后,仿真结果证实了一个功能完备的空间-地面网络星座能够模拟实际情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Non-Orthogonal Neighbor Election Random Access for Distributed 6G Wireless Networks Coverage Performance Analysis of Piggyback Mobile IoT in 5G Vehicular Networks Performance Comparison of Error-Control Schemes in Collaborative Multiple-Input Multiple-Output Systems Valuation-Aware Federated Learning: An Auction-Based Approach for User Selection Design of Robust LoS-MIMO Transmission in HAPS Feeder Link
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1