Robust Q-learning for Fast And Optimal Flying Base Station Placement Aided By Digital Twin For Emergency Use

T. Guo
{"title":"Robust Q-learning for Fast And Optimal Flying Base Station Placement Aided By Digital Twin For Emergency Use","authors":"T. Guo","doi":"10.1109/VTC2022-Fall57202.2022.10012827","DOIUrl":null,"url":null,"abstract":"This paper studies a use case for quickly connecting users with an aerial bass station (BS) in emergency by leveraging digital twin (DT) and robust reinforcement learning (RL). Scattered communities of users are assigned a limited number of channels, and the flying BS is autonomously and optimally placed according to predefined criteria. Q-learning, a common type of RL, is employed as a solution to optimization of BS placement. Two optimization objectives are considered to maximize the per-user data rate in the worst condition and minimize the total BS transmitted power, respectively. To overcome resource limitations of the aerial BS, the RL training with many iterations is done in the DT virtual space connected to the physical space via an aerial BS. In particular, a practically sound max-min technique is proposed to handle the measurement and prediction uncertainties. It is shown that, even if the models used in DT are imperfect and measurements are inaccurate, nearly-optimal results can be obtained in DT. Compared to RL training 100% in the physical space, a huge number of BS moves can be avoided and a significant amount of time and energy can be saved. The assessment results suggest that model and measurement errors, especially when applying DT, should be seriously considered, and the robust optimization technique has potential to handle the uncertainties.","PeriodicalId":326047,"journal":{"name":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VTC2022-Fall57202.2022.10012827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper studies a use case for quickly connecting users with an aerial bass station (BS) in emergency by leveraging digital twin (DT) and robust reinforcement learning (RL). Scattered communities of users are assigned a limited number of channels, and the flying BS is autonomously and optimally placed according to predefined criteria. Q-learning, a common type of RL, is employed as a solution to optimization of BS placement. Two optimization objectives are considered to maximize the per-user data rate in the worst condition and minimize the total BS transmitted power, respectively. To overcome resource limitations of the aerial BS, the RL training with many iterations is done in the DT virtual space connected to the physical space via an aerial BS. In particular, a practically sound max-min technique is proposed to handle the measurement and prediction uncertainties. It is shown that, even if the models used in DT are imperfect and measurements are inaccurate, nearly-optimal results can be obtained in DT. Compared to RL training 100% in the physical space, a huge number of BS moves can be avoided and a significant amount of time and energy can be saved. The assessment results suggest that model and measurement errors, especially when applying DT, should be seriously considered, and the robust optimization technique has potential to handle the uncertainties.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于数字孪生的快速优化飞行基站鲁棒q学习
本文研究了利用数字孪生(DT)和鲁棒强化学习(RL)在紧急情况下快速连接用户与空中低音站(BS)的用例。分散的用户社区被分配了有限数量的频道,飞行的BS是根据预定义的标准自主和最佳放置的。Q-learning是一种常见的RL类型,它被用来解决BS放置的优化问题。考虑了两个优化目标,分别是在最坏条件下最大化每用户数据速率和最小化总BS传输功率。为了克服空中BS的资源限制,在通过空中BS连接到物理空间的DT虚拟空间中进行多次迭代的RL训练。特别提出了一种实用的极大极小法来处理测量和预测的不确定性。结果表明,即使DT中使用的模型不完善,测量不准确,也可以获得接近最优的结果。与100%在物理空间进行RL训练相比,可以避免大量的BS动作,节省大量的时间和精力。评估结果表明,模型误差和测量误差,特别是应用DT时,应认真考虑,鲁棒优化技术具有处理不确定性的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Non-Orthogonal Neighbor Election Random Access for Distributed 6G Wireless Networks Coverage Performance Analysis of Piggyback Mobile IoT in 5G Vehicular Networks Performance Comparison of Error-Control Schemes in Collaborative Multiple-Input Multiple-Output Systems Valuation-Aware Federated Learning: An Auction-Based Approach for User Selection Design of Robust LoS-MIMO Transmission in HAPS Feeder Link
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1