无人机辅助MEC的快速自适应优化:一种约束决策转换器方法

IF 5.1 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Wireless Communications Letters Pub Date : 2025-01-21 DOI:10.1109/LWC.2025.3532213
Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin
{"title":"无人机辅助MEC的快速自适应优化:一种约束决策转换器方法","authors":"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin","doi":"10.1109/LWC.2025.3532213","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"14 4","pages":"1084-1088"},"PeriodicalIF":5.1000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach\",\"authors\":\"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin\",\"doi\":\"10.1109/LWC.2025.3532213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.\",\"PeriodicalId\":13343,\"journal\":{\"name\":\"IEEE Wireless Communications Letters\",\"volume\":\"14 4\",\"pages\":\"1084-1088\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Wireless Communications Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848143/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848143/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

深度强化学习(DRL)已广泛应用于自主飞行器(AAV)辅助移动边缘计算(MEC)网络的动态资源分配。然而,一旦系统约束发生变化,将训练良好的DRL模型推广到新的MEC场景通常是具有挑战性的,因为从头开始重新训练DRL模型既费时又耗力。在这封信中,我们共同优化了AAV的轨迹和计算资源分配,以在电池容量和服务质量(QoS)约束下最大化基于公平的吞吐量。将序列优化问题表述为约束马尔可夫决策过程(CMDP),并通过约束DRL算法求解。为了推广各种能源和QoS约束下的优化资源分配策略,我们提出了一种基于离线预训练和在线微调的约束决策转换器(CDT)框架。特别是,CDT首先在约束DRL算法收集的训练样本上进行离线预训练,然后在线微调以快速适应不可见的约束阈值。仿真结果表明,与基准DRL算法相比,CDT在电池容量和QoS约束下能够有效提高基于公平性的吞吐量,并且在约束条件发生变化时收敛速度较快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach
Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Wireless Communications Letters
IEEE Wireless Communications Letters Engineering-Electrical and Electronic Engineering
CiteScore
12.30
自引率
6.30%
发文量
481
期刊介绍: IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.
期刊最新文献
DQN-Based Beam Tracking Using Reward-Aware Exploration Without Additional Signaling Low-Complexity Fairness-Oriented Secure Beamforming Design for CAPA-Based Systems Feasibility-Guided Beamforming for Cell-Free ISAC: A Graph Neural Deep-Unrolling Approach Inversion-Free Fractional-Programming Precoding for RIS-aided OTFS LEO Satellite Communication Performance Analysis of RIS-Assisted Terahertz Communications: Joint Impact of Antenna Misalignment and Imperfect CSI on Physical Layer Security
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1