无人机辅助MEC的快速自适应优化：一种约束决策转换器方法

IF 5.1 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Wireless Communications Letters Pub Date : 2025-01-21 DOI:10.1109/LWC.2025.3532213

Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin

{"title":"无人机辅助MEC的快速自适应优化：一种约束决策转换器方法","authors":"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin","doi":"10.1109/LWC.2025.3532213","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"14 4","pages":"1084-1088"},"PeriodicalIF":5.1000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach\",\"authors\":\"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin\",\"doi\":\"10.1109/LWC.2025.3532213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.\",\"PeriodicalId\":13343,\"journal\":{\"name\":\"IEEE Wireless Communications Letters\",\"volume\":\"14 4\",\"pages\":\"1084-1088\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Wireless Communications Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848143/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848143/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

深度强化学习（DRL）已广泛应用于自主飞行器（AAV）辅助移动边缘计算（MEC）网络的动态资源分配。然而，一旦系统约束发生变化，将训练良好的DRL模型推广到新的MEC场景通常是具有挑战性的，因为从头开始重新训练DRL模型既费时又耗力。在这封信中，我们共同优化了AAV的轨迹和计算资源分配，以在电池容量和服务质量（QoS）约束下最大化基于公平的吞吐量。将序列优化问题表述为约束马尔可夫决策过程（CMDP），并通过约束DRL算法求解。为了推广各种能源和QoS约束下的优化资源分配策略，我们提出了一种基于离线预训练和在线微调的约束决策转换器（CDT）框架。特别是，CDT首先在约束DRL算法收集的训练样本上进行离线预训练，然后在线微调以快速适应不可见的约束阈值。仿真结果表明，与基准DRL算法相比，CDT在电池容量和QoS约束下能够有效提高基于公平性的吞吐量，并且在约束条件发生变化时收敛速度较快。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach

Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Wireless Communications Letters Engineering-Electrical and Electronic Engineering

CiteScore

12.30

自引率

6.30%

发文量

481

期刊介绍： IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.