Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin
{"title":"无人机辅助MEC的快速自适应优化:一种约束决策转换器方法","authors":"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin","doi":"10.1109/LWC.2025.3532213","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"14 4","pages":"1084-1088"},"PeriodicalIF":5.1000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach\",\"authors\":\"Changxu Ni;Zhe Wang;Yiyang Ni;Jun Li;Long Shi;Shi Jin\",\"doi\":\"10.1109/LWC.2025.3532213\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.\",\"PeriodicalId\":13343,\"journal\":{\"name\":\"IEEE Wireless Communications Letters\",\"volume\":\"14 4\",\"pages\":\"1084-1088\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Wireless Communications Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848143/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848143/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Fast Adaptive Optimization for AAV-Assisted MEC: A Constrained Decision Transformer Approach
Deep reinforcement learning (DRL) has been widely applied to dynamic resource allocation for the autonomous aerial vehicle (AAV)-assisted mobile edge computing (MEC) networks. However, it is often challenging to generalize a well-trained DRL model to new MEC scenarios once the system constraints change, since retraining DRL models from scratch is time and energy consuming. In this letter, we jointly optimize the AAV’s trajectory and computing resource allocation for maximizing the fairness-based throughput under the battery capacity and quality of service (QoS) constraints. The sequential optimization problem is formulated as a constrained Markov decision process (CMDP) and solved via the constrained DRL algorithms. To generalize the optimized resource allocation policies across various energy and QoS constraints, we propose an offline pre-training and online fine-tuning based constrained Decision Transformer (CDT) framework. In particular, the CDT is first pre-trained on the training samples collected by the constrained DRL algorithm offline, and then fine-tuned online for rapid adaptation to the unseen constraint thresholds. Simulation results show that compared with the benchmark DRL algorithms, the CDT is capable of effectively improving the fairness-based throughput under the battery capacity and QoS constraints, and demonstrates rapid convergence when constraints change.
期刊介绍:
IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.