Decomposition-based learning in drone-assisted wireless-powered mobile edge computing networks

IF 7.5 2区计算机科学 Q1 TELECOMMUNICATIONS Digital Communications and Networks Pub Date : 2024-12-01 DOI:10.1016/j.dcan.2023.11.010

Xiaoyi Zhou , Liang Huang , Tong Ye , Weiqiang Sun

{"title":"Decomposition-based learning in drone-assisted wireless-powered mobile edge computing networks","authors":"Xiaoyi Zhou , Liang Huang , Tong Ye , Weiqiang Sun","doi":"10.1016/j.dcan.2023.11.010","DOIUrl":null,"url":null,"abstract":"<div><div>This paper investigates the multi-Unmanned Aerial Vehicle (UAV)-assisted wireless-powered Mobile Edge Computing (MEC) system, where UAVs provide computation and powering services to mobile terminals. We aim to maximize the number of completed computation tasks by jointly optimizing the offloading decisions of all terminals and the trajectory planning of all UAVs. The action space of the system is extremely large and grows exponentially with the number of UAVs. In this case, single-agent learning will require an overlarge neural network, resulting in insufficient exploration. However, the offloading decisions and trajectory planning are two subproblems performed by different executants, providing an opportunity for problem-solving. We thus adopt the idea of decomposition and propose a 2-Tiered Multi-agent Soft Actor-Critic (2T-MSAC) algorithm, decomposing a single neural network into multiple small-scale networks. In the first tier, a single agent is used for offloading decisions, and an online pretrained model based on imitation learning is specially designed to accelerate the training process of this agent. In the second tier, UAVs utilize multiple agents to plan their trajectories. Each agent exerts its influence on the parameter update of other agents through actions and rewards, thereby achieving joint optimization. Simulation results demonstrate that the proposed algorithm can be applied to scenarios with various location distributions of terminals, outperforming existing benchmarks that perform well only in specific scenarios. In particular, 2T-MSAC increases the number of completed tasks by 45.5% in the scenario with uneven terminal distributions. Moreover, the pretrained model based on imitation learning reduces the convergence time of 2T-MSAC by 58.2%.</div></div>","PeriodicalId":48631,"journal":{"name":"Digital Communications and Networks","volume":"10 6","pages":"Pages 1769-1781"},"PeriodicalIF":7.5000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Communications and Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352864823001712","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

This paper investigates the multi-Unmanned Aerial Vehicle (UAV)-assisted wireless-powered Mobile Edge Computing (MEC) system, where UAVs provide computation and powering services to mobile terminals. We aim to maximize the number of completed computation tasks by jointly optimizing the offloading decisions of all terminals and the trajectory planning of all UAVs. The action space of the system is extremely large and grows exponentially with the number of UAVs. In this case, single-agent learning will require an overlarge neural network, resulting in insufficient exploration. However, the offloading decisions and trajectory planning are two subproblems performed by different executants, providing an opportunity for problem-solving. We thus adopt the idea of decomposition and propose a 2-Tiered Multi-agent Soft Actor-Critic (2T-MSAC) algorithm, decomposing a single neural network into multiple small-scale networks. In the first tier, a single agent is used for offloading decisions, and an online pretrained model based on imitation learning is specially designed to accelerate the training process of this agent. In the second tier, UAVs utilize multiple agents to plan their trajectories. Each agent exerts its influence on the parameter update of other agents through actions and rewards, thereby achieving joint optimization. Simulation results demonstrate that the proposed algorithm can be applied to scenarios with various location distributions of terminals, outperforming existing benchmarks that perform well only in specific scenarios. In particular, 2T-MSAC increases the number of completed tasks by 45.5% in the scenario with uneven terminal distributions. Moreover, the pretrained model based on imitation learning reduces the convergence time of 2T-MSAC by 58.2%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

无人机辅助无线移动边缘计算网络中基于分解的学习

本文研究了多无人机（UAV）辅助的无线供电移动边缘计算（MEC）系统，其中无人机为移动终端提供计算和供电服务。通过联合优化所有终端的卸载决策和所有无人机的轨迹规划，以最大限度地完成计算任务。系统的作用空间非常大，并且随着无人机数量的增加呈指数增长。在这种情况下，单智能体学习将需要一个过大的神经网络，导致探索不足。然而，卸载决策和轨迹规划是由不同执行者执行的两个子问题，为解决问题提供了机会。因此，我们采用分解的思想，提出了一种两层多智能体软行为者评价（2T-MSAC）算法，将单个神经网络分解为多个小规模网络。在第一层，使用单个智能体进行决策卸载，并专门设计了基于模仿学习的在线预训练模型来加速该智能体的训练过程。在第二层，无人机利用多个代理来规划它们的轨迹。每个agent通过动作和奖励对其他agent的参数更新施加影响，从而实现联合优化。仿真结果表明，该算法可以应用于各种终端位置分布的场景，优于仅在特定场景下表现良好的现有基准。特别是在终端分布不均匀的场景下，2T-MSAC的任务完成量提高了45.5%。此外，基于模仿学习的预训练模型使2T-MSAC的收敛时间缩短了58.2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Digital Communications and Networks Computer Science-Hardware and Architecture

CiteScore

12.80

自引率

5.10%

发文量

915

审稿时长

30 weeks

期刊介绍： Digital Communications and Networks is a prestigious journal that emphasizes on communication systems and networks. We publish only top-notch original articles and authoritative reviews, which undergo rigorous peer-review. We are proud to announce that all our articles are fully Open Access and can be accessed on ScienceDirect. Our journal is recognized and indexed by eminent databases such as the Science Citation Index Expanded (SCIE) and Scopus. In addition to regular articles, we may also consider exceptional conference papers that have been significantly expanded. Furthermore, we periodically release special issues that focus on specific aspects of the field. In conclusion, Digital Communications and Networks is a leading journal that guarantees exceptional quality and accessibility for researchers and scholars in the field of communication systems and networks.