Personalized Federated Reinforcement Learning for Multi-AAV Assisted Edge Computing

IF 5.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Wireless Communications Letters Pub Date : 2025-04-21 DOI:10.1109/LWC.2025.3562611
Jinwei Chai;Zhe Wang;Chuan Ma;Guanyu Gao;Long Shi
{"title":"Personalized Federated Reinforcement Learning for Multi-AAV Assisted Edge Computing","authors":"Jinwei Chai;Zhe Wang;Chuan Ma;Guanyu Gao;Long Shi","doi":"10.1109/LWC.2025.3562611","DOIUrl":null,"url":null,"abstract":"In this letter, we propose an innovative autonomous aerial vehicle (AAV)-assisted computation offloading scheme, where multiple AAVs with heterogeneous computational capabilities collaborate to provide task offloading services for mobile users (MUs). Our objective is to minimize the weighted sum of the MUs’ maximum task processing delay and the AAVs’ average energy consumption by jointly optimizing the AAVs’ service locations, velocities, user association and offloading policies. We model this problem as a fully decentralized partially observable Markov decision process (Dec-POMDP) and propose a personalized federated reinforcement learning (FRL) algorithm, named federated knowledge distillation based parameterized deep Q-network (FKD-PDQN), to collaboratively optimize the AAVs’ policies in a decentralized and cost-efficient manner. In this approach, each AAV optimizes its local policy using PDQN, and updates its model by aggregating the distilled knowledge from the neighboring AAVs through an attention mechanism. Simulation results reveal that the proposed algorithm effectively reduces the average delay, energy consumption, and communication overhead, offering significant improvements over baseline algorithms such as Independent PDQN (IPDQN), Federated PDQN (FPDQN), and Krum.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"14 7","pages":"2074-2078"},"PeriodicalIF":5.5000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10970733/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this letter, we propose an innovative autonomous aerial vehicle (AAV)-assisted computation offloading scheme, where multiple AAVs with heterogeneous computational capabilities collaborate to provide task offloading services for mobile users (MUs). Our objective is to minimize the weighted sum of the MUs’ maximum task processing delay and the AAVs’ average energy consumption by jointly optimizing the AAVs’ service locations, velocities, user association and offloading policies. We model this problem as a fully decentralized partially observable Markov decision process (Dec-POMDP) and propose a personalized federated reinforcement learning (FRL) algorithm, named federated knowledge distillation based parameterized deep Q-network (FKD-PDQN), to collaboratively optimize the AAVs’ policies in a decentralized and cost-efficient manner. In this approach, each AAV optimizes its local policy using PDQN, and updates its model by aggregating the distilled knowledge from the neighboring AAVs through an attention mechanism. Simulation results reveal that the proposed algorithm effectively reduces the average delay, energy consumption, and communication overhead, offering significant improvements over baseline algorithms such as Independent PDQN (IPDQN), Federated PDQN (FPDQN), and Krum.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向多自动驾驶辅助边缘计算的个性化联邦强化学习
在这封信中,我们提出了一种创新的自主飞行器(AAV)辅助计算卸载方案,其中多个具有异构计算能力的AAV协作为移动用户(mu)提供任务卸载服务。我们的目标是通过联合优化自动驾驶汽车的服务位置、速度、用户关联和卸载策略,使自动驾驶汽车的最大任务处理延迟和平均能耗的加权和最小。我们将这一问题建模为一个完全分散的部分可观察马尔可夫决策过程(Dec-POMDP),并提出了一种个性化的联邦强化学习(FRL)算法,称为基于联邦知识蒸馏的参数化深度q -网络(FKD-PDQN),以分散和经济高效的方式协同优化aav的策略。在该方法中,每个AAV使用PDQN优化其本地策略,并通过关注机制通过聚合来自相邻AAV的提炼知识来更新其模型。仿真结果表明,该算法有效地降低了平均延迟、能耗和通信开销,与独立PDQN (IPDQN)、联邦PDQN (FPDQN)和Krum等基线算法相比有显著改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Wireless Communications Letters
IEEE Wireless Communications Letters Engineering-Electrical and Electronic Engineering
CiteScore
12.30
自引率
6.30%
发文量
481
期刊介绍: IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.
期刊最新文献
ADSIM: Novel Adaptive Deep-Stacked Intelligent Metasurfaces for AAV-Assisted Covert Communications Two-Timescale Antenna Movement for UAV-Mounted Movable Antenna-Enabled Integrated Sensing and Communication Multi-Satellite Coordinated Beam Hopping for Interference Mitigation under Tilted Beam Effects: A Graph-Theoretic Approach CUMA Uplink for Interference-Resilient Multi-Cell Networks A Game of Replay Attack With an Age of Information Perspective
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1