Privacy-Preserving Multi-Agent Deep Reinforcement Learning for Effective Resource Auction in Multi-Access Edge Computing

IF 7 1区 计算机科学 Q1 TELECOMMUNICATIONS IEEE Transactions on Cognitive Communications and Networking Pub Date : 2024-11-21 DOI:10.1109/TCCN.2024.3499342
Feiran You;Xin Yuan;Wei Ni;Abbas Jamalipour
{"title":"Privacy-Preserving Multi-Agent Deep Reinforcement Learning for Effective Resource Auction in Multi-Access Edge Computing","authors":"Feiran You;Xin Yuan;Wei Ni;Abbas Jamalipour","doi":"10.1109/TCCN.2024.3499342","DOIUrl":null,"url":null,"abstract":"Multi-access edge computing (MEC) offloads services for mobile users to facilitate the integration of idle cloudlet resources and bring cloud services closer to users. Existing studies have focused primarily on task coordination and resource allocation with strict time constraints, and typically overlooked the potential privacy leakage of users’ participation strategies in MEC. This paper proposes a novel solution to computation offloading and privacy protection in MEC networks using a Multi-agent Deep Deterministic Policy Gradient (MADDPG) framework. Our approach utilizes game theory to encourage computation offloading by modeling the interaction between cloudlets, Data Center Operators (DCOs), and users as a stochastic auction game. We formulate the computation offloading as an auction game with multiple bidders and incomplete information, and use MADDPG to find an optimal solution. To ensure privacy protection, we design a local Differential Privacy (DP) method in the MADDPG algorithm. With an <inline-formula> <tex-math>$(\\epsilon, \\delta)$ </tex-math></inline-formula>-DP mechanism, the local DP ensures that the sampled transitions, including the information on users’ actions, states, and corresponding rewards, are protected from exploitation. Analyses corroborate the effectiveness of our approach in satisfying DP and converging to an equilibrium. Simulations demonstrate the approach achieves 126.75% better quality-of-experience than a knapsack-based benchmark, when there are 60 cloudlets and up to 100 users.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"11 3","pages":"1887-1901"},"PeriodicalIF":7.0000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10763434/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Multi-access edge computing (MEC) offloads services for mobile users to facilitate the integration of idle cloudlet resources and bring cloud services closer to users. Existing studies have focused primarily on task coordination and resource allocation with strict time constraints, and typically overlooked the potential privacy leakage of users’ participation strategies in MEC. This paper proposes a novel solution to computation offloading and privacy protection in MEC networks using a Multi-agent Deep Deterministic Policy Gradient (MADDPG) framework. Our approach utilizes game theory to encourage computation offloading by modeling the interaction between cloudlets, Data Center Operators (DCOs), and users as a stochastic auction game. We formulate the computation offloading as an auction game with multiple bidders and incomplete information, and use MADDPG to find an optimal solution. To ensure privacy protection, we design a local Differential Privacy (DP) method in the MADDPG algorithm. With an $(\epsilon, \delta)$ -DP mechanism, the local DP ensures that the sampled transitions, including the information on users’ actions, states, and corresponding rewards, are protected from exploitation. Analyses corroborate the effectiveness of our approach in satisfying DP and converging to an equilibrium. Simulations demonstrate the approach achieves 126.75% better quality-of-experience than a knapsack-based benchmark, when there are 60 cloudlets and up to 100 users.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
为多接入边缘计算中的有效资源拍卖提供隐私保护的多代理深度强化学习
MEC (Multi-access edge computing)为移动用户卸载业务,便于整合闲置的云资源,让云服务更贴近用户。现有的研究主要集中在严格时间约束下的任务协调和资源分配,往往忽视了MEC中用户参与策略的潜在隐私泄露。本文提出了一种基于多智能体深度确定性策略梯度(madpg)框架的MEC网络计算卸载和隐私保护的新解决方案。我们的方法利用博弈论通过将云、数据中心运营商(dco)和用户之间的交互建模为随机拍卖游戏来鼓励计算卸载。我们将计算卸载描述为具有多个投标人和不完全信息的拍卖博弈,并使用madpg来寻找最优解。为了保证隐私保护,我们在madpg算法中设计了一种局部差分隐私(DP)方法。通过$(\epsilon, \delta)$ -DP机制,本地DP确保采样的转换(包括用户的操作、状态和相应奖励的信息)不被利用。分析证实了我们的方法在满足DP和收敛到平衡点上的有效性。仿真结果表明,该方法达到了126.75% better quality-of-experience than a knapsack-based benchmark, when there are 60 cloudlets and up to 100 users.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Cognitive Communications and Networking
IEEE Transactions on Cognitive Communications and Networking Computer Science-Artificial Intelligence
CiteScore
15.50
自引率
7.00%
发文量
108
期刊介绍: The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.
期刊最新文献
Toward Robust UAV Swarm Coordination in Low-Altitude Networks: Online Learning Control under System and Channel Uncertainty Beamforming Design for Beyond Diagonal STAR-RIS-Assisted Near-Field ISAC Systems: A Power Minimization Approach Hierarchical Attention-Driven Multi-Agent Reinforcement Learning for Resource Allocation in Cell-Free Massive MIMO with Integrated Sensing and Communication Enhancing Backscatter Communication via Time-Reversal Filtering: Scheme Design and Performance Analysis Joint Routing and Model Pruning for Decentralized Federated Learning in Bandwidth-Constrained Multi-Hop Wireless Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1