Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach

arXiv - CS - Networking and Internet Architecture Pub Date : 2024-09-02 DOI:arxiv-2409.01092

Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang

{"title":"Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach","authors":"Wenshuai Liu, Yaru Fu, Yongna Guo, Fu Lee Wang, Wen Sun, Yan Zhang","doi":"arxiv-2409.01092","DOIUrl":null,"url":null,"abstract":"Digital twins (DTs) have emerged as a promising enabler for representing the\nreal-time states of physical worlds and realizing self-sustaining systems. In\npractice, DTs of physical devices, such as mobile users (MUs), are commonly\ndeployed in multi-access edge computing (MEC) networks for the sake of reducing\nlatency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to\nregularly synchronize their status with their DTs. However, MU mobility\nintroduces significant challenges to DT synchronization. Firstly, MU mobility\ntriggers DT migration which could cause synchronization failures. Secondly, MUs\nrequire frequent synchronization with their DTs to ensure DT fidelity.\nNonetheless, DT migration among MEC servers, caused by MU mobility, may occur\ninfrequently. Accordingly, we propose a two-timescale DT synchronization and\nmigration framework with reliability consideration by establishing a non-convex\nstochastic problem to minimize the long-term average energy consumption of MUs.\nWe use Lyapunov theory to convert the reliability constraints and reformulate\nthe new problem as a partially observable Markov decision-making process\n(POMDP). Furthermore, we develop a heterogeneous agent proximal policy\noptimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical\nresults show that our proposed Beta-HAPPO method achieves significant\nimprovements in energy savings when compared with other benchmarks.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Digital twins (DTs) have emerged as a promising enabler for representing the real-time states of physical worlds and realizing self-sustaining systems. In practice, DTs of physical devices, such as mobile users (MUs), are commonly deployed in multi-access edge computing (MEC) networks for the sake of reducing latency. To ensure the accuracy and fidelity of DTs, it is essential for MUs to regularly synchronize their status with their DTs. However, MU mobility introduces significant challenges to DT synchronization. Firstly, MU mobility triggers DT migration which could cause synchronization failures. Secondly, MUs require frequent synchronization with their DTs to ensure DT fidelity. Nonetheless, DT migration among MEC servers, caused by MU mobility, may occur infrequently. Accordingly, we propose a two-timescale DT synchronization and migration framework with reliability consideration by establishing a non-convex stochastic problem to minimize the long-term average energy consumption of MUs. We use Lyapunov theory to convert the reliability constraints and reformulate the new problem as a partially observable Markov decision-making process (POMDP). Furthermore, we develop a heterogeneous agent proximal policy optimization with Beta distribution (Beta-HAPPO) method to solve it. Numerical results show that our proposed Beta-HAPPO method achieves significant improvements in energy savings when compared with other benchmarks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

数字孪生网络的双时标同步和迁移：多代理深度强化学习方法

数字孪生（DTs）已成为表示物理世界实时状态和实现自持系统的一种前景广阔的工具。在实践中，移动用户（MU）等物理设备的数字孪生通常部署在多接入边缘计算（MEC）网络中，以降低延迟。为了确保 DT 的准确性和保真度，MU 必须定期将其状态与 DT 同步。然而，MU 的移动性给 DT 同步带来了巨大挑战。首先，MU 移动会引发 DT 迁移，从而导致同步失败。其次，MU 需要与其 DT 频繁同步，以确保 DT 的保真度。然而，由 MU 移动性引起的 MEC 服务器之间的 DT 迁移可能会频繁发生。因此，我们通过建立一个非凸随机问题来最小化 MU 的长期平均能耗，从而提出了一个考虑可靠性的双时标 DT 同步和迁移框架。我们使用 Lyapunov 理论来转换可靠性约束，并将新问题重新表述为部分可观测马尔可夫决策过程（POMDP）。此外，我们还开发了一种采用 Beta 分布的异构代理近端策略优化（Beta-HAPPO）方法来解决该问题。数值结果表明，与其他基准相比，我们提出的 Beta-HAPPO 方法显著提高了节能效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - CS - Networking and Internet Architecture

自引率

0.00%

发文量

期刊最新文献

CEF: Connecting Elaborate Federal QKD Networks Age-of-Information and Energy Optimization in Digital Twin Edge Networks Blockchain-Enabled IoV: Secure Communication and Trustworthy Decision-Making Micro-orchestration of RAN functions accelerated in FPGA SoC devices LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions