Jingjing Wang;Hui Zhang;Xu Han;Jiaxiang Zhao;Jiangzhou Wang
{"title":"Lyapunov-Assisted Decentralized Dynamic Offloading Strategy Based on Deep Reinforcement Learning","authors":"Jingjing Wang;Hui Zhang;Xu Han;Jiaxiang Zhao;Jiangzhou Wang","doi":"10.1109/JIOT.2024.3498839","DOIUrl":null,"url":null,"abstract":"To enhance the edge offloading capabilities of massive Internet of Things (IoT) devices with limited resources, a novel task offloading algorithm, namely, reduced target deep deterministic policy gradient (RT-DDPG), is proposed, which can generate near-optimal offloading decisions on the user and edge server sides, especially in mobile edge computing (MEC) and multiuser multiple input multiple output (MIMO) scenarios. In the RT-DDPG algorithm, the combination of Lyapunov optimization and improved deep deterministic policy gradient (DDPG) not only reduces the Q-value estimation bias of the neural network, but also constrains the long-term stability of the queue and reduces buffering delay. Moreover, by placing the algorithm agent independently on the device side, each device can adaptively formulate a decentralized computing offloading strategy based on environmental information. The simulation results show that with the help of the RT-DDPG algorithm, the optimal dynamic offloading strategy can be learned in the continuous action space. Compared with traditional reinforcement learning and other greedy strategy algorithms, the RT-DDPG algorithm can reduce the long-term average computing cost of users by 50%.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 7","pages":"8368-8380"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10753491/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
To enhance the edge offloading capabilities of massive Internet of Things (IoT) devices with limited resources, a novel task offloading algorithm, namely, reduced target deep deterministic policy gradient (RT-DDPG), is proposed, which can generate near-optimal offloading decisions on the user and edge server sides, especially in mobile edge computing (MEC) and multiuser multiple input multiple output (MIMO) scenarios. In the RT-DDPG algorithm, the combination of Lyapunov optimization and improved deep deterministic policy gradient (DDPG) not only reduces the Q-value estimation bias of the neural network, but also constrains the long-term stability of the queue and reduces buffering delay. Moreover, by placing the algorithm agent independently on the device side, each device can adaptively formulate a decentralized computing offloading strategy based on environmental information. The simulation results show that with the help of the RT-DDPG algorithm, the optimal dynamic offloading strategy can be learned in the continuous action space. Compared with traditional reinforcement learning and other greedy strategy algorithms, the RT-DDPG algorithm can reduce the long-term average computing cost of users by 50%.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.