Multi-Agent DRL-Based Energy Harvesting for Freshness of Data in UAV-Assisted Wireless Sensor Networks

IEEE Transactions on Network and Service Management (IF 5.4; Zone 2, Computer Science; JCR Q1, Computer Science, Information Systems). Pub Date: 2024-09-04. DOI: 10.1109/TNSM.2024.3454217

Mesfin Leranso Betalo;Supeng Leng;Hayla Nahom Abishu;Abegaz Mohammed Seid;Maged Fakirah;Aiman Erbad;Mohsen Guizani

IEEE Transactions on Network and Service Management, vol. 21, no. 6, pp. 6527-6541. Journal Article. https://ieeexplore.ieee.org/document/10664472/


Abstract

In sixth-generation (6G) networks, unmanned aerial vehicles (UAVs) are expected to be widely used as aerial base stations (ABS) due to their adaptability, low deployment costs, and ultra-low-latency responses. However, UAVs consume large amounts of power to collect data from multiple sensor nodes (SNs). This can limit their flight time and transmission efficiency, resulting in delays and low information freshness. In this paper, we present a multi-access edge computing (MEC)-integrated UAV-assisted wireless sensor network (WSN) with a laser technology-based energy harvesting (EH) system that makes the UAV act as a flying energy charger to address these issues. This work aims to minimize the age of information (AoI) and improve energy efficiency by jointly optimizing the UAV trajectories, EH, task scheduling, and data offloading. The joint optimization problem is formulated as a Markov decision process (MDP) and then transformed into a stochastic game model to handle the complexity and dynamics of the environment. We adopt a multi-agent deep Q-network (MADQN) algorithm to solve the formulated optimization problem. With the MADQN algorithm, UAVs can determine the best data collection and EH decisions to minimize their energy consumption and efficiently collect data from multiple SNs, leading to reduced AoI and improved energy efficiency. Compared with benchmark algorithms such as deep deterministic policy gradient (DDPG), Dueling DQN, asynchronous advantage actor-critic (A3C), and Greedy, the MADQN algorithm has a lower average AoI and improves energy efficiency by 95.5%, 89.9%, 78.02%, and 65.52%, respectively.
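To make the freshness metric concrete, the following is a minimal sketch of how AoI is commonly tracked in discrete time. It is not the paper's system model: the slot-based update rule (every node ages by one slot, and a node whose data is collected resets to 1), the function names `step_aoi` and `average_aoi`, and the visiting schedule are all assumptions chosen for illustration.

```python
from typing import List, Optional


def step_aoi(aoi: List[int], collected: Optional[int]) -> List[int]:
    """Advance one time slot (assumed model, not taken from the paper).

    Every sensor node's AoI grows by 1; the node whose data the UAV
    collects in this slot (if any) resets to 1.
    """
    return [1 if i == collected else a + 1 for i, a in enumerate(aoi)]


def average_aoi(aoi: List[int]) -> float:
    """Mean AoI across all sensor nodes."""
    return sum(aoi) / len(aoi)


# Hypothetical example: three sensor nodes and a fixed visiting schedule.
# None means the UAV collects nothing in that slot (e.g., it is harvesting energy).
aoi = [1, 1, 1]
schedule = [0, None, 2, 1]
history = []
for visit in schedule:
    aoi = step_aoi(aoi, visit)
    history.append(average_aoi(aoi))
```

Under this assumed model, a slot spent without collecting data raises the AoI of every node. That is the kind of trade-off between energy harvesting and data collection that a learned policy such as MADQN would have to balance.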




Source journal

IEEE Transactions on Network and Service Management (Computer Science: Computer Networks and Communications)

CiteScore: 9.30

Self-citation rate: 15.10%

Articles published: 325

About the journal: IEEE Transactions on Network and Service Management will publish (online only) peer-reviewed archival-quality papers that advance the state-of-the-art and practical applications of network and service management. Theoretical research contributions (presenting new concepts and techniques) and applied contributions (reporting on experiences and experiments with actual systems) will be encouraged. These transactions will focus on the key technical issues related to: Management Models, Architectures and Frameworks; Service Provisioning, Reliability and Quality Assurance; Management Functions; Enabling Technologies; Information and Communication Models; Policies; Applications and Case Studies; Emerging Technologies and Standards.