Joint AoI-Aware UAVs Trajectory Planning and Data Collection in UAV-Based IoT Systems: A Deep Reinforcement Learning Approach

IF 10.9 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Consumer Electronics Pub Date : 2024-08-09 DOI:10.1109/TCE.2024.3440406

Xiongbing Xiao;Xiumin Wang;Weiwei Lin

{"title":"Joint AoI-Aware UAVs Trajectory Planning and Data Collection in UAV-Based IoT Systems: A Deep Reinforcement Learning Approach","authors":"Xiongbing Xiao;Xiumin Wang;Weiwei Lin","doi":"10.1109/TCE.2024.3440406","DOIUrl":null,"url":null,"abstract":"Unmanned Aerial Vehicles (UAVs) have recently received considerable attention in Internet of Things (IoT), because of their flexible deployment and extendable collection coverage. To collect data timely, the trajectory of the UAV should be intelligently planned. However, existing works mainly focus on the trajectory planning of a single UAV, ignoring the consideration of multiple UAVs. Although multiple UAVs greatly enhance the timeliness of data collection, they also pose challenges to UAVs collaboration and coordination. To address this issue, this paper formulates a joint multi-UAVs trajectory planning and data collection problem as a Mixed Integer Non-Linear Programming (MINLP), aiming at minimizing the Age of Information (AoI) and energy consumption. Due to the difficulty of the problem and the dynamic environment of IoT system, we reformulate it as a Markov Decision Process (MDP), and design a Deep Reinforcement Learning (DRL) approach to obtain the trajectory planning of the UAVs. Based on this, a deterministic data collection decision is made with a minimum cost bipartite matching in an auxiliary graph. Theoretical analysis shows that the designed deterministic data collection algorithm achieves the optimal data collection decision with the minimum weighted sum of the AoI and IoT devices’ energy consumption. Finally, simulations are conducted to confirm the efficiency of the proposed algorithms.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"70 4","pages":"6484-6495"},"PeriodicalIF":10.9000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10632079/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Unmanned Aerial Vehicles (UAVs) have recently received considerable attention in Internet of Things (IoT), because of their flexible deployment and extendable collection coverage. To collect data timely, the trajectory of the UAV should be intelligently planned. However, existing works mainly focus on the trajectory planning of a single UAV, ignoring the consideration of multiple UAVs. Although multiple UAVs greatly enhance the timeliness of data collection, they also pose challenges to UAVs collaboration and coordination. To address this issue, this paper formulates a joint multi-UAVs trajectory planning and data collection problem as a Mixed Integer Non-Linear Programming (MINLP), aiming at minimizing the Age of Information (AoI) and energy consumption. Due to the difficulty of the problem and the dynamic environment of IoT system, we reformulate it as a Markov Decision Process (MDP), and design a Deep Reinforcement Learning (DRL) approach to obtain the trajectory planning of the UAVs. Based on this, a deterministic data collection decision is made with a minimum cost bipartite matching in an auxiliary graph. Theoretical analysis shows that the designed deterministic data collection algorithm achieves the optimal data collection decision with the minimum weighted sum of the AoI and IoT devices’ energy consumption. Finally, simulations are conducted to confirm the efficiency of the proposed algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于无人机的物联网系统中的联合 AoI 感知无人机轨迹规划和数据收集：深度强化学习方法

近年来，无人机以其灵活部署和可扩展的采集覆盖范围在物联网领域受到广泛关注。为了及时收集数据，需要对无人机的飞行轨迹进行智能规划。然而，现有的工作主要集中在单架无人机的轨迹规划上，忽略了对多架无人机的考虑。虽然多架无人机极大地提高了数据采集的及时性，但也对无人机的协同和协调提出了挑战。针对这一问题，本文以最小化信息时代（Age of Information, AoI）和最小化能耗为目标，将多无人机联合轨迹规划和数据采集问题表述为混合整数非线性规划（MINLP）。考虑到问题的难度和物联网系统的动态环境，将其重新表述为马尔可夫决策过程（MDP），并设计了一种深度强化学习（DRL）方法来获得无人机的轨迹规划。在此基础上，在辅助图中以最小代价的二部匹配进行确定性的数据采集决策。理论分析表明，所设计的确定性数据采集算法以AoI与物联网设备能耗加权和最小为最优数据采集决策。最后通过仿真验证了所提算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Consumer Electronics 工程技术-电信学

CiteScore

7.70

自引率

9.30%

发文量

审稿时长

3.3 months

期刊介绍： The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.