Meta-ETI: Meta-Reinforcement Learning With Explicit Task Inference for AAV-IoT Coverage

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Internet of Things Journal Pub Date : 2025-03-21 DOI:10.1109/JIOT.2025.3553808

Songjun Huang;Chuanneng Sun;Dario Pompili

{"title":"Meta-ETI: Meta-Reinforcement Learning With Explicit Task Inference for AAV-IoT Coverage","authors":"Songjun Huang;Chuanneng Sun;Dario Pompili","doi":"10.1109/JIOT.2025.3553808","DOIUrl":null,"url":null,"abstract":"To better enhance the network service for different user devices in various scenarios, autonomous aerial vehicles (AAVs) are increasingly used as aerial base stations (ABSs). However, optimizing coverage for user devices via AAV team control is an NP-hard problem and escalates exponentially in complexity with the growing number of user devices. To address this challenge, researchers have turned to reinforcement learning (RL) for a more practical solution. With the growing prevalence of the Internet of Things (IoT), the diversity of user devices increases, posing challenges for traditional RL, as 1) the spatial distribution of devices becomes more complex; 2) variations in device types and device mobility increase the training latency; 3) the high-speed movement of IoT devices can lead to performance deterioration in widely used RL algorithms with discrete action space; and 4) traditional RL struggles to adapt to new environments. To solve these problems, we propose a new meta-RL framework, Meta-RL with explicit task inference (Meta-ETI). Then, we apply this framework to efficiently train an energy-efficient AAV control policy for fair and effective coverage in 3-D dynamic environments. Meta-ETI is evaluated in both theoretical and application-related aspects and demonstrates superior performance compared to the baseline frameworks. The result shows that Meta-ETI demonstrates 2–3 times faster adaptation speed and a decent performance in sample efficiency. Furthermore, in the AAV-IoT coverage application, Meta-ETI shows 30%–50% better in energy efficiency and 40%–60% more served devices because of the fair coverage.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 13","pages":"23852-23865"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10937077/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

To better enhance the network service for different user devices in various scenarios, autonomous aerial vehicles (AAVs) are increasingly used as aerial base stations (ABSs). However, optimizing coverage for user devices via AAV team control is an NP-hard problem and escalates exponentially in complexity with the growing number of user devices. To address this challenge, researchers have turned to reinforcement learning (RL) for a more practical solution. With the growing prevalence of the Internet of Things (IoT), the diversity of user devices increases, posing challenges for traditional RL, as 1) the spatial distribution of devices becomes more complex; 2) variations in device types and device mobility increase the training latency; 3) the high-speed movement of IoT devices can lead to performance deterioration in widely used RL algorithms with discrete action space; and 4) traditional RL struggles to adapt to new environments. To solve these problems, we propose a new meta-RL framework, Meta-RL with explicit task inference (Meta-ETI). Then, we apply this framework to efficiently train an energy-efficient AAV control policy for fair and effective coverage in 3-D dynamic environments. Meta-ETI is evaluated in both theoretical and application-related aspects and demonstrates superior performance compared to the baseline frameworks. The result shows that Meta-ETI demonstrates 2–3 times faster adaptation speed and a decent performance in sample efficiency. Furthermore, in the AAV-IoT coverage application, Meta-ETI shows 30%–50% better in energy efficiency and 40%–60% more served devices because of the fair coverage.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Meta-ETI：无人机-物联网覆盖的元强化学习与显式任务推理

为了更好地增强不同场景下不同用户设备的网络服务，自动飞行器（autonomous aerial vehicle, aav）越来越多地被用作空中基站（aerial base station, abs）。然而，通过AAV团队控制优化用户设备的覆盖是一个np难题，并且随着用户设备数量的增加，复杂性呈指数级增长。为了应对这一挑战，研究人员转向强化学习（RL）寻求更实用的解决方案。随着物联网（IoT）的日益普及，用户设备的多样性增加，对传统RL提出了挑战：1)设备的空间分布变得更加复杂；2)设备类型和设备移动性的变化增加了训练延迟；3)物联网设备的高速移动会导致广泛使用的具有离散动作空间的强化学习算法的性能下降；4)传统强化学习难以适应新环境。为了解决这些问题，我们提出了一个新的元rl框架，即带有显式任务推理的元rl （Meta-ETI）。然后，我们应用该框架有效地训练了一个节能的AAV控制策略，以在三维动态环境中实现公平有效的覆盖。Meta-ETI在理论和应用相关方面进行了评估，与基线框架相比，表现出优越的性能。结果表明，Meta-ETI的自适应速度提高了2-3倍，在样本效率方面表现良好。此外，在AAV-IoT覆盖应用中，由于覆盖公平，Meta-ETI的能源效率提高了30%-50%，服务设备增加了40%-60%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.