Cooperation With Humans of Unknown Intentions in Confined Spaces Using the Stackelberg Friend-or-Foe Game

IF 5.7 2区计算机科学 Q1 ENGINEERING, AEROSPACE IEEE Transactions on Aerospace and Electronic Systems Pub Date : 2024-12-24 DOI:10.1109/TAES.2024.3521935

Xiaofeng Zhao;Hanyao Hu;Dengfeng Sun

{"title":"Cooperation With Humans of Unknown Intentions in Confined Spaces Using the Stackelberg Friend-or-Foe Game","authors":"Xiaofeng Zhao;Hanyao Hu;Dengfeng Sun","doi":"10.1109/TAES.2024.3521935","DOIUrl":null,"url":null,"abstract":"In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"5814-5825"},"PeriodicalIF":5.7000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10813372/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}

引用次数: 0

Abstract

In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用Stackelberg朋友或敌人游戏在密闭空间中与意图不明的人类合作

在受限的室内空间中，对无人机与人类的合作行为进行有效建模对于避开移动障碍物和解决僵局情况至关重要。然而，移动障碍物的不可预测性和人类意图的不确定性给自主导航带来了重大挑战。在这项工作中，我们通过将问题表述为“想象的朋友或敌人游戏”来解决这个问题，其中无人机将人类视为朋友，将移动障碍物视为敌人。我们引入Stackelberg友敌多智能体深度确定性策略梯度算法，以减轻循环，加速收敛，并通过信息优势提高性能。在多智能体深度确定性策略梯度框架的基础上，提出了基于奖励塑造的端到端学习架构，使无人机能够基于局部信息与意图未知的人类进行合作。我们在狭窄的室内场景中对我们提出的算法和架构进行了经验评估，证明Stackelberg友敌深度确定性策略梯度算法改善了死锁缓解，并且优于基线算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Aerospace and Electronic Systems 工程技术-电信学

CiteScore

7.80

自引率

13.60%

发文量

433

审稿时长

8.7 months

期刊介绍： IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.