{"title":"Cooperation With Humans of Unknown Intentions in Confined Spaces Using the Stackelberg Friend-or-Foe Game","authors":"Xiaofeng Zhao;Hanyao Hu;Dengfeng Sun","doi":"10.1109/TAES.2024.3521935","DOIUrl":null,"url":null,"abstract":"In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"5814-5825"},"PeriodicalIF":5.7000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10813372/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0
Abstract
In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.
期刊介绍:
IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.