Cooperation With Humans of Unknown Intentions in Confined Spaces Using the Stackelberg Friend-or-Foe Game

IF 5.7 2区 计算机科学 Q1 ENGINEERING, AEROSPACE IEEE Transactions on Aerospace and Electronic Systems Pub Date : 2024-12-24 DOI:10.1109/TAES.2024.3521935
Xiaofeng Zhao;Hanyao Hu;Dengfeng Sun
{"title":"Cooperation With Humans of Unknown Intentions in Confined Spaces Using the Stackelberg Friend-or-Foe Game","authors":"Xiaofeng Zhao;Hanyao Hu;Dengfeng Sun","doi":"10.1109/TAES.2024.3521935","DOIUrl":null,"url":null,"abstract":"In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"5814-5825"},"PeriodicalIF":5.7000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10813372/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0

Abstract

In confined indoor spaces, effectively modeling the cooperative behavior of uncrewed aerial vehicles (UAVs) with humans is critical to avoid moving obstacles and resolve deadlock situations. However, the unpredictable nature of the moving obstacles and uncertainty of human intentions pose significant challenges to autonomous navigation. In this work, we address this issue by formulating the problem as the “Imagined Friend-or-Foe Game,” where the UAV considers humans as friends and moving obstacles as foes. We introduce the Stackelberg friend-or-foe multiagent deep deterministic policy gradient algorithm to mitigate cycling, accelerate convergence, and enhance performance through the information advantage. Built upon the multiagent deep deterministic policy gradient framework, the proposed end-to-end learning architecture with reward shaping enables the UAV to cooperate with humans of unknown intentions based on local information. We empirically evaluate our proposed algorithm and architecture in narrow indoor scenarios, demonstrating that the Stackelberg friend-or-foe deep deterministic policy gradient algorithm improves deadlock relief and outperforms baseline algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用Stackelberg朋友或敌人游戏在密闭空间中与意图不明的人类合作
在受限的室内空间中,对无人机与人类的合作行为进行有效建模对于避开移动障碍物和解决僵局情况至关重要。然而,移动障碍物的不可预测性和人类意图的不确定性给自主导航带来了重大挑战。在这项工作中,我们通过将问题表述为“想象的朋友或敌人游戏”来解决这个问题,其中无人机将人类视为朋友,将移动障碍物视为敌人。我们引入Stackelberg友敌多智能体深度确定性策略梯度算法,以减轻循环,加速收敛,并通过信息优势提高性能。在多智能体深度确定性策略梯度框架的基础上,提出了基于奖励塑造的端到端学习架构,使无人机能够基于局部信息与意图未知的人类进行合作。我们在狭窄的室内场景中对我们提出的算法和架构进行了经验评估,证明Stackelberg友敌深度确定性策略梯度算法改善了死锁缓解,并且优于基线算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.80
自引率
13.60%
发文量
433
审稿时长
8.7 months
期刊介绍: IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.
期刊最新文献
Trajectory Prediction of Dynamic UAV Swarm with Interaction and Quantity Uncertainty under Saturation Attack Mission MMOF-AGM: Multi-Model Adversarial Attack for Security Assessment of Neural Network-Based Radar Signal Recognition Event-Triggered Distributed Formation Tracking of Cooperative Quadrotors Subject to Performance Constraints and Leader Uncertainty Improving Single-Frequency PPP with Multilayer Perceptron-based Ionospheric Delay Model Integration of an ANN-Based Variable Speed Integral and Adaptive Control Algorithm for Gas Flow Regulation in Solid Ducted Rockets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1