Game of Drones: Intelligent Online Decision Making of Multi-UAV Confrontation

IF 5.3 3区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Emerging Topics in Computational Intelligence Pub Date : 2024-02-13 DOI:10.1109/TETCI.2024.3360282

Da Liu;Qun Zong;Xiuyun Zhang;Ruilong Zhang;Liqian Dou;Bailing Tian

{"title":"Game of Drones: Intelligent Online Decision Making of Multi-UAV Confrontation","authors":"Da Liu;Qun Zong;Xiuyun Zhang;Ruilong Zhang;Liqian Dou;Bailing Tian","doi":"10.1109/TETCI.2024.3360282","DOIUrl":null,"url":null,"abstract":"Due to the characteristics of the small size and low cost of unmanned aerial vehicles (UAVs), Multi-UAV confrontation will play an important role in future wars. The Multi-UAV confrontation game in the air combat environment is investigated in this paper. To truly deduce the confrontation scene, a physics engine is established based on the Multi-UAV Confrontation Scenario (MCS) framework, enabling the real-time interaction between the agent and environment while making the learned strategies more realistic. To form an effective confrontation strategy, the Graph Attention Multi-agent Soft Actor Critic Reinforcement Learning with Target Predicting Network (GA-MASAC-TP Net) is firstly proposed for Multi-UAV confrontation game. The merits lie in that the Multi-UAV trajectory prediction, considering interactions among targets, is incorporated innovatively into the Multi-agent reinforcement learning (MARL), enabling Multi-UAVs to make decisions more accurately based on situation prediction. Specifically, the Soft Actor Critic (SAC) algorithm is extended to the Multi-agent domain and embed with the graph attention neural network into the Actor, Critic network, so the UAV could aggregate the information of the spatial neighbor teammates based on the attention mechanism for better collaboration. The comparative experiment and ablation study demonstrate the effectiveness of the proposed algorithm and the state-of-art performance in the MCS.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":5.3000,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10433866/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Due to the characteristics of the small size and low cost of unmanned aerial vehicles (UAVs), Multi-UAV confrontation will play an important role in future wars. The Multi-UAV confrontation game in the air combat environment is investigated in this paper. To truly deduce the confrontation scene, a physics engine is established based on the Multi-UAV Confrontation Scenario (MCS) framework, enabling the real-time interaction between the agent and environment while making the learned strategies more realistic. To form an effective confrontation strategy, the Graph Attention Multi-agent Soft Actor Critic Reinforcement Learning with Target Predicting Network (GA-MASAC-TP Net) is firstly proposed for Multi-UAV confrontation game. The merits lie in that the Multi-UAV trajectory prediction, considering interactions among targets, is incorporated innovatively into the Multi-agent reinforcement learning (MARL), enabling Multi-UAVs to make decisions more accurately based on situation prediction. Specifically, the Soft Actor Critic (SAC) algorithm is extended to the Multi-agent domain and embed with the graph attention neural network into the Actor, Critic network, so the UAV could aggregate the information of the spatial neighbor teammates based on the attention mechanism for better collaboration. The comparative experiment and ablation study demonstrate the effectiveness of the proposed algorithm and the state-of-art performance in the MCS.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

无人机游戏：多无人机对抗的智能在线决策

由于无人机体积小、成本低的特点，多无人机对抗将在未来战争中扮演重要角色。本文研究了空战环境下的多无人机对抗博弈。为了真实推演对抗场景，本文建立了基于多无人机对抗场景（MCS）框架的物理引擎，实现了代理与环境的实时交互，同时使学习到的策略更加逼真。为了形成有效的对抗策略，首先提出了针对多无人机对抗博弈的图注意多代理软代理批评强化学习与目标预测网络（GA-MASAC-TP Net）。其优点在于创新性地将考虑目标间相互作用的多无人机轨迹预测纳入多代理强化学习（MARL），使多无人机能够根据情况预测做出更准确的决策。具体而言，将软行动者批判（Soft Actor Critic，SAC）算法扩展到多机器人领域，并将图注意神经网络嵌入到行动者、批判网络中，使无人机可以基于注意机制聚合空间相邻队友的信息，以实现更好的协作。对比实验和消融研究证明了所提算法的有效性，以及在 MCS 中的先进性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Emerging Topics in Computational Intelligence Mathematics-Control and Optimization

CiteScore

10.30

自引率

7.50%

发文量

147

期刊介绍： The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys. TETCI is an electronics only publication. TETCI publishes six issues per year. Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.

期刊最新文献

Table of Contents IEEE Computational Intelligence Society Information IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information A Novel Multi-Source Information Fusion Method Based on Dependency Interval