{"title":"Multi-Agent Cooperative Attacker-Defender-Target Task Decision Based on PF-MADDPG","authors":"Maomao Zhao, Shaojie Zhang, Bin Jiang","doi":"10.1109/ISAS59543.2023.10164609","DOIUrl":null,"url":null,"abstract":"A novel potential function multi-agent deep deterministic policy gradient (PF-MADDPG) algorithm is proposed for the multi-agent Attacker-Defender-Target (ADT). A multi-agent continuous state space and a continuous action space are established. The potential function rewards of target and defenders are designed to accelerate the game confrontation training speed, and the MADDPG algorithm is utilized to obtain effective strategies, so as to describe the influence of different actions on attackers. Finally, simulations are given to verify the effectiveness of the proposed PF-MADDPG algorithm.","PeriodicalId":199115,"journal":{"name":"2023 6th International Symposium on Autonomous Systems (ISAS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Symposium on Autonomous Systems (ISAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISAS59543.2023.10164609","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A novel potential function multi-agent deep deterministic policy gradient (PF-MADDPG) algorithm is proposed for the multi-agent Attacker-Defender-Target (ADT). A multi-agent continuous state space and a continuous action space are established. The potential function rewards of target and defenders are designed to accelerate the game confrontation training speed, and the MADDPG algorithm is utilized to obtain effective strategies, so as to describe the influence of different actions on attackers. Finally, simulations are given to verify the effectiveness of the proposed PF-MADDPG algorithm.