{"title":"基于pf - madpg的多智能体协同攻击防御目标任务决策","authors":"Maomao Zhao, Shaojie Zhang, Bin Jiang","doi":"10.1109/ISAS59543.2023.10164609","DOIUrl":null,"url":null,"abstract":"A novel potential function multi-agent deep deterministic policy gradient (PF-MADDPG) algorithm is proposed for the multi-agent Attacker-Defender-Target (ADT). A multi-agent continuous state space and a continuous action space are established. The potential function rewards of target and defenders are designed to accelerate the game confrontation training speed, and the MADDPG algorithm is utilized to obtain effective strategies, so as to describe the influence of different actions on attackers. Finally, simulations are given to verify the effectiveness of the proposed PF-MADDPG algorithm.","PeriodicalId":199115,"journal":{"name":"2023 6th International Symposium on Autonomous Systems (ISAS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Agent Cooperative Attacker-Defender-Target Task Decision Based on PF-MADDPG\",\"authors\":\"Maomao Zhao, Shaojie Zhang, Bin Jiang\",\"doi\":\"10.1109/ISAS59543.2023.10164609\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A novel potential function multi-agent deep deterministic policy gradient (PF-MADDPG) algorithm is proposed for the multi-agent Attacker-Defender-Target (ADT). A multi-agent continuous state space and a continuous action space are established. The potential function rewards of target and defenders are designed to accelerate the game confrontation training speed, and the MADDPG algorithm is utilized to obtain effective strategies, so as to describe the influence of different actions on attackers. Finally, simulations are given to verify the effectiveness of the proposed PF-MADDPG algorithm.\",\"PeriodicalId\":199115,\"journal\":{\"name\":\"2023 6th International Symposium on Autonomous Systems (ISAS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 6th International Symposium on Autonomous Systems (ISAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISAS59543.2023.10164609\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Symposium on Autonomous Systems (ISAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISAS59543.2023.10164609","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Agent Cooperative Attacker-Defender-Target Task Decision Based on PF-MADDPG
A novel potential function multi-agent deep deterministic policy gradient (PF-MADDPG) algorithm is proposed for the multi-agent Attacker-Defender-Target (ADT). A multi-agent continuous state space and a continuous action space are established. The potential function rewards of target and defenders are designed to accelerate the game confrontation training speed, and the MADDPG algorithm is utilized to obtain effective strategies, so as to describe the influence of different actions on attackers. Finally, simulations are given to verify the effectiveness of the proposed PF-MADDPG algorithm.