Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li
{"title":"协同多智能体深度强化学习在战争游戏中的应用","authors":"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li","doi":"10.1145/3446132.3446137","DOIUrl":null,"url":null,"abstract":"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Coordinating Multi-Agent Deep Reinforcement Learning in Wargame\",\"authors\":\"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li\",\"doi\":\"10.1145/3446132.3446137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.\",\"PeriodicalId\":125388,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3446132.3446137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3446132.3446137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Coordinating Multi-Agent Deep Reinforcement Learning in Wargame
The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.