协同多智能体深度强化学习在战争游戏中的应用

Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence Pub Date : 2020-12-24 DOI:10.1145/3446132.3446137

Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li

{"title":"协同多智能体深度强化学习在战争游戏中的应用","authors":"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li","doi":"10.1145/3446132.3446137","DOIUrl":null,"url":null,"abstract":"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Coordinating Multi-Agent Deep Reinforcement Learning in Wargame\",\"authors\":\"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li\",\"doi\":\"10.1145/3446132.3446137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.\",\"PeriodicalId\":125388,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3446132.3446137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3446132.3446137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

深度强化学习在《星际争霸II》等即时战略游戏中的成功应用，激发了人们将多智能体深度强化学习(MADRL)应用到更多领域。在兵棋模拟领域，多采用六角形地图进行模拟，已不能适应兵棋模拟的快速发展。在连续空间的兵棋推演中，我们构建了一个包含多架飞机和舰船的舰艇防御场景。我们将深度Q网络(DQN)方法应用于MADRL, CNN提取多实体特征，并采用集中式和分布式的决策训练架构控制飞机固定翼模块组件。实验结果证明了所提公式的有效性，表明基于cnn的特征提取模型能够有效击败内置的多级规则机器人，并且基于cnn的训练效果优于全连接的特征提取方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Coordinating Multi-Agent Deep Reinforcement Learning in Wargame

The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence

自引率

0.00%

发文量