协同多智能体深度强化学习在战争游戏中的应用

Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li
{"title":"协同多智能体深度强化学习在战争游戏中的应用","authors":"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li","doi":"10.1145/3446132.3446137","DOIUrl":null,"url":null,"abstract":"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Coordinating Multi-Agent Deep Reinforcement Learning in Wargame\",\"authors\":\"Yanghui Fu, Xingxing Liang, Yang Ma, Kuihua Huang, Yan Li\",\"doi\":\"10.1145/3446132.3446137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.\",\"PeriodicalId\":125388,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3446132.3446137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3446132.3446137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

深度强化学习在《星际争霸II》等即时战略游戏中的成功应用,激发了人们将多智能体深度强化学习(MADRL)应用到更多领域。在兵棋模拟领域,多采用六角形地图进行模拟,已不能适应兵棋模拟的快速发展。在连续空间的兵棋推演中,我们构建了一个包含多架飞机和舰船的舰艇防御场景。我们将深度Q网络(DQN)方法应用于MADRL, CNN提取多实体特征,并采用集中式和分布式的决策训练架构控制飞机固定翼模块组件。实验结果证明了所提公式的有效性,表明基于cnn的特征提取模型能够有效击败内置的多级规则机器人,并且基于cnn的训练效果优于全连接的特征提取方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Coordinating Multi-Agent Deep Reinforcement Learning in Wargame
The successful application of deep reinforcement learning in RTS games such as StarCraft II has inspired people to apply multi-agent deep reinforcement learning(MADRL) to more fields. In the field of wargame, hexagonal maps are often used for simulation, which can't adapt to the rapid development of wargame. In continuous space of wargame, we construct a ship-defense scenario that includes multiple aircraft and ships. We apply deep Q network(DQN) method to MADRL, CNN to extract the features of multiple entities, and a centralized and distributed decision-making training architecture to control the aircraft's fixed-wing module components. Experiment results demonstrate the effectiveness of the proposed formulation, which show that the CNN-based feature extraction model can effectively defeat the built-in rule bot with multiple levels, and the training effect of CNN-based is better than the feature extraction method by full connection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Lane Detection Combining Details and Integrity: an Advanced Method for Lane Detection The Cat's Eye Effect Target Recognition Method Based on deep convolutional neural network Leveraging Different Context for Response Generation through Topic-guided Multi-head Attention Siamese Multiplicative LSTM for Semantic Text Similarity Multi-constrained Vehicle Routing Problem Solution based on Adaptive Genetic Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1