利用改进的 MADRL 进行多 UAV 协同机动决策以实现追击-规避

IF 5 Q1 ENGINEERING, MULTIDISCIPLINARY Defence Technology(防务技术) Pub Date : 2024-05-01 DOI:10.1016/j.dt.2023.11.013
Delin Luo , Zihao Fan , Ziyi Yang , Yang Xu
{"title":"利用改进的 MADRL 进行多 UAV 协同机动决策以实现追击-规避","authors":"Delin Luo ,&nbsp;Zihao Fan ,&nbsp;Ziyi Yang ,&nbsp;Yang Xu","doi":"10.1016/j.dt.2023.11.013","DOIUrl":null,"url":null,"abstract":"<div><p>Aiming at the problem of multi-UAV pursuit-evasion confrontation, a UAV cooperative maneuver method based on an improved multi-agent deep reinforcement learning (MADRL) is proposed. In this method, an improved CommNet network based on a communication mechanism is introduced into a deep reinforcement learning algorithm to solve the multi-agent problem. A layer of gated recurrent unit (GRU) is added to the actor-network structure to remember historical environmental states. Subsequently, another GRU is designed as a communication channel in the CommNet core network layer to refine communication information between UAVs. Finally, the simulation results of the algorithm in two sets of scenarios are given, and the results show that the method has good effectiveness and applicability.</p></div>","PeriodicalId":58209,"journal":{"name":"Defence Technology(防务技术)","volume":"35 ","pages":"Pages 187-197"},"PeriodicalIF":5.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S221491472300301X/pdfft?md5=f29cfb1a4d800c0646b4bec364fb1a5e&pid=1-s2.0-S221491472300301X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Multi-UAV cooperative maneuver decision-making for pursuit-evasion using improved MADRL\",\"authors\":\"Delin Luo ,&nbsp;Zihao Fan ,&nbsp;Ziyi Yang ,&nbsp;Yang Xu\",\"doi\":\"10.1016/j.dt.2023.11.013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Aiming at the problem of multi-UAV pursuit-evasion confrontation, a UAV cooperative maneuver method based on an improved multi-agent deep reinforcement learning (MADRL) is proposed. In this method, an improved CommNet network based on a communication mechanism is introduced into a deep reinforcement learning algorithm to solve the multi-agent problem. A layer of gated recurrent unit (GRU) is added to the actor-network structure to remember historical environmental states. Subsequently, another GRU is designed as a communication channel in the CommNet core network layer to refine communication information between UAVs. Finally, the simulation results of the algorithm in two sets of scenarios are given, and the results show that the method has good effectiveness and applicability.</p></div>\",\"PeriodicalId\":58209,\"journal\":{\"name\":\"Defence Technology(防务技术)\",\"volume\":\"35 \",\"pages\":\"Pages 187-197\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S221491472300301X/pdfft?md5=f29cfb1a4d800c0646b4bec364fb1a5e&pid=1-s2.0-S221491472300301X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Defence Technology(防务技术)\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S221491472300301X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Defence Technology(防务技术)","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221491472300301X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

针对多无人机追逐-规避对抗问题,提出了一种基于改进的多代理深度强化学习(MADRL)的无人机协同机动方法。在该方法中,基于通信机制的改进 CommNet 网络被引入到深度强化学习算法中,以解决多代理问题。在行动者网络结构中添加了一层门控递归单元(GRU),用于记忆历史环境状态。随后,在 CommNet 核心网络层中设计了另一个 GRU 作为通信通道,以完善无人机之间的通信信息。最后,给出了该算法在两组场景中的仿真结果,结果表明该方法具有良好的有效性和适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multi-UAV cooperative maneuver decision-making for pursuit-evasion using improved MADRL

Aiming at the problem of multi-UAV pursuit-evasion confrontation, a UAV cooperative maneuver method based on an improved multi-agent deep reinforcement learning (MADRL) is proposed. In this method, an improved CommNet network based on a communication mechanism is introduced into a deep reinforcement learning algorithm to solve the multi-agent problem. A layer of gated recurrent unit (GRU) is added to the actor-network structure to remember historical environmental states. Subsequently, another GRU is designed as a communication channel in the CommNet core network layer to refine communication information between UAVs. Finally, the simulation results of the algorithm in two sets of scenarios are given, and the results show that the method has good effectiveness and applicability.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Defence Technology(防务技术)
Defence Technology(防务技术) Mechanical Engineering, Control and Systems Engineering, Industrial and Manufacturing Engineering
CiteScore
8.70
自引率
0.00%
发文量
728
审稿时长
25 days
期刊介绍: Defence Technology, a peer reviewed journal, is published monthly and aims to become the best international academic exchange platform for the research related to defence technology. It publishes original research papers having direct bearing on defence, with a balanced coverage on analytical, experimental, numerical simulation and applied investigations. It covers various disciplines of science, technology and engineering.
期刊最新文献
IFC - Editorial Board Analysis model for damage of reinforced bars in RC beams under contact explosion Modelling of internal ballistics of gun systems: A review A tensile wearable SHF antenna with efficient communication in defense beacon technology An isogeometric analysis approach for dynamic response of doubly-curved magneto electro elastic composite shallow shell subjected to blast loading
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1