{"title":"A Cooperative Guidance Law for Multiple Missiles based on Reinforcement Learning","authors":"Hongxu Chen, Jianglong Yu, Xiwang Dong","doi":"10.1109/ICUS55513.2022.9986718","DOIUrl":null,"url":null,"abstract":"The traditional proportional guidance law lacks the limitation of time and field of view. In order to realize the coordinated attack of multiple missiles on targets and improve the attack efficiency, a reinforcement learning cooperative guidance law based on deep deterministic policy gradient descent neural network is proposed. According to the particularity of guidance process, the reinforcement learning agent is obtained by constructing state space, action space and reward function training. The simulation results show that the enhanced learning guidance law can strike maneuvering targets simultaneously and satisfy the field of view constraint, which is superior to the traditional cooperative proportional guidance law.","PeriodicalId":345773,"journal":{"name":"2022 IEEE International Conference on Unmanned Systems (ICUS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Unmanned Systems (ICUS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUS55513.2022.9986718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The traditional proportional guidance law lacks the limitation of time and field of view. In order to realize the coordinated attack of multiple missiles on targets and improve the attack efficiency, a reinforcement learning cooperative guidance law based on deep deterministic policy gradient descent neural network is proposed. According to the particularity of guidance process, the reinforcement learning agent is obtained by constructing state space, action space and reward function training. The simulation results show that the enhanced learning guidance law can strike maneuvering targets simultaneously and satisfy the field of view constraint, which is superior to the traditional cooperative proportional guidance law.