{"title":"利用强化学习和预测校正器技术进行具有禁飞区约束的综合入境引导","authors":"Yuan Gao, Rui Zhou, Jinyong Chen","doi":"10.1177/09544100241236995","DOIUrl":null,"url":null,"abstract":"This paper presents an integrated entry guidance law for hypersonic glide vehicles with no-fly zone constraint. Existing methods that employ predictor-corrector technique and lateral guidance logic for both guidance and avoidance, may have limitations in response time and maneuverability when facing sudden threats, because the guidance cycle is limited by computational efficiency and the bank angle magnitude cannot be adjusted according to the urgency of the avoidance. To overcome these challenges, the proposed method divides the entry process into safe flight stages and no-fly zone avoidance stages, and introduces reinforcement learning to develop an intelligent avoidance strategy for the latter. This division reduces the complexity of the learning problem by restricting the state space and increases the applicability in the presence of multiple no-fly zones. The trained avoidance strategy can directly output continuous bank angle command through a single forward calculation, considering both guidance and avoidance requirements. This enables the full utilization of the vehicle’s maneuverability and supports a high command update frequency to effectively handle threats. Additionally, a network trained via supervised learning is employed to generate reference commands, accelerating the training convergence of reinforcement learning. Simulation results demonstrate the effectiveness of the proposed guidance law, highlighting its high computational efficiency, command stability, and robustness. Importantly, the approach offers convenience in extending to multiple no-fly zones and accommodating vast initial state spaces.","PeriodicalId":54566,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","volume":"225 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrated entry guidance with no-fly zone constraint using reinforcement learning and predictor-corrector technique\",\"authors\":\"Yuan Gao, Rui Zhou, Jinyong Chen\",\"doi\":\"10.1177/09544100241236995\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an integrated entry guidance law for hypersonic glide vehicles with no-fly zone constraint. Existing methods that employ predictor-corrector technique and lateral guidance logic for both guidance and avoidance, may have limitations in response time and maneuverability when facing sudden threats, because the guidance cycle is limited by computational efficiency and the bank angle magnitude cannot be adjusted according to the urgency of the avoidance. To overcome these challenges, the proposed method divides the entry process into safe flight stages and no-fly zone avoidance stages, and introduces reinforcement learning to develop an intelligent avoidance strategy for the latter. This division reduces the complexity of the learning problem by restricting the state space and increases the applicability in the presence of multiple no-fly zones. The trained avoidance strategy can directly output continuous bank angle command through a single forward calculation, considering both guidance and avoidance requirements. This enables the full utilization of the vehicle’s maneuverability and supports a high command update frequency to effectively handle threats. Additionally, a network trained via supervised learning is employed to generate reference commands, accelerating the training convergence of reinforcement learning. Simulation results demonstrate the effectiveness of the proposed guidance law, highlighting its high computational efficiency, command stability, and robustness. Importantly, the approach offers convenience in extending to multiple no-fly zones and accommodating vast initial state spaces.\",\"PeriodicalId\":54566,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"volume\":\"225 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/09544100241236995\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, AEROSPACE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/09544100241236995","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
Integrated entry guidance with no-fly zone constraint using reinforcement learning and predictor-corrector technique
This paper presents an integrated entry guidance law for hypersonic glide vehicles with no-fly zone constraint. Existing methods that employ predictor-corrector technique and lateral guidance logic for both guidance and avoidance, may have limitations in response time and maneuverability when facing sudden threats, because the guidance cycle is limited by computational efficiency and the bank angle magnitude cannot be adjusted according to the urgency of the avoidance. To overcome these challenges, the proposed method divides the entry process into safe flight stages and no-fly zone avoidance stages, and introduces reinforcement learning to develop an intelligent avoidance strategy for the latter. This division reduces the complexity of the learning problem by restricting the state space and increases the applicability in the presence of multiple no-fly zones. The trained avoidance strategy can directly output continuous bank angle command through a single forward calculation, considering both guidance and avoidance requirements. This enables the full utilization of the vehicle’s maneuverability and supports a high command update frequency to effectively handle threats. Additionally, a network trained via supervised learning is employed to generate reference commands, accelerating the training convergence of reinforcement learning. Simulation results demonstrate the effectiveness of the proposed guidance law, highlighting its high computational efficiency, command stability, and robustness. Importantly, the approach offers convenience in extending to multiple no-fly zones and accommodating vast initial state spaces.
期刊介绍:
The Journal of Aerospace Engineering is dedicated to the publication of high quality research in all branches of applied sciences and technology dealing with aircraft and spacecraft, and their support systems. "Our authorship is truly international and all efforts are made to ensure that each paper is presented in the best possible way and reaches a wide audience.
"The Editorial Board is composed of recognized experts representing the technical communities of fifteen countries. The Board Members work in close cooperation with the editors, reviewers, and authors to achieve a consistent standard of well written and presented papers."Professor Rodrigo Martinez-Val, Universidad Politécnica de Madrid, Spain
This journal is a member of the Committee on Publication Ethics (COPE).