Integrated entry guidance with no-fly zone constraint using reinforcement learning and predictor-corrector technique

Yuan Gao, Rui Zhou, Jinyong Chen
{"title":"Integrated entry guidance with no-fly zone constraint using reinforcement learning and predictor-corrector technique","authors":"Yuan Gao, Rui Zhou, Jinyong Chen","doi":"10.1177/09544100241236995","DOIUrl":null,"url":null,"abstract":"This paper presents an integrated entry guidance law for hypersonic glide vehicles with no-fly zone constraint. Existing methods that employ predictor-corrector technique and lateral guidance logic for both guidance and avoidance, may have limitations in response time and maneuverability when facing sudden threats, because the guidance cycle is limited by computational efficiency and the bank angle magnitude cannot be adjusted according to the urgency of the avoidance. To overcome these challenges, the proposed method divides the entry process into safe flight stages and no-fly zone avoidance stages, and introduces reinforcement learning to develop an intelligent avoidance strategy for the latter. This division reduces the complexity of the learning problem by restricting the state space and increases the applicability in the presence of multiple no-fly zones. The trained avoidance strategy can directly output continuous bank angle command through a single forward calculation, considering both guidance and avoidance requirements. This enables the full utilization of the vehicle’s maneuverability and supports a high command update frequency to effectively handle threats. Additionally, a network trained via supervised learning is employed to generate reference commands, accelerating the training convergence of reinforcement learning. Simulation results demonstrate the effectiveness of the proposed guidance law, highlighting its high computational efficiency, command stability, and robustness. Importantly, the approach offers convenience in extending to multiple no-fly zones and accommodating vast initial state spaces.","PeriodicalId":54566,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers Part G-Journal of Aerospace Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/09544100241236995","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents an integrated entry guidance law for hypersonic glide vehicles with no-fly zone constraint. Existing methods that employ predictor-corrector technique and lateral guidance logic for both guidance and avoidance, may have limitations in response time and maneuverability when facing sudden threats, because the guidance cycle is limited by computational efficiency and the bank angle magnitude cannot be adjusted according to the urgency of the avoidance. To overcome these challenges, the proposed method divides the entry process into safe flight stages and no-fly zone avoidance stages, and introduces reinforcement learning to develop an intelligent avoidance strategy for the latter. This division reduces the complexity of the learning problem by restricting the state space and increases the applicability in the presence of multiple no-fly zones. The trained avoidance strategy can directly output continuous bank angle command through a single forward calculation, considering both guidance and avoidance requirements. This enables the full utilization of the vehicle’s maneuverability and supports a high command update frequency to effectively handle threats. Additionally, a network trained via supervised learning is employed to generate reference commands, accelerating the training convergence of reinforcement learning. Simulation results demonstrate the effectiveness of the proposed guidance law, highlighting its high computational efficiency, command stability, and robustness. Importantly, the approach offers convenience in extending to multiple no-fly zones and accommodating vast initial state spaces.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用强化学习和预测校正器技术进行具有禁飞区约束的综合入境引导
本文提出了一种具有禁飞区约束的高超音速滑翔飞行器综合进入制导法。由于制导周期受限于计算效率,且无法根据规避的紧迫性调整倾角大小,因此在面对突发威胁时,采用预测-修正技术和横向制导逻辑进行制导和规避的现有方法在响应时间和机动性方面可能存在局限性。为了克服这些挑战,所提出的方法将进入过程分为安全飞行阶段和禁飞区规避阶段,并引入强化学习为后者制定智能规避策略。这种划分通过限制状态空间降低了学习问题的复杂性,并提高了在存在多个禁飞区时的适用性。经过训练的避让策略可以通过一次前向计算直接输出连续的倾角指令,同时考虑制导和避让要求。这样就能充分利用飞行器的机动性,并支持较高的指令更新频率,从而有效应对各种威胁。此外,还采用了通过监督学习训练的网络来生成参考指令,从而加快了强化学习的训练收敛速度。仿真结果证明了所提出的制导法则的有效性,突出了它的高计算效率、指令稳定性和鲁棒性。重要的是,该方法在扩展到多个禁飞区和适应广阔的初始状态空间方面提供了便利。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.40
自引率
18.20%
发文量
212
审稿时长
5.7 months
期刊介绍: The Journal of Aerospace Engineering is dedicated to the publication of high quality research in all branches of applied sciences and technology dealing with aircraft and spacecraft, and their support systems. "Our authorship is truly international and all efforts are made to ensure that each paper is presented in the best possible way and reaches a wide audience. "The Editorial Board is composed of recognized experts representing the technical communities of fifteen countries. The Board Members work in close cooperation with the editors, reviewers, and authors to achieve a consistent standard of well written and presented papers."Professor Rodrigo Martinez-Val, Universidad Politécnica de Madrid, Spain This journal is a member of the Committee on Publication Ethics (COPE).
期刊最新文献
Fatigue life analysis of a composite materials structure using allowable strain criteria Feasibility study of carbon-fiber reinforced polymer linerless pressure vessel tank Testability modeling of aeroengine and analysis optimization method based on improved correlation matrix Research on a backstepping flight control method improved by STFT in atmospheric disturbance applications Evaluating the effect of frigate hangar shape modifications on helicopter recovery using piloted flight simulation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1