{"title":"Robust Incremental Learning of Approximate Dynamic Programming for Nonlinear Optimal Guidance","authors":"Han Wang;Lin Cheng;Shengping Gong;Xu Huang","doi":"10.1109/TAES.2025.3525609","DOIUrl":null,"url":null,"abstract":"Existing nonlinear guidance methods are difficult to reconcile performance optimality with stability assurance. This study proposes a concept of robust incremental learning for approximate optimal control of nonlinear terminal guidance problems. It transitions incrementally and stably from a traditional analytically formulated guidance law to an approximate optimal guidance policy. Specifically, we propose an incremental policy iteration algorithm, where a base guidance law is utilized to mitigate the initial instability and warm-start the learning process. Then, a robustness enhancement technique is proposed to theoretically guarantee the stability of learning process, where the guidance command is refined leveraging a virtual Lyapunov-based energy function. As a result, a robust and efficient learning method for nonlinear optimal guidance problems is developed. Simulation results for a specific impact-angle-constrained guidance problem verify advantages of the proposed method on efficiency, stability, and optimality.","PeriodicalId":13157,"journal":{"name":"IEEE Transactions on Aerospace and Electronic Systems","volume":"61 3","pages":"6041-6052"},"PeriodicalIF":5.7000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Aerospace and Electronic Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10833863/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0
Abstract
Existing nonlinear guidance methods are difficult to reconcile performance optimality with stability assurance. This study proposes a concept of robust incremental learning for approximate optimal control of nonlinear terminal guidance problems. It transitions incrementally and stably from a traditional analytically formulated guidance law to an approximate optimal guidance policy. Specifically, we propose an incremental policy iteration algorithm, where a base guidance law is utilized to mitigate the initial instability and warm-start the learning process. Then, a robustness enhancement technique is proposed to theoretically guarantee the stability of learning process, where the guidance command is refined leveraging a virtual Lyapunov-based energy function. As a result, a robust and efficient learning method for nonlinear optimal guidance problems is developed. Simulation results for a specific impact-angle-constrained guidance problem verify advantages of the proposed method on efficiency, stability, and optimality.
期刊介绍:
IEEE Transactions on Aerospace and Electronic Systems focuses on the organization, design, development, integration, and operation of complex systems for space, air, ocean, or ground environment. These systems include, but are not limited to, navigation, avionics, spacecraft, aerospace power, radar, sonar, telemetry, defense, transportation, automated testing, and command and control.