{"title":"具有不等式约束的非线性系统的自适应庞特里亚金最大原则的并发学习","authors":"Bin Zhang, Yuqi Zhang, Yingmin Jia","doi":"10.1002/rnc.7630","DOIUrl":null,"url":null,"abstract":"<p>In this article, a finite-horizon adaptive Pontryagin's maximum principle is presented for nonlinear systems with state inequality constraints. Concurrent learning (CL) technique is adopted to identify the unknown parameters of the dynamic systems. Based on the identification model, a novel adaptive iterative algorithm under the Pontryagin's framework is introduced to learn the finite-horizon optimal control solution. Convergence analysis of the algorithm is provided by showing that the cost function sequence is monotonically decreasing. Furthermore, we extend the adaptive iterative algorithm to time-varying nonlinear systems. The new algorithm overcomes the technical obstacles of the existing adaptive/approximate dynamic programming (ADP) approaches to deal with the time-varying characteristic of Hamilton–Jacobi–Bellman (HJB) partial differential equation (PDE), especially when state constraints exist. Simulation examples are carried out to validate the effectiveness of the theoretical results.</p>","PeriodicalId":50291,"journal":{"name":"International Journal of Robust and Nonlinear Control","volume":"34 18","pages":"12431-12449"},"PeriodicalIF":3.2000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Concurrent learning for adaptive pontryagin's maximum principle of nonlinear systems with inequality constraints\",\"authors\":\"Bin Zhang, Yuqi Zhang, Yingmin Jia\",\"doi\":\"10.1002/rnc.7630\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In this article, a finite-horizon adaptive Pontryagin's maximum principle is presented for nonlinear systems with state inequality constraints. Concurrent learning (CL) technique is adopted to identify the unknown parameters of the dynamic systems. Based on the identification model, a novel adaptive iterative algorithm under the Pontryagin's framework is introduced to learn the finite-horizon optimal control solution. Convergence analysis of the algorithm is provided by showing that the cost function sequence is monotonically decreasing. Furthermore, we extend the adaptive iterative algorithm to time-varying nonlinear systems. The new algorithm overcomes the technical obstacles of the existing adaptive/approximate dynamic programming (ADP) approaches to deal with the time-varying characteristic of Hamilton–Jacobi–Bellman (HJB) partial differential equation (PDE), especially when state constraints exist. Simulation examples are carried out to validate the effectiveness of the theoretical results.</p>\",\"PeriodicalId\":50291,\"journal\":{\"name\":\"International Journal of Robust and Nonlinear Control\",\"volume\":\"34 18\",\"pages\":\"12431-12449\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Robust and Nonlinear Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7630\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robust and Nonlinear Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7630","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Concurrent learning for adaptive pontryagin's maximum principle of nonlinear systems with inequality constraints
In this article, a finite-horizon adaptive Pontryagin's maximum principle is presented for nonlinear systems with state inequality constraints. Concurrent learning (CL) technique is adopted to identify the unknown parameters of the dynamic systems. Based on the identification model, a novel adaptive iterative algorithm under the Pontryagin's framework is introduced to learn the finite-horizon optimal control solution. Convergence analysis of the algorithm is provided by showing that the cost function sequence is monotonically decreasing. Furthermore, we extend the adaptive iterative algorithm to time-varying nonlinear systems. The new algorithm overcomes the technical obstacles of the existing adaptive/approximate dynamic programming (ADP) approaches to deal with the time-varying characteristic of Hamilton–Jacobi–Bellman (HJB) partial differential equation (PDE), especially when state constraints exist. Simulation examples are carried out to validate the effectiveness of the theoretical results.
期刊介绍:
Papers that do not include an element of robust or nonlinear control and estimation theory will not be considered by the journal, and all papers will be expected to include significant novel content. The focus of the journal is on model based control design approaches rather than heuristic or rule based methods. Papers on neural networks will have to be of exceptional novelty to be considered for the journal.