{"title":"An online single-network adaptive algorithm for continuous-time nonlinear optimal control","authors":"Jae Young Lee, Jin Bae Park, Y. Choi, Keun Uk Lee","doi":"10.1109/ICCAS.2013.6704205","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.","PeriodicalId":415263,"journal":{"name":"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAS.2013.6704205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.