{"title":"带阶段成本学习的输入受限非线性离散时间系统的自适应最优控制","authors":"Jianfeng Wang, Yan Wang, Zhicheng Ji","doi":"10.1007/s12555-023-0460-1","DOIUrl":null,"url":null,"abstract":"<p>This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discrete-time systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"46 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Optimal Control for Input-constrained Nonlinear Discrete-time System With Stage Cost Learning\",\"authors\":\"Jianfeng Wang, Yan Wang, Zhicheng Ji\",\"doi\":\"10.1007/s12555-023-0460-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discrete-time systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.</p>\",\"PeriodicalId\":54965,\"journal\":{\"name\":\"International Journal of Control Automation and Systems\",\"volume\":\"46 1\",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Control Automation and Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s12555-023-0460-1\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Control Automation and Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12555-023-0460-1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Adaptive Optimal Control for Input-constrained Nonlinear Discrete-time System With Stage Cost Learning
This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discrete-time systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.
期刊介绍:
International Journal of Control, Automation and Systems is a joint publication of the Institute of Control, Robotics and Systems (ICROS) and the Korean Institute of Electrical Engineers (KIEE).
The journal covers three closly-related research areas including control, automation, and systems.
The technical areas include
Control Theory
Control Applications
Robotics and Automation
Intelligent and Information Systems
The Journal addresses research areas focused on control, automation, and systems in electrical, mechanical, aerospace, chemical, and industrial engineering in order to create a strong synergy effect throughout the interdisciplinary research areas.