Pub Date : 2024-07-31DOI: 10.1007/s12555-023-0467-7
Hongwu Ye, Tingting Gao, Yong Zhou, Jiangshuai Huang
In this paper, tracking control of a class of series elastic actuators subject to unknown time-varying input delay and additive disturbances is investigated. To address the input delay, a novel predictor-like method is proposed in the control input which uses a predictor to compensate for the delay. Lyapunov-Krasovskii functions are applied within Lyapunov-based stability analysis to show semi-globally uniformly ultimately bounded tracking errors. The control scheme is extended to the case that the input delay is unknown and time-varying. A constant estimate of the delay is determined to establish uniformly ultimately bounded convergence of the tracking error. Numerical simulation results illustrate the performance of the developed robust controller.
{"title":"Predictor-based Tracking Control of a Class of Series Elastic Actuators With Input Delay","authors":"Hongwu Ye, Tingting Gao, Yong Zhou, Jiangshuai Huang","doi":"10.1007/s12555-023-0467-7","DOIUrl":"https://doi.org/10.1007/s12555-023-0467-7","url":null,"abstract":"<p>In this paper, tracking control of a class of series elastic actuators subject to unknown time-varying input delay and additive disturbances is investigated. To address the input delay, a novel predictor-like method is proposed in the control input which uses a predictor to compensate for the delay. Lyapunov-Krasovskii functions are applied within Lyapunov-based stability analysis to show semi-globally uniformly ultimately bounded tracking errors. The control scheme is extended to the case that the input delay is unknown and time-varying. A constant estimate of the delay is determined to establish uniformly ultimately bounded convergence of the tracking error. Numerical simulation results illustrate the performance of the developed robust controller.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"6 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-31DOI: 10.1007/s12555-023-0695-x
Hyuntae Kim, Hamin Chang, Hyungbo Shim
This paper addresses the challenge of data-driven control of nonlinear systems, focusing on the limitations and capabilities of model reference Gaussian process regression (MR-GPR) and its evolved counterpart, model reference neural networks (MR-NN). MR-GPR, based on Gaussian processes renowned for their adaptability to diverse data structures, encounters scalability issues especially when handling large datasets. To address these limitations, this paper introduces MR-NN, an extension of MR-GPR, leveraging neural networks (NN) to manage large datasets and capture complex nonlinear dynamics effectively. We present a comprehensive evaluation of both methods through a classical control problem of the inverted pendulum, a system well-recognized for its nonlinear behavior. Numerical experiments are conducted to compare the methods in terms of control performance, computational efficiency, and reliability.
{"title":"Evaluating MR-GPR and MR-NN: An Exploration of Data-driven Control Methods for Nonlinear Systems","authors":"Hyuntae Kim, Hamin Chang, Hyungbo Shim","doi":"10.1007/s12555-023-0695-x","DOIUrl":"https://doi.org/10.1007/s12555-023-0695-x","url":null,"abstract":"<p>This paper addresses the challenge of data-driven control of nonlinear systems, focusing on the limitations and capabilities of model reference Gaussian process regression (MR-GPR) and its evolved counterpart, model reference neural networks (MR-NN). MR-GPR, based on Gaussian processes renowned for their adaptability to diverse data structures, encounters scalability issues especially when handling large datasets. To address these limitations, this paper introduces MR-NN, an extension of MR-GPR, leveraging neural networks (NN) to manage large datasets and capture complex nonlinear dynamics effectively. We present a comprehensive evaluation of both methods through a classical control problem of the inverted pendulum, a system well-recognized for its nonlinear behavior. Numerical experiments are conducted to compare the methods in terms of control performance, computational efficiency, and reliability.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"42 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-31DOI: 10.1007/s12555-023-0025-3
Zhihui Wu, Guo-Ping Liu, Jun Hu, Hui Yu, Dongyan Chen
This paper mainly discusses the impacts from the time delay of advertising and the reference price of consumer onto the cooperative advertising decisions. In our model, both the retailer’s platform goodwill (RPG) and the manufacturer’s brand goodwill (MBG) involved with their own advertising efforts are modeled by the delay differential equations. In addition, the RPG and the MBG have positive effects on the consumer’s reference price. The equilibrium advertising strategies of two channel members are firstly derived within the centralized and decentralized scenarios by resorting to the optimal control theory. Besides, the corresponding equilibrium advertising strategies and optimal channel profits are compared, which shows that the centralization brings higher profit to whole channel only when the time delays locate within a specified geometric region and the existence of time delay caused by manufacturer’s advertising also implies smaller advertising participation rate provided by the retailer. Subsequently, a new mechanism is designed to coordinate the supply chain system in the decentralized case under the consignment mode. Finally, a numerical example is employed to illustrate the impacts from key system parameters onto the coordination results when the supply chain is coordinated.
{"title":"Optimal Control Strategies for a Supply Chain System With Delay and Reference Price Effects","authors":"Zhihui Wu, Guo-Ping Liu, Jun Hu, Hui Yu, Dongyan Chen","doi":"10.1007/s12555-023-0025-3","DOIUrl":"https://doi.org/10.1007/s12555-023-0025-3","url":null,"abstract":"<p>This paper mainly discusses the impacts from the time delay of advertising and the reference price of consumer onto the cooperative advertising decisions. In our model, both the retailer’s platform goodwill (RPG) and the manufacturer’s brand goodwill (MBG) involved with their own advertising efforts are modeled by the delay differential equations. In addition, the RPG and the MBG have positive effects on the consumer’s reference price. The equilibrium advertising strategies of two channel members are firstly derived within the centralized and decentralized scenarios by resorting to the optimal control theory. Besides, the corresponding equilibrium advertising strategies and optimal channel profits are compared, which shows that the centralization brings higher profit to whole channel only when the time delays locate within a specified geometric region and the existence of time delay caused by manufacturer’s advertising also implies smaller advertising participation rate provided by the retailer. Subsequently, a new mechanism is designed to coordinate the supply chain system in the decentralized case under the consignment mode. Finally, a numerical example is employed to illustrate the impacts from key system parameters onto the coordination results when the supply chain is coordinated.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"54 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141865191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-022-0927-5
Chaoqian Qiao, Weiwei Sun, Qi Zhang
The robust control problem of permanent magnet synchronous motor (PMSM) is discussed in this paper. Both iron loss and voltage saturation are considered in the analysis process in order to close to the actual projects. A controller design method, which is called the Hamiltonian-based method, is proposed for the PMSM control system. The responses of each current component and speed are analyzed when load torque disturbance exists. The proposed controller enables the closed-loop system to have robust performance criteria without violating saturation. It is worth noting that a more general truncation-inequality technique is used to deal with saturation, which reduces the conservatism of parameter selection. In addition, the corresponding results are given for two special cases. When there is no saturation and no load torque disturbance, the system is asymptotically stable at the equilibrium point. Simulation results and experimental study show that the proposed method has good load disturbance suppression ability and can ensure the robust performance of the PMSM in EV driving systems.
{"title":"A Hamiltonian-based Method for PMSM in Electric Vehicle Considering Iron Loss and Saturation","authors":"Chaoqian Qiao, Weiwei Sun, Qi Zhang","doi":"10.1007/s12555-022-0927-5","DOIUrl":"https://doi.org/10.1007/s12555-022-0927-5","url":null,"abstract":"<p>The robust control problem of permanent magnet synchronous motor (PMSM) is discussed in this paper. Both iron loss and voltage saturation are considered in the analysis process in order to close to the actual projects. A controller design method, which is called the Hamiltonian-based method, is proposed for the PMSM control system. The responses of each current component and speed are analyzed when load torque disturbance exists. The proposed controller enables the closed-loop system to have robust performance criteria without violating saturation. It is worth noting that a more general truncation-inequality technique is used to deal with saturation, which reduces the conservatism of parameter selection. In addition, the corresponding results are given for two special cases. When there is no saturation and no load torque disturbance, the system is asymptotically stable at the equilibrium point. Simulation results and experimental study show that the proposed method has good load disturbance suppression ability and can ensure the robust performance of the PMSM in EV driving systems.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"126 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0588-z
Tareg Mohammed, Espen Oland, Lorenzo Fagiano
A fault-tolerant control approach is proposed, for a pumping airborne wind energy system (AWES) comprising a tethered fixed-wing aircraft with integrated propellers for vertical take-off and landing (VTOL). First, the flight control design for the traction phase of the system, when the tethered aircraft has to fly in loops using the rudder, is presented. Then, the presence of the propellers, that are normally not used in the traction phase, is exploited to obtain a fault tolerant controller in case of rudder malfunctioning. The approach detects a possible discrete control surface fault and compensates for the loss in actuation by using the VTOL system. A sophisticated model of the system is used to analyse the performance of the proposed technique. The main finding is that the approach is able to handle abrupt rudder faults with high tolerance.
{"title":"Fault Tolerant Flight Control for the Traction Phase of Pumping Airborne Wind Energy Systems","authors":"Tareg Mohammed, Espen Oland, Lorenzo Fagiano","doi":"10.1007/s12555-023-0588-z","DOIUrl":"https://doi.org/10.1007/s12555-023-0588-z","url":null,"abstract":"<p>A fault-tolerant control approach is proposed, for a pumping airborne wind energy system (AWES) comprising a tethered fixed-wing aircraft with integrated propellers for vertical take-off and landing (VTOL). First, the flight control design for the traction phase of the system, when the tethered aircraft has to fly in loops using the rudder, is presented. Then, the presence of the propellers, that are normally not used in the traction phase, is exploited to obtain a fault tolerant controller in case of rudder malfunctioning. The approach detects a possible discrete control surface fault and compensates for the loss in actuation by using the VTOL system. A sophisticated model of the system is used to analyse the performance of the proposed technique. The main finding is that the approach is able to handle abrupt rudder faults with high tolerance.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"25 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0230-0
Yingqi Zhang, Qing Zhang, Caixia Liu
This paper investigates the event-triggered quantized stabilization of uncertain neutral systems. Specifically, based on the suitable Lyapunov functional and relax quadratic function negative determination property, event-triggered quantized controllers and other variable parameters are co-designed, allowing neutral network systems with or without uncertainties to stabilize asymptotically via variable separation approaches. Additionally, the feasible criteria are characterized as convex optimization problems. An example demonstrates the effectiveness and less conservativeness of the proposed method.
{"title":"Event-triggered Quantized Stabilization for Uncertain Neutral Systems","authors":"Yingqi Zhang, Qing Zhang, Caixia Liu","doi":"10.1007/s12555-023-0230-0","DOIUrl":"https://doi.org/10.1007/s12555-023-0230-0","url":null,"abstract":"<p>This paper investigates the event-triggered quantized stabilization of uncertain neutral systems. Specifically, based on the suitable Lyapunov functional and relax quadratic function negative determination property, event-triggered quantized controllers and other variable parameters are co-designed, allowing neutral network systems with or without uncertainties to stabilize asymptotically via variable separation approaches. Additionally, the feasible criteria are characterized as convex optimization problems. An example demonstrates the effectiveness and less conservativeness of the proposed method.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"12 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0511-7
Shenglong Xie, Wenyuan Liu, Shiyuan Bian
Iterative learning control (ILC) has been considered as a promising alternative for the control of pneumatic muscle actuator (PMA). However, this controller suffers from a challenge that it is difficult to deal with the complex nonlinear characteristics of PMA. To solve this problem, a novel iterative learning model predictive control (ILMPC) approach, by utilizing the data-driven model, is designed and analyzed in this article. Firstly, the dynamics of PMA is converted into Takagi-Sugeno (T-S) fuzzy nonlinear auto-regression with exogenous inputs (NARX) model, and the differential evolution (DE) estimation algorithm is applied to estimate parameters of the NARX model by utilizing the input and output data. Secondly, the controller of ILMPC is designed and the convergence performance of the controller is verified through theoretical analysis. Finally, the capability of this control method is confirmed via experimental study. Experimental results demonstrate that the proposed ILMPC can achieve satisfactory tracking control and exhibits robustness against load varying.
{"title":"Data-driven Iterative Learning Model Predictive Control for Pneumatic Muscle Actuators","authors":"Shenglong Xie, Wenyuan Liu, Shiyuan Bian","doi":"10.1007/s12555-023-0511-7","DOIUrl":"https://doi.org/10.1007/s12555-023-0511-7","url":null,"abstract":"<p>Iterative learning control (ILC) has been considered as a promising alternative for the control of pneumatic muscle actuator (PMA). However, this controller suffers from a challenge that it is difficult to deal with the complex nonlinear characteristics of PMA. To solve this problem, a novel iterative learning model predictive control (ILMPC) approach, by utilizing the data-driven model, is designed and analyzed in this article. Firstly, the dynamics of PMA is converted into Takagi-Sugeno (T-S) fuzzy nonlinear auto-regression with exogenous inputs (NARX) model, and the differential evolution (DE) estimation algorithm is applied to estimate parameters of the NARX model by utilizing the input and output data. Secondly, the controller of ILMPC is designed and the convergence performance of the controller is verified through theoretical analysis. Finally, the capability of this control method is confirmed via experimental study. Experimental results demonstrate that the proposed ILMPC can achieve satisfactory tracking control and exhibits robustness against load varying.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"177 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0135-y
Guanglei Zhao, Gaoge Dai, Bingkang Peng, Hailong Cui
This article investigates a heterogeneous vehicular platoon control problem, in which prescribed tracking performance can be achieved, it is assumed that the vehicle subjects to asymmetric nonlinear actuator saturation, dead-zone nonlinearity and actuator faults. Based on improved exponential spacing policy, an adaptive fault-tolerant sliding-mode control scheme is proposed to guarantee individual vehicle stability, string stability and traffic flow stability. The hypothesis that the spacing errors at initial time are zero is removed by employing the novel exponential spacing policy. Furthermore, to attenuate the harmful effects of actuator faults, saturation and dead-zone nonlinearity, a compensation system based on radial basis neural network (RBFNN) is established. Finally, the effectiveness of the proposed control scheme is verified by simulation results.
{"title":"Adaptive Fault-tolerant Control of Platoons With Prescribed Tracking Performance","authors":"Guanglei Zhao, Gaoge Dai, Bingkang Peng, Hailong Cui","doi":"10.1007/s12555-023-0135-y","DOIUrl":"https://doi.org/10.1007/s12555-023-0135-y","url":null,"abstract":"<p>This article investigates a heterogeneous vehicular platoon control problem, in which prescribed tracking performance can be achieved, it is assumed that the vehicle subjects to asymmetric nonlinear actuator saturation, dead-zone nonlinearity and actuator faults. Based on improved exponential spacing policy, an adaptive fault-tolerant sliding-mode control scheme is proposed to guarantee individual vehicle stability, string stability and traffic flow stability. The hypothesis that the spacing errors at initial time are zero is removed by employing the novel exponential spacing policy. Furthermore, to attenuate the harmful effects of actuator faults, saturation and dead-zone nonlinearity, a compensation system based on radial basis neural network (RBFNN) is established. Finally, the effectiveness of the proposed control scheme is verified by simulation results.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"14 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0110-7
Hui-Min Zhong, Tao Han, Bo Xiao, Xi-Sheng Zhan, Huaicheng Yan
In this article, a predefined-time control framework is presented to accomplish the leader-following consensus for Euler-Lagrange (EL) systems, in which a dynamical leader is considered and only a subset of the followers can obtain the information from leader. In order to address the aforementioned problem, a predefined-time control algorithm has been proposed. To be specific, a set of distributed estimators are firstly established to acquire the estimated states of the leader at a predefined time. Then, by applying the designed controller based on sliding surface control, the predefined-time leader-following consensus (PTLFC) problem has been addressed. Moreover, based on Lyapunov argument and input to state stability criterion, sufficient conditions for predefined-time stability are derived. Finally, a large number of simulation examples are given to illustrate the validity of the main results.
{"title":"Predefined-time Leader-following Consensus for Networked Euler-Lagrange Systems With External Disturbance","authors":"Hui-Min Zhong, Tao Han, Bo Xiao, Xi-Sheng Zhan, Huaicheng Yan","doi":"10.1007/s12555-023-0110-7","DOIUrl":"https://doi.org/10.1007/s12555-023-0110-7","url":null,"abstract":"<p>In this article, a predefined-time control framework is presented to accomplish the leader-following consensus for Euler-Lagrange (EL) systems, in which a dynamical leader is considered and only a subset of the followers can obtain the information from leader. In order to address the aforementioned problem, a predefined-time control algorithm has been proposed. To be specific, a set of distributed estimators are firstly established to acquire the estimated states of the leader at a predefined time. Then, by applying the designed controller based on sliding surface control, the predefined-time leader-following consensus (PTLFC) problem has been addressed. Moreover, based on Lyapunov argument and input to state stability criterion, sufficient conditions for predefined-time stability are derived. Finally, a large number of simulation examples are given to illustrate the validity of the main results.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"2 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141515992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s12555-023-0460-1
Jianfeng Wang, Yan Wang, Zhicheng Ji
This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discrete-time systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.
{"title":"Adaptive Optimal Control for Input-constrained Nonlinear Discrete-time System With Stage Cost Learning","authors":"Jianfeng Wang, Yan Wang, Zhicheng Ji","doi":"10.1007/s12555-023-0460-1","DOIUrl":"https://doi.org/10.1007/s12555-023-0460-1","url":null,"abstract":"<p>This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discrete-time systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"46 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141502944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}