{"title":"Takagi-Sugeno模糊系统的最优输出反馈跟踪控制","authors":"Wenting Song;Shaocheng Tong","doi":"10.1109/TAI.2024.3443004","DOIUrl":null,"url":null,"abstract":"In this study, an optimal output feedback tracking control approach with a Q-learning algorithm is presented for Takagi–Sugeno (T–S) fuzzy discrete-time systems with immeasurable states. First, a state reconstruction method based on the measured output data and input data is applied to handle immeasurable states problem. Then, the optimal output feedback tracking control input policy is designed and boiled down to the algebraic Riccati equations (AREs). To obtain the solution to AREs, a Q-learning value iteration (VI) algorithm is formulated, which directly learns each state-action value. Consequently, the sufficient conditions for the convergence of the proposed optimal algorithm are derived by constructing an approximate Q-function. It is proved that the presented optimal output feedback tracking control method can guarantee the controlled systems to be stable and output track the given reference signal. Finally, we take the truck-trailer system as the simulation example, the simulation results validate feasibility of the presented optimal control methodology.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6320-6329"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal Output Feedback Tracking Control for Takagi–Sugeno Fuzzy Systems\",\"authors\":\"Wenting Song;Shaocheng Tong\",\"doi\":\"10.1109/TAI.2024.3443004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, an optimal output feedback tracking control approach with a Q-learning algorithm is presented for Takagi–Sugeno (T–S) fuzzy discrete-time systems with immeasurable states. First, a state reconstruction method based on the measured output data and input data is applied to handle immeasurable states problem. Then, the optimal output feedback tracking control input policy is designed and boiled down to the algebraic Riccati equations (AREs). To obtain the solution to AREs, a Q-learning value iteration (VI) algorithm is formulated, which directly learns each state-action value. Consequently, the sufficient conditions for the convergence of the proposed optimal algorithm are derived by constructing an approximate Q-function. It is proved that the presented optimal output feedback tracking control method can guarantee the controlled systems to be stable and output track the given reference signal. Finally, we take the truck-trailer system as the simulation example, the simulation results validate feasibility of the presented optimal control methodology.\",\"PeriodicalId\":73305,\"journal\":{\"name\":\"IEEE transactions on artificial intelligence\",\"volume\":\"5 12\",\"pages\":\"6320-6329\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on artificial intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10634553/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10634553/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
针对状态不可测的Takagi-Sugeno (T-S)模糊离散系统,提出了一种基于q学习算法的最优输出反馈跟踪控制方法。首先,采用一种基于测量输出数据和输入数据的状态重构方法来处理状态不可测问题。然后,设计了最优输出反馈跟踪控制输入策略,并将其归结为代数Riccati方程(AREs)。为了得到AREs的解,提出了一种q -学习值迭代(Q-learning value iteration, VI)算法,该算法直接学习每个状态-动作值。因此,通过构造一个近似的q函数,得到了该最优算法收敛的充分条件。实验证明,所提出的最优输出反馈跟踪控制方法能够保证被控系统稳定,输出跟踪给定参考信号。最后,以汽车挂车系统为例进行了仿真,仿真结果验证了所提出的最优控制方法的可行性。
Optimal Output Feedback Tracking Control for Takagi–Sugeno Fuzzy Systems
In this study, an optimal output feedback tracking control approach with a Q-learning algorithm is presented for Takagi–Sugeno (T–S) fuzzy discrete-time systems with immeasurable states. First, a state reconstruction method based on the measured output data and input data is applied to handle immeasurable states problem. Then, the optimal output feedback tracking control input policy is designed and boiled down to the algebraic Riccati equations (AREs). To obtain the solution to AREs, a Q-learning value iteration (VI) algorithm is formulated, which directly learns each state-action value. Consequently, the sufficient conditions for the convergence of the proposed optimal algorithm are derived by constructing an approximate Q-function. It is proved that the presented optimal output feedback tracking control method can guarantee the controlled systems to be stable and output track the given reference signal. Finally, we take the truck-trailer system as the simulation example, the simulation results validate feasibility of the presented optimal control methodology.