通过自适应动态编程实现具有半马尔可夫跳跃参数的未知非线性系统的最优控制

IF 8.7 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Systems Man Cybernetics-Systems Pub Date : 2024-07-23 DOI:10.1109/TSMC.2024.3421658

Huaguang Zhang;Lulu Zhang;Jiayue Sun;Tianbiao Wang

{"title":"通过自适应动态编程实现具有半马尔可夫跳跃参数的未知非线性系统的最优控制","authors":"Huaguang Zhang;Lulu Zhang;Jiayue Sun;Tianbiao Wang","doi":"10.1109/TSMC.2024.3421658","DOIUrl":null,"url":null,"abstract":"This article investigates the optimal control problem for the discrete-time (DT) nonlinear semi-Markovian jump systems (s-MJSs) that possess unknown dynamics. The study uses the semi-Markovian kernel approach to address the problem of mode-switching in these systems. This approach employs the transition probability and the sojourn-time distribution function to jointly determine the transitions between different modes. Then, with a neural network (NN) identifier, the demand for accurate information on the system dynamics is eliminated, and an optimal control method for the nonlinear s-MJSs is utilized to solve the Hamilton-Jacobi–Bellman equation (HJBE) built upon adaptive dynamic programming methodology. Additionally, a detailed analysis of the convergence of a value iteration-based algorithm, which solves the optimal control issue for the DT s-MJSs, is thoroughly discussed. Furthermore, an actor-critic NN is trained to attain an estimated solution to the relevant HJBE. Finally, to validate the designed approach, two simulations are performed to prove its effectiveness.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"54 10","pages":"6255-6264"},"PeriodicalIF":8.7000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal Control for Unknown Nonlinear System With Semi-Markovian Jump Parameters via Adaptive Dynamic Programming\",\"authors\":\"Huaguang Zhang;Lulu Zhang;Jiayue Sun;Tianbiao Wang\",\"doi\":\"10.1109/TSMC.2024.3421658\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates the optimal control problem for the discrete-time (DT) nonlinear semi-Markovian jump systems (s-MJSs) that possess unknown dynamics. The study uses the semi-Markovian kernel approach to address the problem of mode-switching in these systems. This approach employs the transition probability and the sojourn-time distribution function to jointly determine the transitions between different modes. Then, with a neural network (NN) identifier, the demand for accurate information on the system dynamics is eliminated, and an optimal control method for the nonlinear s-MJSs is utilized to solve the Hamilton-Jacobi–Bellman equation (HJBE) built upon adaptive dynamic programming methodology. Additionally, a detailed analysis of the convergence of a value iteration-based algorithm, which solves the optimal control issue for the DT s-MJSs, is thoroughly discussed. Furthermore, an actor-critic NN is trained to attain an estimated solution to the relevant HJBE. Finally, to validate the designed approach, two simulations are performed to prove its effectiveness.\",\"PeriodicalId\":48915,\"journal\":{\"name\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"volume\":\"54 10\",\"pages\":\"6255-6264\"},\"PeriodicalIF\":8.7000,\"publicationDate\":\"2024-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10607965/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10607965/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了具有未知动态的离散时间（DT）非线性半马尔可夫跃迁系统（s-MJS）的最优控制问题。研究采用半马尔可夫核方法来解决这些系统中的模式切换问题。这种方法利用转换概率和逗留时间分布函数来共同确定不同模式之间的转换。然后，利用神经网络（NN）识别器，消除了对系统动态精确信息的需求，并利用非线性 s-MJS 的最优控制方法，在自适应动态编程方法的基础上求解汉密尔顿-雅各比-贝尔曼方程（HJBE）。此外，还详细分析了基于值迭代的算法的收敛性，该算法解决了 DT s-MJS 的优化控制问题。此外，还训练了一个行为批评 NN，以获得相关 HJBE 的估计解。最后，为了验证所设计的方法，我们进行了两次模拟，以证明其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optimal Control for Unknown Nonlinear System With Semi-Markovian Jump Parameters via Adaptive Dynamic Programming

This article investigates the optimal control problem for the discrete-time (DT) nonlinear semi-Markovian jump systems (s-MJSs) that possess unknown dynamics. The study uses the semi-Markovian kernel approach to address the problem of mode-switching in these systems. This approach employs the transition probability and the sojourn-time distribution function to jointly determine the transitions between different modes. Then, with a neural network (NN) identifier, the demand for accurate information on the system dynamics is eliminated, and an optimal control method for the nonlinear s-MJSs is utilized to solve the Hamilton-Jacobi–Bellman equation (HJBE) built upon adaptive dynamic programming methodology. Additionally, a detailed analysis of the convergence of a value iteration-based algorithm, which solves the optimal control issue for the DT s-MJSs, is thoroughly discussed. Furthermore, an actor-critic NN is trained to attain an estimated solution to the relevant HJBE. Finally, to validate the designed approach, two simulations are performed to prove its effectiveness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Systems Man Cybernetics-Systems AUTOMATION & CONTROL SYSTEMS-COMPUTER SCIENCE, CYBERNETICS

CiteScore

18.50

自引率

11.50%

发文量

812

审稿时长

6 months

期刊介绍： The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.

期刊最新文献

Introducing IEEE Collabratec IEEE Systems, Man, and Cybernetics Society Information TechRxiv: Share Your Preprint Research With the World! Reinforcement Learning-Based Optimized Adaptive Secure Control for Constrained Fractional-Order Nonlinear Systems Under FDI Attacks Learning Multilayer Feature Projection for Homogeneous and Heterogeneous Palmprint Recognition