基于近似动态规划的状态约束非线性系统离散最优控制

IF 3.2 3区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS International Journal of Robust and Nonlinear Control Pub Date : 2024-10-20 DOI:10.1002/rnc.7685

Shijie Song, Dawei Gong, Minglei Zhu, Yuyang Zhao

{"title":"基于近似动态规划的状态约束非线性系统离散最优控制","authors":"Shijie Song, Dawei Gong, Minglei Zhu, Yuyang Zhao","doi":"10.1002/rnc.7685","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This article investigates the optimal control problem (OCP) for a class of discrete-time nonlinear systems with state constraints. First, to overcome the challenge caused by the constraints, the original constrained OCP is transformed into an unconstrained OCP by utilizing the system transformation technique. Second, a new cost function is designed to alleviate the effect of system transformation on the optimality of the original system. Further, a novel off-policy deterministic approximate dynamic programming (ADP) scheme is developed to obtain a near-optimal solution for the transformed OCP. Compared to existing off-policy deterministic ADP schemes, the developed scheme relaxes the requirement on the learning data and saves computing resources from the perspective of training neural networks. Third, considering approximation errors, we analyze the convergence and stability of the developed ADP scheme. Finally, the developed ADP with the designed cost function is tested in two numerical cases, and simulation results confirm its effectiveness.</p>\n </div>","PeriodicalId":50291,"journal":{"name":"International Journal of Robust and Nonlinear Control","volume":"35 3","pages":"858-871"},"PeriodicalIF":3.2000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discrete-Time Optimal Control of State-Constrained Nonlinear Systems Using Approximate Dynamic Programming\",\"authors\":\"Shijie Song, Dawei Gong, Minglei Zhu, Yuyang Zhao\",\"doi\":\"10.1002/rnc.7685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>This article investigates the optimal control problem (OCP) for a class of discrete-time nonlinear systems with state constraints. First, to overcome the challenge caused by the constraints, the original constrained OCP is transformed into an unconstrained OCP by utilizing the system transformation technique. Second, a new cost function is designed to alleviate the effect of system transformation on the optimality of the original system. Further, a novel off-policy deterministic approximate dynamic programming (ADP) scheme is developed to obtain a near-optimal solution for the transformed OCP. Compared to existing off-policy deterministic ADP schemes, the developed scheme relaxes the requirement on the learning data and saves computing resources from the perspective of training neural networks. Third, considering approximation errors, we analyze the convergence and stability of the developed ADP scheme. Finally, the developed ADP with the designed cost function is tested in two numerical cases, and simulation results confirm its effectiveness.</p>\\n </div>\",\"PeriodicalId\":50291,\"journal\":{\"name\":\"International Journal of Robust and Nonlinear Control\",\"volume\":\"35 3\",\"pages\":\"858-871\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Robust and Nonlinear Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7685\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robust and Nonlinear Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7685","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

研究了一类具有状态约束的离散非线性系统的最优控制问题。首先，为了克服约束带来的挑战，利用系统变换技术将原有的有约束OCP转化为无约束OCP。其次，设计了一个新的成本函数，以减轻系统转换对原系统最优性的影响。在此基础上，提出了一种新的非策略确定性近似动态规划（ADP）方法，以获得变换后的OCP的近最优解。与现有的脱策略确定性ADP方案相比，从训练神经网络的角度出发，该方案降低了对学习数据的要求，节省了计算资源。第三，考虑逼近误差，分析了所提出的ADP方案的收敛性和稳定性。最后，利用所设计的代价函数对所开发的ADP进行了两个数值实例的测试，仿真结果验证了其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Discrete-Time Optimal Control of State-Constrained Nonlinear Systems Using Approximate Dynamic Programming

This article investigates the optimal control problem (OCP) for a class of discrete-time nonlinear systems with state constraints. First, to overcome the challenge caused by the constraints, the original constrained OCP is transformed into an unconstrained OCP by utilizing the system transformation technique. Second, a new cost function is designed to alleviate the effect of system transformation on the optimality of the original system. Further, a novel off-policy deterministic approximate dynamic programming (ADP) scheme is developed to obtain a near-optimal solution for the transformed OCP. Compared to existing off-policy deterministic ADP schemes, the developed scheme relaxes the requirement on the learning data and saves computing resources from the perspective of training neural networks. Third, considering approximation errors, we analyze the convergence and stability of the developed ADP scheme. Finally, the developed ADP with the designed cost function is tested in two numerical cases, and simulation results confirm its effectiveness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Robust and Nonlinear Control 工程技术-工程：电子与电气

CiteScore

6.70

自引率

20.50%

发文量

505

审稿时长

2.7 months

期刊介绍： Papers that do not include an element of robust or nonlinear control and estimation theory will not be considered by the journal, and all papers will be expected to include significant novel content. The focus of the journal is on model based control design approaches rather than heuristic or rule based methods. Papers on neural networks will have to be of exceptional novelty to be considered for the journal.

期刊最新文献

Issue Information Issue Information A Constructive Approach to Multi-Variable Extremum Seeking With Discrete-Time Delayed Noisy Measurements Affine Formation Control for End-Effectors of Networked Manipulators With Maneuvering Leaders and Unknown System Parameters Neural Network-Observer-Based ILC of Nonlinear Systems With Event-Driven Strategy