Lifelong Safe Optimal Adaptive Tracking Control of Nonlinear Strict-Feedback Discrete-Time Systems

IF 3.8 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS International Journal of Adaptive Control and Signal Processing Pub Date : 2024-12-12 DOI:10.1002/acs.3950
Behzad Farzanegan, S. Jagannathan
{"title":"Lifelong Safe Optimal Adaptive Tracking Control of Nonlinear Strict-Feedback Discrete-Time Systems","authors":"Behzad Farzanegan,&nbsp;S. Jagannathan","doi":"10.1002/acs.3950","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"39 3","pages":"451-470"},"PeriodicalIF":3.8000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3950","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
非线性严格反馈离散系统的终身安全最优自适应跟踪控制
本文提出了一种利用多层神经网络(MNN)框架实现一类非线性离散系统,特别是严格反馈系统的多任务安全最优自适应跟踪(MSOAT)的综合方法。首先,为每个子系统引入具有新屏障函数(BF)项的代价函数来解决弱安全可达问题,作为引导系统轨迹向安全集同时避免不需要集的关键工具。为了解决跟踪问题,采用Hamilton-Jacobi-Bellman (HJB)框架,通过基于行动者批评mnn的回溯技术来估计值函数的解,并获得每个子系统的虚拟和实际最优控制策略,有效地规避了非因果问题。此外,为了减轻多任务场景下的灾难性遗忘,在不直接计算Fisher信息矩阵的情况下,将一个来自弹性权重巩固(EWC)方法的在线版本的正则化项包含在评论家和行动者MNN更新定律中。为了提高收敛速度,使用混合学习技术对批评MNN进行调整,包括在特定采样时刻和在这些间隔内迭代地进行权重调整。将具有时变BF的控制障碍函数(CBF)集成到行动者更新律中,与BF协同使轨迹保持在安全集合中,权衡系数较小,同时实时验证安全状态。最后,建立整体稳定性。以六自由度自主水下航行器(AUV)为例,对该方法的有效性进行了验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.30
自引率
16.10%
发文量
163
审稿时长
5 months
期刊介绍: The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material. Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include: Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers Nonlinear, Robust and Intelligent Adaptive Controllers Linear and Nonlinear Multivariable System Identification and Estimation Identification of Linear Parameter Varying, Distributed and Hybrid Systems Multiple Model Adaptive Control Adaptive Signal processing Theory and Algorithms Adaptation in Multi-Agent Systems Condition Monitoring Systems Fault Detection and Isolation Methods Fault Detection and Isolation Methods Fault-Tolerant Control (system supervision and diagnosis) Learning Systems and Adaptive Modelling Real Time Algorithms for Adaptive Signal Processing and Control Adaptive Signal Processing and Control Applications Adaptive Cloud Architectures and Networking Adaptive Mechanisms for Internet of Things Adaptive Sliding Mode Control.
期刊最新文献
Issue Information Quantized Iterative Learning Control for Consensus of Nonlinear Impulsive Multi-Agent Systems With Inter-Channel Encoding-Decoding Mechanisms and Packet Dropouts Adaptive Predefined-Time Control for High-Order Nonlinear Systems With Unmodeled Dynamics Composite Learning Adaptive Optimized Backstepping Control for a Class of Nonlinear Strict-Feedback Systems With Prescribed Performance Robust Fault H ∞ $$ {H}_{\infty } $$ Filtering Design in Finite Frequency Domain for Discrete-Time Switched Singular Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1