Adaptive Finite-Time RL Control for Stochastic Non-Linear Systems With Full State Constraints and Dead Zone Output

IF 3.8 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS International Journal of Adaptive Control and Signal Processing Pub Date : 2025-01-29 DOI:10.1002/acs.3980
Hongyao Li, Fuli Wang
{"title":"Adaptive Finite-Time RL Control for Stochastic Non-Linear Systems With Full State Constraints and Dead Zone Output","authors":"Hongyao Li,&nbsp;Fuli Wang","doi":"10.1002/acs.3980","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In this article, the finite-time control problem of adaptive neural network (NN) reinforcement learning (RL) is investigated for the continuous time stochastic non-linear systems with full state constraints and dead zone output. Firstly, the adaptive estimation and smooth approximation technique are introduced to solve the difficulty arising from the dead zone non-linearity. Moreover, to overcome the problem of calculating the explosion caused by the repeated differentiation of the virtual control signals, a finite-time command filter is constructed. Combining the backstepping technique and the identifier-actor-critic RL strategy, an adaptive neural finite-time RL control scheme is proposed for the considered system by constructing the tangent-type time-varying barrier Lyapunov functions (BLFs), which optimizes the tracking performance while ensuring all states do not violate the constraints. Under the proposed control strategy, it is guaranteed that all signals are bounded in probability, and the output of the system can track the reference signal within a finite-time. Finally, the simulation results verify the effectiveness of the proposed scheme.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"39 4","pages":"818-828"},"PeriodicalIF":3.8000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3980","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this article, the finite-time control problem of adaptive neural network (NN) reinforcement learning (RL) is investigated for the continuous time stochastic non-linear systems with full state constraints and dead zone output. Firstly, the adaptive estimation and smooth approximation technique are introduced to solve the difficulty arising from the dead zone non-linearity. Moreover, to overcome the problem of calculating the explosion caused by the repeated differentiation of the virtual control signals, a finite-time command filter is constructed. Combining the backstepping technique and the identifier-actor-critic RL strategy, an adaptive neural finite-time RL control scheme is proposed for the considered system by constructing the tangent-type time-varying barrier Lyapunov functions (BLFs), which optimizes the tracking performance while ensuring all states do not violate the constraints. Under the proposed control strategy, it is guaranteed that all signals are bounded in probability, and the output of the system can track the reference signal within a finite-time. Finally, the simulation results verify the effectiveness of the proposed scheme.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有全状态约束和死区输出的随机非线性系统的自适应有限时间 RL 控制
针对具有全状态约束和死区输出的连续时间随机非线性系统,研究了自适应神经网络(NN)强化学习(RL)的有限时间控制问题。首先,引入自适应估计和光滑逼近技术,解决了死区非线性带来的困难;此外,为了克服由于虚拟控制信号的重复微分引起的爆炸计算问题,构造了有限时间命令滤波器。结合回溯技术和辨识者-行为者-批评家RL策略,通过构造切线型时变屏障Lyapunov函数(blf),提出了一种自适应神经有限时间RL控制方案,在保证所有状态不违反约束的情况下优化跟踪性能。在所提出的控制策略下,保证了所有信号在概率上是有界的,并且系统的输出能够在有限时间内跟踪参考信号。最后,仿真结果验证了所提方案的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.30
自引率
16.10%
发文量
163
审稿时长
5 months
期刊介绍: The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material. Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include: Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers Nonlinear, Robust and Intelligent Adaptive Controllers Linear and Nonlinear Multivariable System Identification and Estimation Identification of Linear Parameter Varying, Distributed and Hybrid Systems Multiple Model Adaptive Control Adaptive Signal processing Theory and Algorithms Adaptation in Multi-Agent Systems Condition Monitoring Systems Fault Detection and Isolation Methods Fault Detection and Isolation Methods Fault-Tolerant Control (system supervision and diagnosis) Learning Systems and Adaptive Modelling Real Time Algorithms for Adaptive Signal Processing and Control Adaptive Signal Processing and Control Applications Adaptive Cloud Architectures and Networking Adaptive Mechanisms for Internet of Things Adaptive Sliding Mode Control.
期刊最新文献
Issue Information Issue Information Issue Information Dynamics-Memory-Event-Based Adaptive Attitude Consensus Control for Multi-UAVs With Self-Adjusting Performance Fault Estimation Based Control Design for Nonlinear Fractional-Order Systems Under Proportional-Integral Observer
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1