Siwen Liu;Yi Zuo;Tieshan Li;Huanqing Wang;Xiaoyang Gao;Yang Xiao
Journal: IEEE Transactions on Artificial Intelligence, vol. 6, no. 1, pp. 66-78
Publication date: 2024-08-19
DOI: 10.1109/TAI.2024.3444731
URL: https://ieeexplore.ieee.org/document/10638813/
Citations: 0

Abstract

Adaptive Composite Fixed-Time RL-Optimized Control for Nonlinear Systems and Its Application to Intelligent Ship Autopilot
In this article, an adaptive fixed-time reinforcement learning (RL) optimized control policy is proposed for nonlinear systems. Radial basis function neural networks (RBFNNs) are used to approximate the uncertain nonlinearities appearing in the considered systems, and RL is applied under a critic-actor architecture built on RBFNNs. Specifically, a novel fixed-time smooth estimation system is proposed to improve the estimation performance of the RBFNNs. Introducing the hyperbolic tangent function avoids the singularity problem in the derivative of the virtual controller. The stability analysis shows that the tracking error converges to an adjustable region near the origin within a fixed time and that all signals remain bounded. Finally, an intelligent ship autopilot is simulated to demonstrate the applicability of the proposed control scheme.
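As a rough illustration of the RBFNN approximation idea mentioned in the abstract, the sketch below fits a Gaussian radial basis function network to a smooth scalar nonlinearity. It is not the authors' implementation: the target function, the number of centers, the width, and the offline least-squares fit (used here in place of the adaptive weight-update laws the paper derives) are all assumptions chosen for the example.

```python
import numpy as np

def rbf_features(x, centers, width):
    """Gaussian basis values for each sample; shape (n_samples, n_centers)."""
    diff = x[:, None] - centers[None, :]
    return np.exp(-(diff ** 2) / (width ** 2))

def unknown_nonlinearity(x):
    # Hypothetical stand-in for an uncertain system nonlinearity.
    return np.sin(x) + 0.5 * x ** 2

# Assumed network layout: 15 evenly spaced centers with a shared width.
centers = np.linspace(-2.0, 2.0, 15)
width = 0.5

# Fit the output weights by ordinary least squares on sampled data.
x_train = np.linspace(-2.0, 2.0, 200)
S = rbf_features(x_train, centers, width)
weights, *_ = np.linalg.lstsq(S, unknown_nonlinearity(x_train), rcond=None)

# Evaluate the approximation W^T S(x) on points inside the training range.
x_test = np.linspace(-1.8, 1.8, 50)
approx = rbf_features(x_test, centers, width) @ weights
max_error = float(np.max(np.abs(approx - unknown_nonlinearity(x_test))))
print(f"max approximation error: {max_error:.4f}")
```

In an adaptive controller the weights would be updated online from the tracking error rather than fitted in a batch; the batch fit only shows that a modest number of Gaussian basis functions can represent a smooth nonlinearity on a compact set.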
Source journal metrics: CiteScore 7.70; self-citation rate 0.00%; articles published 0