Reinforcement learning-based finite-time cross-media tracking control for a cross-media vehicle under unknown dynamics and disturbances

IF 2.2 4区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS IET Control Theory and Applications Pub Date : 2024-06-18 DOI:10.1049/cth2.12693
Shichong Wu, Lingli Xie, Jun Xian, Fei Liao, Wenhua Wu, Mingqing Lu, Xian Yi
{"title":"Reinforcement learning-based finite-time cross-media tracking control for a cross-media vehicle under unknown dynamics and disturbances","authors":"Shichong Wu,&nbsp;Lingli Xie,&nbsp;Jun Xian,&nbsp;Fei Liao,&nbsp;Wenhua Wu,&nbsp;Mingqing Lu,&nbsp;Xian Yi","doi":"10.1049/cth2.12693","DOIUrl":null,"url":null,"abstract":"<p>This study proposes a reinforcement learning-based finite-time cross-media tracking control approach for a slender body cross-media vehicle encountering unknown hydrodynamics, wind, and wave disturbances. Initially, a reinforcement learning framework consisting of the actor neural network and critic neural network is constructed. The critic neural network monitors the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances, minimising the cost function to optimise performance. Subsequently, the command filter featuring finite-time convergence is formulated, effectively managing the corresponding filter error through a proposed error compensating signal. By integrating these techniques, a reinforcement learning-based finite-time control strategy is developed, circumventing the singularity issue inherent in traditional finite-time backstepping strategies. Comparative analysis with existing methods demonstrates the strong robustness of the proposed scheme against unknown hydrodynamics and disturbances, ensuring finite-time convergence of the system's states and optimising controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"18 11","pages":"1475-1490"},"PeriodicalIF":2.2000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12693","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12693","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a reinforcement learning-based finite-time cross-media tracking control approach for a slender body cross-media vehicle encountering unknown hydrodynamics, wind, and wave disturbances. Initially, a reinforcement learning framework consisting of the actor neural network and critic neural network is constructed. The critic neural network monitors the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances, minimising the cost function to optimise performance. Subsequently, the command filter featuring finite-time convergence is formulated, effectively managing the corresponding filter error through a proposed error compensating signal. By integrating these techniques, a reinforcement learning-based finite-time control strategy is developed, circumventing the singularity issue inherent in traditional finite-time backstepping strategies. Comparative analysis with existing methods demonstrates the strong robustness of the proposed scheme against unknown hydrodynamics and disturbances, ensuring finite-time convergence of the system's states and optimising controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
未知动态和干扰条件下基于强化学习的跨媒体车辆有限时间跨媒体跟踪控制
本研究提出了一种基于强化学习的有限时间跨媒体跟踪控制方法,适用于遇到未知流体力学、风和波浪干扰的细长体跨媒体飞行器。首先,构建了一个由行动者神经网络和批评者神经网络组成的强化学习框架。批判者神经网络监控行动者神经网络的行动并逼近成本函数,而行动者神经网络则估计未知的流体力学和干扰,最小化成本函数以优化性能。随后,制定了以有限时间收敛为特征的指令滤波器,通过建议的误差补偿信号有效管理相应的滤波器误差。通过整合这些技术,开发出了基于强化学习的有限时间控制策略,规避了传统有限时间反步进策略固有的奇异性问题。与现有方法的对比分析表明,所提出的方案对未知流体力学和干扰具有很强的鲁棒性,可确保系统状态的有限时间收敛并优化控制器性能。最后,模拟证实了所提出方法的有效性和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IET Control Theory and Applications
IET Control Theory and Applications 工程技术-工程:电子与电气
CiteScore
5.70
自引率
7.70%
发文量
167
审稿时长
5.1 months
期刊介绍: IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.
期刊最新文献
Neuro-adaptive prescribed performance control for spacecraft rendezvous based on the fully-actuated system approach Adaptive polynomial Kalman filter for nonlinear state estimation in modified AR time series with fixed coefficients Observer-based adaptive control of vehicle platoon with uncertainty and input constraints An improved two-degree-of-freedom ADRC for asynchronous motor vector system Receding horizon control for persistent monitoring tasks with monitoring count requirements
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1