未知动态和干扰条件下基于强化学习的跨媒体车辆有限时间跨媒体跟踪控制

IF 2.3 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS IET Control Theory and Applications Pub Date : 2024-06-18 DOI:10.1049/cth2.12693

Shichong Wu, Lingli Xie, Jun Xian, Fei Liao, Wenhua Wu, Mingqing Lu, Xian Yi

{"title":"未知动态和干扰条件下基于强化学习的跨媒体车辆有限时间跨媒体跟踪控制","authors":"Shichong Wu, Lingli Xie, Jun Xian, Fei Liao, Wenhua Wu, Mingqing Lu, Xian Yi","doi":"10.1049/cth2.12693","DOIUrl":null,"url":null,"abstract":"<p>This study proposes a reinforcement learning-based finite-time cross-media tracking control approach for a slender body cross-media vehicle encountering unknown hydrodynamics, wind, and wave disturbances. Initially, a reinforcement learning framework consisting of the actor neural network and critic neural network is constructed. The critic neural network monitors the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances, minimising the cost function to optimise performance. Subsequently, the command filter featuring finite-time convergence is formulated, effectively managing the corresponding filter error through a proposed error compensating signal. By integrating these techniques, a reinforcement learning-based finite-time control strategy is developed, circumventing the singularity issue inherent in traditional finite-time backstepping strategies. Comparative analysis with existing methods demonstrates the strong robustness of the proposed scheme against unknown hydrodynamics and disturbances, ensuring finite-time convergence of the system's states and optimising controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"18 11","pages":"1475-1490"},"PeriodicalIF":2.3000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12693","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning-based finite-time cross-media tracking control for a cross-media vehicle under unknown dynamics and disturbances\",\"authors\":\"Shichong Wu, Lingli Xie, Jun Xian, Fei Liao, Wenhua Wu, Mingqing Lu, Xian Yi\",\"doi\":\"10.1049/cth2.12693\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This study proposes a reinforcement learning-based finite-time cross-media tracking control approach for a slender body cross-media vehicle encountering unknown hydrodynamics, wind, and wave disturbances. Initially, a reinforcement learning framework consisting of the actor neural network and critic neural network is constructed. The critic neural network monitors the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances, minimising the cost function to optimise performance. Subsequently, the command filter featuring finite-time convergence is formulated, effectively managing the corresponding filter error through a proposed error compensating signal. By integrating these techniques, a reinforcement learning-based finite-time control strategy is developed, circumventing the singularity issue inherent in traditional finite-time backstepping strategies. Comparative analysis with existing methods demonstrates the strong robustness of the proposed scheme against unknown hydrodynamics and disturbances, ensuring finite-time convergence of the system's states and optimising controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.</p>\",\"PeriodicalId\":50382,\"journal\":{\"name\":\"IET Control Theory and Applications\",\"volume\":\"18 11\",\"pages\":\"1475-1490\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12693\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Control Theory and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cth2.12693\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cth2.12693","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本研究提出了一种基于强化学习的有限时间跨媒体跟踪控制方法，适用于遇到未知流体力学、风和波浪干扰的细长体跨媒体飞行器。首先，构建了一个由行动者神经网络和批评者神经网络组成的强化学习框架。批判者神经网络监控行动者神经网络的行动并逼近成本函数，而行动者神经网络则估计未知的流体力学和干扰，最小化成本函数以优化性能。随后，制定了以有限时间收敛为特征的指令滤波器，通过建议的误差补偿信号有效管理相应的滤波器误差。通过整合这些技术，开发出了基于强化学习的有限时间控制策略，规避了传统有限时间反步进策略固有的奇异性问题。与现有方法的对比分析表明，所提出的方案对未知流体力学和干扰具有很强的鲁棒性，可确保系统状态的有限时间收敛并优化控制器性能。最后，模拟证实了所提出方法的有效性和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Reinforcement learning-based finite-time cross-media tracking control for a cross-media vehicle under unknown dynamics and disturbances

This study proposes a reinforcement learning-based finite-time cross-media tracking control approach for a slender body cross-media vehicle encountering unknown hydrodynamics, wind, and wave disturbances. Initially, a reinforcement learning framework consisting of the actor neural network and critic neural network is constructed. The critic neural network monitors the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances, minimising the cost function to optimise performance. Subsequently, the command filter featuring finite-time convergence is formulated, effectively managing the corresponding filter error through a proposed error compensating signal. By integrating these techniques, a reinforcement learning-based finite-time control strategy is developed, circumventing the singularity issue inherent in traditional finite-time backstepping strategies. Comparative analysis with existing methods demonstrates the strong robustness of the proposed scheme against unknown hydrodynamics and disturbances, ensuring finite-time convergence of the system's states and optimising controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Control Theory and Applications 工程技术-工程：电子与电气

CiteScore

5.70

自引率

7.70%

发文量

167

审稿时长

5.1 months

期刊介绍： IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.

期刊最新文献

A New Auto-Tuned Velocity Estimator Design for Mechatronic Systems Phase-Advanced ADRC: Precision Missile Attitude Control via Delay Compensation Predefined-Time Fault-Tolerant Control of Rigid Spacecraft With Prescribed Performance