Reinforcement Learning Strategy-Based Adaptive Tracking Control for Underactuated Dual Ship-Mounted Cranes: Theoretical Design and Hardware Experiments

IF 7.2 1区工程技术 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Industrial Electronics Pub Date : 2024-10-29 DOI:10.1109/TIE.2024.3481885

Shujie Wu;Haibo Zhang;Yuzhe Qian

{"title":"Reinforcement Learning Strategy-Based Adaptive Tracking Control for Underactuated Dual Ship-Mounted Cranes: Theoretical Design and Hardware Experiments","authors":"Shujie Wu;Haibo Zhang;Yuzhe Qian","doi":"10.1109/TIE.2024.3481885","DOIUrl":null,"url":null,"abstract":"As a flexible transportation equipment, the dual ship-mounted crane (DSMC) systems are widely used to transport cargos/goods under complex marine and harbor environments. However, automatic control of such complex systems still faces significant challenges due to their underactuated characteristics, unexpected sea wave disturbances, and uncertain system parameters. Most existing control methods are based on accurate dynamics model or linearized models, which can hardly suppress unknown interferences or may badly decreasing control effects when there exist system uncertainties. To solve the above problems, a reinforcement learning based adaptive tracking control method is proposed in this article, which can obtain a satisfactory control performance without accurate system parameters. Specifically, an actor and a critic neural network are constructed to execute the reinforcement learning (RL) algorithm, for which, the actor-network executes the control input, and the critic network judges the control performance and feedback reinforcement signal to the action network. In addition, a robust integral of the sign of error feedback signal is introduced to improve the robustness of the system. Based on Lyapunov stability theory, it is proved that the tracking error can converge to zero asymptotically under the proposed controller. Finally, hardware experimental results show the effectiveness and robustness of the proposed controller.","PeriodicalId":13402,"journal":{"name":"IEEE Transactions on Industrial Electronics","volume":"72 5","pages":"5408-5417"},"PeriodicalIF":7.2000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10738215/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

As a flexible transportation equipment, the dual ship-mounted crane (DSMC) systems are widely used to transport cargos/goods under complex marine and harbor environments. However, automatic control of such complex systems still faces significant challenges due to their underactuated characteristics, unexpected sea wave disturbances, and uncertain system parameters. Most existing control methods are based on accurate dynamics model or linearized models, which can hardly suppress unknown interferences or may badly decreasing control effects when there exist system uncertainties. To solve the above problems, a reinforcement learning based adaptive tracking control method is proposed in this article, which can obtain a satisfactory control performance without accurate system parameters. Specifically, an actor and a critic neural network are constructed to execute the reinforcement learning (RL) algorithm, for which, the actor-network executes the control input, and the critic network judges the control performance and feedback reinforcement signal to the action network. In addition, a robust integral of the sign of error feedback signal is introduced to improve the robustness of the system. Based on Lyapunov stability theory, it is proved that the tracking error can converge to zero asymptotically under the proposed controller. Finally, hardware experimental results show the effectiveness and robustness of the proposed controller.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于强化学习策略的欠驱动双船载起重机自适应跟踪控制：理论设计与硬件实验

双船吊系统作为一种灵活的运输设备，被广泛应用于复杂的海洋和港口环境下的货物运输。然而，由于此类复杂系统的欠驱动特性、不可预期的海浪干扰和系统参数的不确定性，其自动控制仍然面临着重大挑战。现有的控制方法大多是基于精确的动力学模型或线性化模型，当系统存在不确定性时，这些方法很难抑制未知干扰，甚至会严重降低控制效果。针对上述问题，本文提出了一种基于强化学习的自适应跟踪控制方法，该方法可以在不需要精确系统参数的情况下获得满意的控制性能。具体而言，构建了一个行动者和一个评论家神经网络来执行强化学习（RL）算法，其中行动者网络执行控制输入，评论家网络判断控制效果并将强化信号反馈给行动网络。此外，引入了误差反馈信号符号的鲁棒积分，提高了系统的鲁棒性。基于李雅普诺夫稳定性理论，证明了该控制器的跟踪误差可以渐近收敛于零。最后，硬件实验结果表明了所提控制器的有效性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Industrial Electronics 工程技术-工程：电子与电气

CiteScore

16.80

自引率

9.10%

发文量

1396

审稿时长

6.3 months

期刊介绍： Journal Name: IEEE Transactions on Industrial Electronics Publication Frequency: Monthly Scope: The scope of IEEE Transactions on Industrial Electronics encompasses the following areas: Applications of electronics, controls, and communications in industrial and manufacturing systems and processes. Power electronics and drive control techniques. System control and signal processing. Fault detection and diagnosis. Power systems. Instrumentation, measurement, and testing. Modeling and simulation. Motion control. Robotics. Sensors and actuators. Implementation of neural networks, fuzzy logic, and artificial intelligence in industrial systems. Factory automation. Communication and computer networks.