Reinforcement Learning Strategy-Based Adaptive Tracking Control for Underactuated Dual Ship-Mounted Cranes: Theoretical Design and Hardware Experiments

IF 7.2 1区 工程技术 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Industrial Electronics Pub Date : 2024-10-29 DOI:10.1109/TIE.2024.3481885
Shujie Wu;Haibo Zhang;Yuzhe Qian
{"title":"Reinforcement Learning Strategy-Based Adaptive Tracking Control for Underactuated Dual Ship-Mounted Cranes: Theoretical Design and Hardware Experiments","authors":"Shujie Wu;Haibo Zhang;Yuzhe Qian","doi":"10.1109/TIE.2024.3481885","DOIUrl":null,"url":null,"abstract":"As a flexible transportation equipment, the dual ship-mounted crane (DSMC) systems are widely used to transport cargos/goods under complex marine and harbor environments. However, automatic control of such complex systems still faces significant challenges due to their underactuated characteristics, unexpected sea wave disturbances, and uncertain system parameters. Most existing control methods are based on accurate dynamics model or linearized models, which can hardly suppress unknown interferences or may badly decreasing control effects when there exist system uncertainties. To solve the above problems, a reinforcement learning based adaptive tracking control method is proposed in this article, which can obtain a satisfactory control performance without accurate system parameters. Specifically, an actor and a critic neural network are constructed to execute the reinforcement learning (RL) algorithm, for which, the actor-network executes the control input, and the critic network judges the control performance and feedback reinforcement signal to the action network. In addition, a robust integral of the sign of error feedback signal is introduced to improve the robustness of the system. Based on Lyapunov stability theory, it is proved that the tracking error can converge to zero asymptotically under the proposed controller. Finally, hardware experimental results show the effectiveness and robustness of the proposed controller.","PeriodicalId":13402,"journal":{"name":"IEEE Transactions on Industrial Electronics","volume":"72 5","pages":"5408-5417"},"PeriodicalIF":7.2000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10738215/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

As a flexible transportation equipment, the dual ship-mounted crane (DSMC) systems are widely used to transport cargos/goods under complex marine and harbor environments. However, automatic control of such complex systems still faces significant challenges due to their underactuated characteristics, unexpected sea wave disturbances, and uncertain system parameters. Most existing control methods are based on accurate dynamics model or linearized models, which can hardly suppress unknown interferences or may badly decreasing control effects when there exist system uncertainties. To solve the above problems, a reinforcement learning based adaptive tracking control method is proposed in this article, which can obtain a satisfactory control performance without accurate system parameters. Specifically, an actor and a critic neural network are constructed to execute the reinforcement learning (RL) algorithm, for which, the actor-network executes the control input, and the critic network judges the control performance and feedback reinforcement signal to the action network. In addition, a robust integral of the sign of error feedback signal is introduced to improve the robustness of the system. Based on Lyapunov stability theory, it is proved that the tracking error can converge to zero asymptotically under the proposed controller. Finally, hardware experimental results show the effectiveness and robustness of the proposed controller.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于强化学习策略的欠驱动双船载起重机自适应跟踪控制:理论设计与硬件实验
双船吊系统作为一种灵活的运输设备,被广泛应用于复杂的海洋和港口环境下的货物运输。然而,由于此类复杂系统的欠驱动特性、不可预期的海浪干扰和系统参数的不确定性,其自动控制仍然面临着重大挑战。现有的控制方法大多是基于精确的动力学模型或线性化模型,当系统存在不确定性时,这些方法很难抑制未知干扰,甚至会严重降低控制效果。针对上述问题,本文提出了一种基于强化学习的自适应跟踪控制方法,该方法可以在不需要精确系统参数的情况下获得满意的控制性能。具体而言,构建了一个行动者和一个评论家神经网络来执行强化学习(RL)算法,其中行动者网络执行控制输入,评论家网络判断控制效果并将强化信号反馈给行动网络。此外,引入了误差反馈信号符号的鲁棒积分,提高了系统的鲁棒性。基于李雅普诺夫稳定性理论,证明了该控制器的跟踪误差可以渐近收敛于零。最后,硬件实验结果表明了所提控制器的有效性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Industrial Electronics
IEEE Transactions on Industrial Electronics 工程技术-工程:电子与电气
CiteScore
16.80
自引率
9.10%
发文量
1396
审稿时长
6.3 months
期刊介绍: Journal Name: IEEE Transactions on Industrial Electronics Publication Frequency: Monthly Scope: The scope of IEEE Transactions on Industrial Electronics encompasses the following areas: Applications of electronics, controls, and communications in industrial and manufacturing systems and processes. Power electronics and drive control techniques. System control and signal processing. Fault detection and diagnosis. Power systems. Instrumentation, measurement, and testing. Modeling and simulation. Motion control. Robotics. Sensors and actuators. Implementation of neural networks, fuzzy logic, and artificial intelligence in industrial systems. Factory automation. Communication and computer networks.
期刊最新文献
Modeling and Control for UAV With Off-Center Slung Load Exponential Stabilization for Discrete-Time Networked Control Systems With Multisource-Induced Time-Varying Delays IEEE Transactions on Industrial Electronics Publication Information A Hybrid Voltage Level Modulation for Dual Transformer LCL Immittance Network Resonant Dual-Active-Bridge Converter Fault-Tolerant Control Framework for Quadcopters With Implementation-Decoupled Fault Diagnosis and Continuum Control Allocation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1