Efficient Real2Sim2Real of Continuum Robots Using Deep Reinforcement Learning With Koopman Operator

IF 7.2 1区 工程技术 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Industrial Electronics Pub Date : 2025-02-05 DOI:10.1109/TIE.2025.3532733
Guanglin Ji;Qian Gao;Yin Xiao;Zhenglong Sun
{"title":"Efficient Real2Sim2Real of Continuum Robots Using Deep Reinforcement Learning With Koopman Operator","authors":"Guanglin Ji;Qian Gao;Yin Xiao;Zhenglong Sun","doi":"10.1109/TIE.2025.3532733","DOIUrl":null,"url":null,"abstract":"Accurate control of continuum robots is challenging, especially in the presence of external disturbances. To address this issue, reinforcement learning (RL) has been increasingly investigated in continuum robot control due to its online policy updating capability. However, the gap in Sim2Real transfer caused by inaccurate modeling results in RL implementation a dilemma. In this article, we propose a safety-critical Real2Sim2Real online RL framework, where the Real2Sim transfer is first achieved by the identification of a continuum robot using the Koopman operator. We further improve the training efficiency by introducing an imperfect demonstration into the RL framework. The offline policy is trained in simulations and then tested on a real continuum robot platform. During tests, the tracking performance is influenced by the hysteresis effect that cannot be captured by the Koopman operator. This results in a millimeter-level tracking root mean square error (RMSE). To address this issue, we online update the policy as well as the model, and the RMSE of the online controller outperforms the offline controller by 89.16% in free space and 85.70% under external payload, respectively.","PeriodicalId":13402,"journal":{"name":"IEEE Transactions on Industrial Electronics","volume":"72 8","pages":"8333-8343"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10875033/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate control of continuum robots is challenging, especially in the presence of external disturbances. To address this issue, reinforcement learning (RL) has been increasingly investigated in continuum robot control due to its online policy updating capability. However, the gap in Sim2Real transfer caused by inaccurate modeling results in RL implementation a dilemma. In this article, we propose a safety-critical Real2Sim2Real online RL framework, where the Real2Sim transfer is first achieved by the identification of a continuum robot using the Koopman operator. We further improve the training efficiency by introducing an imperfect demonstration into the RL framework. The offline policy is trained in simulations and then tested on a real continuum robot platform. During tests, the tracking performance is influenced by the hysteresis effect that cannot be captured by the Koopman operator. This results in a millimeter-level tracking root mean square error (RMSE). To address this issue, we online update the policy as well as the model, and the RMSE of the online controller outperforms the offline controller by 89.16% in free space and 85.70% under external payload, respectively.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Koopman算子的深度强化学习连续统机器人的高效Real2Sim2Real
连续体机器人的精确控制具有挑战性,特别是在存在外部干扰的情况下。为了解决这一问题,强化学习(RL)由于其在线策略更新能力在连续统机器人控制中得到了越来越多的研究。然而,由于建模不准确导致的Sim2Real传输的差距导致强化学习的实现陷入困境。在本文中,我们提出了一个安全关键的Real2Sim2Real在线RL框架,其中Real2Sim传输首先通过使用Koopman算子识别连续体机器人来实现。我们通过在RL框架中引入一个不完善的演示来进一步提高训练效率。离线策略在仿真中进行训练,然后在真实连续体机器人平台上进行测试。在测试过程中,跟踪性能受到库普曼算子无法捕获的滞后效应的影响。这导致毫米级的跟踪均方根误差(RMSE)。为了解决这个问题,我们在线更新了策略和模型,在线控制器的RMSE在空闲空间和外部负载下分别比离线控制器高89.16%和85.70%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Industrial Electronics
IEEE Transactions on Industrial Electronics 工程技术-工程:电子与电气
CiteScore
16.80
自引率
9.10%
发文量
1396
审稿时长
6.3 months
期刊介绍: Journal Name: IEEE Transactions on Industrial Electronics Publication Frequency: Monthly Scope: The scope of IEEE Transactions on Industrial Electronics encompasses the following areas: Applications of electronics, controls, and communications in industrial and manufacturing systems and processes. Power electronics and drive control techniques. System control and signal processing. Fault detection and diagnosis. Power systems. Instrumentation, measurement, and testing. Modeling and simulation. Motion control. Robotics. Sensors and actuators. Implementation of neural networks, fuzzy logic, and artificial intelligence in industrial systems. Factory automation. Communication and computer networks.
期刊最新文献
Modeling and Control for UAV With Off-Center Slung Load Exponential Stabilization for Discrete-Time Networked Control Systems With Multisource-Induced Time-Varying Delays IEEE Transactions on Industrial Electronics Publication Information A Hybrid Voltage Level Modulation for Dual Transformer LCL Immittance Network Resonant Dual-Active-Bridge Converter Fault-Tolerant Control Framework for Quadcopters With Implementation-Decoupled Fault Diagnosis and Continuum Control Allocation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1