Efficient Real2Sim2Real of Continuum Robots Using Deep Reinforcement Learning With Koopman Operator

IF 7.2 1区工程技术 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Industrial Electronics Pub Date : 2025-02-05 DOI:10.1109/TIE.2025.3532733

Guanglin Ji;Qian Gao;Yin Xiao;Zhenglong Sun

{"title":"Efficient Real2Sim2Real of Continuum Robots Using Deep Reinforcement Learning With Koopman Operator","authors":"Guanglin Ji;Qian Gao;Yin Xiao;Zhenglong Sun","doi":"10.1109/TIE.2025.3532733","DOIUrl":null,"url":null,"abstract":"Accurate control of continuum robots is challenging, especially in the presence of external disturbances. To address this issue, reinforcement learning (RL) has been increasingly investigated in continuum robot control due to its online policy updating capability. However, the gap in Sim2Real transfer caused by inaccurate modeling results in RL implementation a dilemma. In this article, we propose a safety-critical Real2Sim2Real online RL framework, where the Real2Sim transfer is first achieved by the identification of a continuum robot using the Koopman operator. We further improve the training efficiency by introducing an imperfect demonstration into the RL framework. The offline policy is trained in simulations and then tested on a real continuum robot platform. During tests, the tracking performance is influenced by the hysteresis effect that cannot be captured by the Koopman operator. This results in a millimeter-level tracking root mean square error (RMSE). To address this issue, we online update the policy as well as the model, and the RMSE of the online controller outperforms the offline controller by 89.16% in free space and 85.70% under external payload, respectively.","PeriodicalId":13402,"journal":{"name":"IEEE Transactions on Industrial Electronics","volume":"72 8","pages":"8333-8343"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10875033/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate control of continuum robots is challenging, especially in the presence of external disturbances. To address this issue, reinforcement learning (RL) has been increasingly investigated in continuum robot control due to its online policy updating capability. However, the gap in Sim2Real transfer caused by inaccurate modeling results in RL implementation a dilemma. In this article, we propose a safety-critical Real2Sim2Real online RL framework, where the Real2Sim transfer is first achieved by the identification of a continuum robot using the Koopman operator. We further improve the training efficiency by introducing an imperfect demonstration into the RL framework. The offline policy is trained in simulations and then tested on a real continuum robot platform. During tests, the tracking performance is influenced by the hysteresis effect that cannot be captured by the Koopman operator. This results in a millimeter-level tracking root mean square error (RMSE). To address this issue, we online update the policy as well as the model, and the RMSE of the online controller outperforms the offline controller by 89.16% in free space and 85.70% under external payload, respectively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于Koopman算子的深度强化学习连续统机器人的高效Real2Sim2Real

连续体机器人的精确控制具有挑战性，特别是在存在外部干扰的情况下。为了解决这一问题，强化学习（RL）由于其在线策略更新能力在连续统机器人控制中得到了越来越多的研究。然而，由于建模不准确导致的Sim2Real传输的差距导致强化学习的实现陷入困境。在本文中，我们提出了一个安全关键的Real2Sim2Real在线RL框架，其中Real2Sim传输首先通过使用Koopman算子识别连续体机器人来实现。我们通过在RL框架中引入一个不完善的演示来进一步提高训练效率。离线策略在仿真中进行训练，然后在真实连续体机器人平台上进行测试。在测试过程中，跟踪性能受到库普曼算子无法捕获的滞后效应的影响。这导致毫米级的跟踪均方根误差（RMSE）。为了解决这个问题，我们在线更新了策略和模型，在线控制器的RMSE在空闲空间和外部负载下分别比离线控制器高89.16%和85.70%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Industrial Electronics 工程技术-工程：电子与电气

CiteScore

16.80

自引率

9.10%

发文量

1396

审稿时长

6.3 months

期刊介绍： Journal Name: IEEE Transactions on Industrial Electronics Publication Frequency: Monthly Scope: The scope of IEEE Transactions on Industrial Electronics encompasses the following areas: Applications of electronics, controls, and communications in industrial and manufacturing systems and processes. Power electronics and drive control techniques. System control and signal processing. Fault detection and diagnosis. Power systems. Instrumentation, measurement, and testing. Modeling and simulation. Motion control. Robotics. Sensors and actuators. Implementation of neural networks, fuzzy logic, and artificial intelligence in industrial systems. Factory automation. Communication and computer networks.