Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly

Industrial Robot Pub Date : 2023-11-22 DOI:10.1108/ir-07-2023-0156

Weiwen Mu, Wenbai Chen, Huaidong Zhou, Naijun Liu, Haobin Shi, Jingchen Li

{"title":"Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly","authors":"Weiwen Mu, Wenbai Chen, Huaidong Zhou, Naijun Liu, Haobin Shi, Jingchen Li","doi":"10.1108/ir-07-2023-0156","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>This paper aim to solve the problem of low assembly success rate for 3c assembly lines designed based on classical control algorithms due to inevitable random disturbances and other factors,by incorporating intelligent algorithms into the assembly line, the assembly process can be extended to uncertain assembly scenarios.</p>\n<h3>Design/methodology/approach</h3>\n<p>This work proposes a reinforcement learning framework based on digital twins. First, the authors used Unity3D to build a simulation environment that matches the real scene and achieved data synchronization between the real environment and the simulation environment through the robot operating system. Then, the authors trained the reinforcement learning model in the simulation environment. Finally, by creating a digital twin environment, the authors transferred the skill learned from the simulation to the real environment and achieved stable algorithm deployment in real-world scenarios.</p>\n<h3>Findings</h3>\n<p>In this work, the authors have completed the transfer of skill-learning algorithms from virtual to real environments by establishing a digital twin environment. On the one hand, the experiment proves the progressiveness of the algorithm and the feasibility of the application of digital twins in reinforcement learning transfer. On the other hand, the experimental results also provide reference for the application of digital twins in 3C assembly scenarios.</p>\n<h3>Originality/value</h3>\n<p>In this work, the authors designed a new encoder structure in the simulation environment to encode image information, which improved the model’s perception of the environment. At the same time, the authors used the fixed strategy combined with the reinforcement learning strategy to learn skills, which improved the rate of convergence and stability of skills learning. Finally, the authors transferred the learned skills to the physical platform through digital twin technology and realized the safe operation of the flexible printed circuit assembly task.</p>","PeriodicalId":501389,"journal":{"name":"Industrial Robot","volume":"12 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial Robot","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ir-07-2023-0156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

This paper aim to solve the problem of low assembly success rate for 3c assembly lines designed based on classical control algorithms due to inevitable random disturbances and other factors,by incorporating intelligent algorithms into the assembly line, the assembly process can be extended to uncertain assembly scenarios.

Design/methodology/approach

This work proposes a reinforcement learning framework based on digital twins. First, the authors used Unity3D to build a simulation environment that matches the real scene and achieved data synchronization between the real environment and the simulation environment through the robot operating system. Then, the authors trained the reinforcement learning model in the simulation environment. Finally, by creating a digital twin environment, the authors transferred the skill learned from the simulation to the real environment and achieved stable algorithm deployment in real-world scenarios.

Findings

In this work, the authors have completed the transfer of skill-learning algorithms from virtual to real environments by establishing a digital twin environment. On the one hand, the experiment proves the progressiveness of the algorithm and the feasibility of the application of digital twins in reinforcement learning transfer. On the other hand, the experimental results also provide reference for the application of digital twins in 3C assembly scenarios.

Originality/value

In this work, the authors designed a new encoder structure in the simulation environment to encode image information, which improved the model’s perception of the environment. At the same time, the authors used the fixed strategy combined with the reinforcement learning strategy to learn skills, which improved the rate of convergence and stability of skills learning. Finally, the authors transferred the learned skills to the physical platform through digital twin technology and realized the safe operation of the flexible printed circuit assembly task.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用数字孪生增强Sim2real迁移的3C装配强化学习

本文旨在解决基于经典控制算法设计的3c装配线由于不可避免的随机干扰等因素导致装配成功率低的问题，通过在装配线中引入智能算法，将装配过程扩展到不确定的装配场景。设计/方法/方法本工作提出了一种基于数字孪生的强化学习框架。首先，利用Unity3D搭建与真实场景相匹配的仿真环境，通过机器人操作系统实现真实环境与仿真环境的数据同步。然后，在仿真环境下对强化学习模型进行训练。最后，通过创建数字孪生环境，作者将从模拟中学习到的技能转移到真实环境中，并在现实场景中实现了稳定的算法部署。在这项工作中，作者通过建立一个数字孪生环境，完成了技能学习算法从虚拟环境到现实环境的转移。实验一方面证明了算法的先进性和数字孪生在强化学习迁移中应用的可行性。另一方面，实验结果也为数字孪生在3C装配场景中的应用提供了参考。在这项工作中，作者在仿真环境中设计了一种新的编码器结构来编码图像信息，提高了模型对环境的感知能力。同时，采用固定策略结合强化学习策略进行技能学习，提高了技能学习的收敛速度和稳定性。最后，通过数字孪生技术将所学技能转移到物理平台上，实现柔性印制电路组装任务的安全运行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Industrial Robot

自引率

0.00%

发文量