学习柔性机械臂控制器并测试其对新观测、动力学和任务的泛化

2023 IEEE International Conference on Soft Robotics (RoboSoft) Pub Date : 2023-04-03 DOI:10.1109/RoboSoft55895.2023.10121988

Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico

{"title":"学习柔性机械臂控制器并测试其对新观测、动力学和任务的泛化","authors":"Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico","doi":"10.1109/RoboSoft55895.2023.10121988","DOIUrl":null,"url":null,"abstract":"Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.","PeriodicalId":250981,"journal":{"name":"2023 IEEE International Conference on Soft Robotics (RoboSoft)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning a Controller for Soft Robotic Arms and Testing its Generalization to New Observations, Dynamics, and Tasks\",\"authors\":\"Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico\",\"doi\":\"10.1109/RoboSoft55895.2023.10121988\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.\",\"PeriodicalId\":250981,\"journal\":{\"name\":\"2023 IEEE International Conference on Soft Robotics (RoboSoft)\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Soft Robotics (RoboSoft)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RoboSoft55895.2023.10121988\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Soft Robotics (RoboSoft)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RoboSoft55895.2023.10121988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

最近，利用软机器人机械模型的基于学习的控制器已经显示出有希望的结果。本文提出了一种基于近端策略优化的深度强化学习的气动软机械臂动态轨迹跟踪闭环控制器。利用软机器人的动态Cosserat棒模型对控制策略进行仿真训练。学习控制器的泛化能力对于在现实世界中成功部署是至关重要的，特别是当遇到与训练环境不同的场景时。我们在四个测试中评估了控制器的计算机泛化能力。第一个测试涉及到轨迹的动态跟踪，这些轨迹在形状和速度方面与训练数据有很大的不同。其次，我们评估了控制器对动态跟踪的永久外部端点力的鲁棒性。对于跟踪任务，还评估了对类似材料的泛化。最后，我们将控制策略转移到用末端执行器拦截移动物体，而无需重新训练。该学习控制策略在四次测试中均显示出良好的泛化能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Learning a Controller for Soft Robotic Arms and Testing its Generalization to New Observations, Dynamics, and Tasks

Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE International Conference on Soft Robotics (RoboSoft)

自引率

0.00%

发文量