学习柔性机械臂控制器并测试其对新观测、动力学和任务的泛化

Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico
{"title":"学习柔性机械臂控制器并测试其对新观测、动力学和任务的泛化","authors":"Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico","doi":"10.1109/RoboSoft55895.2023.10121988","DOIUrl":null,"url":null,"abstract":"Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.","PeriodicalId":250981,"journal":{"name":"2023 IEEE International Conference on Soft Robotics (RoboSoft)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning a Controller for Soft Robotic Arms and Testing its Generalization to New Observations, Dynamics, and Tasks\",\"authors\":\"Carlo Alessi, H. Hauser, A. Lucantonio, E. Falotico\",\"doi\":\"10.1109/RoboSoft55895.2023.10121988\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.\",\"PeriodicalId\":250981,\"journal\":{\"name\":\"2023 IEEE International Conference on Soft Robotics (RoboSoft)\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Soft Robotics (RoboSoft)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RoboSoft55895.2023.10121988\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Soft Robotics (RoboSoft)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RoboSoft55895.2023.10121988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

最近,利用软机器人机械模型的基于学习的控制器已经显示出有希望的结果。本文提出了一种基于近端策略优化的深度强化学习的气动软机械臂动态轨迹跟踪闭环控制器。利用软机器人的动态Cosserat棒模型对控制策略进行仿真训练。学习控制器的泛化能力对于在现实世界中成功部署是至关重要的,特别是当遇到与训练环境不同的场景时。我们在四个测试中评估了控制器的计算机泛化能力。第一个测试涉及到轨迹的动态跟踪,这些轨迹在形状和速度方面与训练数据有很大的不同。其次,我们评估了控制器对动态跟踪的永久外部端点力的鲁棒性。对于跟踪任务,还评估了对类似材料的泛化。最后,我们将控制策略转移到用末端执行器拦截移动物体,而无需重新训练。该学习控制策略在四次测试中均显示出良好的泛化能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Learning a Controller for Soft Robotic Arms and Testing its Generalization to New Observations, Dynamics, and Tasks
Recently, learning-based controllers that leverage mechanical models of soft robots have shown promising results. This paper presents a closed-loop controller for dynamic trajectory tracking with a pneumatic soft robotic arm learned via Deep Reinforcement Learning using Proximal Policy Optimization. The control policy was trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. The generalization capabilities of learned controllers are vital for successful deployment in the real world, especially when the encountered scenarios differ from the training environment. We assessed the generalization capabilities of the controller in silico for four tests. The first test involved the dynamic tracking of trajectories that differ significantly in shape and velocity profiles from the training data. Second, we evaluated the robustness of the controller to perpetual external end-point forces for dynamic tracking. For tracking tasks, it was also assessed the generalization to similar materials. Finally, we transferred the control policy without retraining to intercept a moving object with the end-effector. The learned control policy has shown good generalization capabilities in all four tests.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Modular Bio-inspired Robotic Hand with High Sensitivity Sensorizing a Compression Sleeve for Continuous Pressure Monitoring and Lymphedema Treatment Using Pneumatic or Resistive Sensors Fabrication and Characterization of a Passive Variable Stiffness Joint based on Shear Thickening Fluids A Soft Wearable Robot to Support Scapular Adduction and Abduction for Respiratory Rehabilitation Design of 3D-Printed Continuum Robots Using Topology Optimized Compliant Joints
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1