SAR: Generalization of Physiological Dexterity via Synergistic Action Representation

C. Berg, V. Caggiano, Vikash Kumar
{"title":"SAR: Generalization of Physiological Dexterity via Synergistic Action Representation","authors":"C. Berg, V. Caggiano, Vikash Kumar","doi":"10.15607/RSS.2023.XIX.007","DOIUrl":null,"url":null,"abstract":"—Learning effective continuous control policies in high-dimensional systems, including musculoskeletal agents, remains a significant challenge. Over the course of biological evolution, organisms have developed robust mechanisms for over-coming this complexity to learn highly sophisticated strategies for motor control. What accounts for this robust behavioral flexibility? Modular control via muscle synergies, i.e. coordinated muscle co-contractions, is considered to be one putative mechanism that enables organisms to learn muscle control in a simplified and generalizable action space. Drawing inspiration from this evolved motor control strategy, we use a physiologically accurate hand model to investigate whether leveraging a Synergistic Action Representation ( SAR ) acquired from simpler manipulation tasks improves learning and generalization on more complex tasks. We find that SAR -exploiting policies trained on a complex, 100- object randomized reorientation task significantly outperformed ( > 70 % success) baseline approaches ( < 20 % success). Notably, SAR -exploiting policies were also found to zero-shot generalize to thousands of unseen objects with out-of-domain size variations, while policies that did not adopt SAR failed to generalize. SAR also enabled significantly improved transfer learning on real-world objects. Finally, using a robotic manipulation task set and a full-body humanoid locomotion task, we establish the generality of SAR on broader high-dimensional control problems, achieving SOTA performance with an order of magnitude improved sample efficiency. To the best of our knowledge, this investigation is the first of its kind to present an end-to-end pipeline for discovering synergies and using this representation to learn high-dimensional continuous control across a wide diversity of tasks","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics: Science and Systems XIX","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15607/RSS.2023.XIX.007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

—Learning effective continuous control policies in high-dimensional systems, including musculoskeletal agents, remains a significant challenge. Over the course of biological evolution, organisms have developed robust mechanisms for over-coming this complexity to learn highly sophisticated strategies for motor control. What accounts for this robust behavioral flexibility? Modular control via muscle synergies, i.e. coordinated muscle co-contractions, is considered to be one putative mechanism that enables organisms to learn muscle control in a simplified and generalizable action space. Drawing inspiration from this evolved motor control strategy, we use a physiologically accurate hand model to investigate whether leveraging a Synergistic Action Representation ( SAR ) acquired from simpler manipulation tasks improves learning and generalization on more complex tasks. We find that SAR -exploiting policies trained on a complex, 100- object randomized reorientation task significantly outperformed ( > 70 % success) baseline approaches ( < 20 % success). Notably, SAR -exploiting policies were also found to zero-shot generalize to thousands of unseen objects with out-of-domain size variations, while policies that did not adopt SAR failed to generalize. SAR also enabled significantly improved transfer learning on real-world objects. Finally, using a robotic manipulation task set and a full-body humanoid locomotion task, we establish the generality of SAR on broader high-dimensional control problems, achieving SOTA performance with an order of magnitude improved sample efficiency. To the best of our knowledge, this investigation is the first of its kind to present an end-to-end pipeline for discovering synergies and using this representation to learn high-dimensional continuous control across a wide diversity of tasks
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SAR:通过协同动作表征的生理灵巧的泛化
-在高维系统(包括肌肉骨骼因子)中学习有效的连续控制策略仍然是一个重大挑战。在生物进化的过程中,生物已经发展出强大的机制来克服这种复杂性,以学习高度复杂的运动控制策略。是什么导致了这种强大的行为灵活性?通过肌肉协同作用的模块化控制,即协调的肌肉共同收缩,被认为是一种假定的机制,使生物体能够在简化和可推广的动作空间中学习肌肉控制。从这种进化的运动控制策略中获得灵感,我们使用生理上准确的手部模型来研究从更简单的操作任务中获得的协同动作表示(SAR)是否能提高更复杂任务的学习和泛化。我们发现,在复杂的100个对象随机重定向任务上训练的SAR利用策略明显优于基线方法(成功率> 70%)(成功率< 20%)。值得注意的是,利用SAR的策略也被发现可以零射击泛化到数千个具有域外大小变化的看不见的物体,而不采用SAR的策略则无法泛化。SAR还显著改善了对现实世界对象的迁移学习。最后,利用机器人操作任务集和全身人形运动任务,我们在更广泛的高维控制问题上建立了SAR的通用性,实现了SOTA性能,并将样本效率提高了一个数量级。据我们所知,这项研究是同类研究中首次提出端到端管道来发现协同作用,并使用这种表示来学习跨各种任务的高维连续控制
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Sampling-Based Approach for Heterogeneous Coalition Scheduling with Temporal Uncertainty ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects ERASOR2: Instance-Aware Robust 3D Mapping of the Static World in Dynamic Scenes Autonomous Navigation, Mapping and Exploration with Gaussian Processes Predefined-Time Convergent Motion Control for Heterogeneous Continuum Robots
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1