首页 > 最新文献

Robotics: Science and Systems XIX最新文献

英文 中文
Centralized Model Predictive Control for Collaborative Loco-Manipulation 协同局部操作的集中式模型预测控制
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.050
Flavio De Vincenti, Stelian Coros
—In this work, we extend the model predictive control methods developed in the legged robotics literature to collaborative loco-manipulation settings. The systems we study entail a payload collectively carried by multiple quadruped robots equipped with a mechanical arm. We use a direct multiple shooting method to solve the resulting high-dimensional, optimal control problems for trajectories of ground reaction forces, manipulation wrenches, and stepping locations. To capture the dominant dynamics of the system, we model each agent and the shared payload as single rigid bodies. We demonstrate the versatility of our framework in a series of simulation experiments involving collaborative manipulation over challenging terrains.
在这项工作中,我们将腿机器人文献中开发的模型预测控制方法扩展到协作的局部操作设置。我们研究的系统需要由配备机械臂的多个四足机器人共同携带有效载荷。我们使用直接多次射击方法来解决地面反作用力、操纵扳手和步进位置轨迹的高维最优控制问题。为了捕获系统的主导动力学,我们将每个代理和共享负载建模为单个刚体。我们在一系列模拟实验中展示了我们框架的多功能性,这些实验涉及在具有挑战性的地形上进行协作操作。
{"title":"Centralized Model Predictive Control for Collaborative Loco-Manipulation","authors":"Flavio De Vincenti, Stelian Coros","doi":"10.15607/RSS.2023.XIX.050","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.050","url":null,"abstract":"—In this work, we extend the model predictive control methods developed in the legged robotics literature to collaborative loco-manipulation settings. The systems we study entail a payload collectively carried by multiple quadruped robots equipped with a mechanical arm. We use a direct multiple shooting method to solve the resulting high-dimensional, optimal control problems for trajectories of ground reaction forces, manipulation wrenches, and stepping locations. To capture the dominant dynamics of the system, we model each agent and the shared payload as single rigid bodies. We demonstrate the versatility of our framework in a series of simulation experiments involving collaborative manipulation over challenging terrains.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126701030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Co-optimization of Morphology and Behavior of Modular Robots via Hierarchical Deep Reinforcement Learning 基于层次深度强化学习的模块化机器人形态与行为协同优化
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.096
Jieqiang Sun, Meibao Yao, Xueming Xiao, Zhibing Xie, Bo Zheng
—Modular robots hold the promise of changing their shape and even dimension to adapt to various tasks and environments. To realize this superiority, it is essential to find the appropriate morphology and its corresponding behavior simultaneously to ensure optimality of the reconfigura- tion. However, achieving co-optimization is challenging because robotic configuration and motion are interactive and coupled with each other, as well as their optimization processes. To this end, we proposed a co-optimization framework based on hierarchical Deep Reinforcement Learning (DRL), consisting of a configuration model and a motion model based on the Twin Delayed Deep Deterministic policy gradient algorithm (TD3). The two network models update asynchronously with a shared reward to ensure co-optimality. We conduct simulations and experiments with the Webots platform to validate the proposed framework, and the preliminary results show that it yields high quality optimization schemes and thus allows modular robots to be more adaptive to dynamic and multi-task scenarios.
模块化机器人有望改变其形状甚至尺寸,以适应各种任务和环境。为了实现这一优势,必须同时找到合适的形态和相应的行为,以保证重构的最优性。然而,实现协同优化是具有挑战性的,因为机器人的配置和运动是相互作用的,相互耦合的,以及它们的优化过程。为此,我们提出了一个基于分层深度强化学习(DRL)的协同优化框架,包括一个配置模型和一个基于双延迟深度确定性策略梯度算法(TD3)的运动模型。这两个网络模型以共享奖励异步更新,以确保协同最优性。我们在Webots平台上进行了仿真和实验来验证所提出的框架,初步结果表明,它产生了高质量的优化方案,从而使模块化机器人能够更好地适应动态和多任务场景。
{"title":"Co-optimization of Morphology and Behavior of Modular Robots via Hierarchical Deep Reinforcement Learning","authors":"Jieqiang Sun, Meibao Yao, Xueming Xiao, Zhibing Xie, Bo Zheng","doi":"10.15607/RSS.2023.XIX.096","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.096","url":null,"abstract":"—Modular robots hold the promise of changing their shape and even dimension to adapt to various tasks and environments. To realize this superiority, it is essential to find the appropriate morphology and its corresponding behavior simultaneously to ensure optimality of the reconfigura- tion. However, achieving co-optimization is challenging because robotic configuration and motion are interactive and coupled with each other, as well as their optimization processes. To this end, we proposed a co-optimization framework based on hierarchical Deep Reinforcement Learning (DRL), consisting of a configuration model and a motion model based on the Twin Delayed Deep Deterministic policy gradient algorithm (TD3). The two network models update asynchronously with a shared reward to ensure co-optimality. We conduct simulations and experiments with the Webots platform to validate the proposed framework, and the preliminary results show that it yields high quality optimization schemes and thus allows modular robots to be more adaptive to dynamic and multi-task scenarios.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123663985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Predefined-Time Convergent Motion Control for Heterogeneous Continuum Robots 异构连续体机器人的预定义时间收敛运动控制
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.092
Ning Tan, YU Peng, Kai Huang
As research into continuum robots flourishes, there are more and more types of continuum robots, which require researchers to tirelessly design robot-specific motion control algorithms. Besides, the convergence time of control systems for continuum robots has received very little attention. In this paper, we propose a novel predefined-time convergent zeroing dynamics (PTCZD) model, which ensures that the associated error-monitoring function converges to zero in predefined-time. Based on the PTCZD model, we design an inverse kinematics solver and a state estimator for continuum robots, thereby obtaining a generic predefined-time convergent control method for heterogeneous continuum robots for the first time. Simulations and experiments based on cable-driven continuum robots and concentric tube continuum robots are performed to verify the efficacy, robustness and adaptability of the proposed control method. In addition, comparative studies are carried out to demonstrate its advantages against existing control methods for continuum robots.
随着连续体机器人研究的蓬勃发展,连续体机器人的种类越来越多,这就要求研究人员孜孜不倦地设计机器人专用的运动控制算法。此外,连续体机器人控制系统的收敛时间问题很少受到关注。在本文中,我们提出了一种新的预定义时间收敛归零动力学(PTCZD)模型,该模型保证了相关的误差监控函数在预定义时间内收敛于零。基于PTCZD模型,设计了连续体机器人的逆运动学解算器和状态估计器,从而首次获得了异构连续体机器人的通用预定义时间收敛控制方法。基于索驱动连续体机器人和同心管连续体机器人的仿真和实验验证了所提控制方法的有效性、鲁棒性和自适应性。此外,还进行了对比研究,以证明其相对于现有连续体机器人控制方法的优势。
{"title":"Predefined-Time Convergent Motion Control for Heterogeneous Continuum Robots","authors":"Ning Tan, YU Peng, Kai Huang","doi":"10.15607/RSS.2023.XIX.092","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.092","url":null,"abstract":"As research into continuum robots flourishes, there are more and more types of continuum robots, which require researchers to tirelessly design robot-specific motion control algorithms. Besides, the convergence time of control systems for continuum robots has received very little attention. In this paper, we propose a novel predefined-time convergent zeroing dynamics (PTCZD) model, which ensures that the associated error-monitoring function converges to zero in predefined-time. Based on the PTCZD model, we design an inverse kinematics solver and a state estimator for continuum robots, thereby obtaining a generic predefined-time convergent control method for heterogeneous continuum robots for the first time. Simulations and experiments based on cable-driven continuum robots and concentric tube continuum robots are performed to verify the efficacy, robustness and adaptability of the proposed control method. In addition, comparative studies are carried out to demonstrate its advantages against existing control methods for continuum robots.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121785894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ERASOR2: Instance-Aware Robust 3D Mapping of the Static World in Dynamic Scenes ERASOR2:动态场景中静态世界的实例感知鲁棒3D映射
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.067
Hyungtae Lim, Lucas Nunes, Benedikt Mersch, Xieyunali Chen, J. Behley, H. Myung, C. Stachniss
—A map of the environment is an essential component for robotic navigation. In the majority of cases, a map of the static part of the world is the basis for localization, planning, and navigation. However, dynamic objects that are presented in the scenes during mapping leave undesirable traces in the map, which can impede mobile robots from achieving successful robotic navigation. To remove the artifacts caused by dynamic objects in the map, we propose a novel instance-aware map building method. Our approach rejects dynamic points at an instance-level while preserving most static points by exploiting instance segmentation estimates. Furthermore, we propose effective ways to consider the erroneous estimates of instance segmentation, enabling our proposed method to be robust even under imprecise instance segmentation. As demonstrated in our experimental evaluation, our approach shows substantial performance increases in terms of both, the preservation of static points and rejection of dynamic points. Our code is available at https://github.com/url-kaist/ERASOR2 .
环境地图是机器人导航的重要组成部分。在大多数情况下,世界静态部分的地图是定位、规划和导航的基础。然而,在绘制过程中,场景中呈现的动态物体会在地图上留下不希望看到的痕迹,这可能会阻碍移动机器人实现成功的机器人导航。为了消除地图中动态对象造成的伪影,提出了一种新的基于实例的地图构建方法。我们的方法在实例级拒绝动态点,同时通过利用实例分割估计保留大多数静态点。此外,我们提出了有效的方法来考虑实例分割的错误估计,使我们提出的方法即使在不精确的实例分割下也具有鲁棒性。正如我们的实验评估所证明的那样,我们的方法在静态点的保存和动态点的拒绝两方面都显示出实质性的性能提高。我们的代码可在https://github.com/url-kaist/ERASOR2上获得。
{"title":"ERASOR2: Instance-Aware Robust 3D Mapping of the Static World in Dynamic Scenes","authors":"Hyungtae Lim, Lucas Nunes, Benedikt Mersch, Xieyunali Chen, J. Behley, H. Myung, C. Stachniss","doi":"10.15607/RSS.2023.XIX.067","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.067","url":null,"abstract":"—A map of the environment is an essential component for robotic navigation. In the majority of cases, a map of the static part of the world is the basis for localization, planning, and navigation. However, dynamic objects that are presented in the scenes during mapping leave undesirable traces in the map, which can impede mobile robots from achieving successful robotic navigation. To remove the artifacts caused by dynamic objects in the map, we propose a novel instance-aware map building method. Our approach rejects dynamic points at an instance-level while preserving most static points by exploiting instance segmentation estimates. Furthermore, we propose effective ways to consider the erroneous estimates of instance segmentation, enabling our proposed method to be robust even under imprecise instance segmentation. As demonstrated in our experimental evaluation, our approach shows substantial performance increases in terms of both, the preservation of static points and rejection of dynamic points. Our code is available at https://github.com/url-kaist/ERASOR2 .","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117227949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Demonstrating A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning 示范在公园散步:用无模型强化学习在20分钟内学会走路
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.056
Laura M. Smith, Ilya Kostrikov, S. Levine
—Deep reinforcement learning is a promising ap- proach to learning policies in unstructured environments. Due to its sample inefficiency, though, deep RL applications have primarily focused on simulated environments. In this work, we demonstrate that the recent advancements in machine learning algorithms and libraries combined with careful MDP formulation lead to learning quadruped locomotion in only 20 minutes in the real world. We evaluate our approach on several indoor and outdoor terrains that are known to be challenging for classical, model-based controllers and observe that the robot consistently learns a walking gait on all of these terrains. Finally, we evaluate our design decisions in a simulated environment. We provide videos of all real-world training and code to reproduce our results on our website: https://sites.google.com/berkeley.
深度强化学习是在非结构化环境中学习策略的一种很有前途的方法。然而,由于样本效率低下,深度强化学习应用主要集中在模拟环境上。在这项工作中,我们证明了机器学习算法和库的最新进展与仔细的MDP公式相结合,可以在现实世界中仅20分钟内学习四足动物的运动。我们在几个室内和室外地形上评估了我们的方法,这些地形已知对经典的基于模型的控制器具有挑战性,并观察到机器人在所有这些地形上始终学习行走步态。最后,我们在模拟环境中评估我们的设计决策。我们在我们的网站https://sites.google.com/berkeley上提供了所有真实世界训练的视频和代码来重现我们的结果。
{"title":"Demonstrating A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning","authors":"Laura M. Smith, Ilya Kostrikov, S. Levine","doi":"10.15607/RSS.2023.XIX.056","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.056","url":null,"abstract":"—Deep reinforcement learning is a promising ap- proach to learning policies in unstructured environments. Due to its sample inefficiency, though, deep RL applications have primarily focused on simulated environments. In this work, we demonstrate that the recent advancements in machine learning algorithms and libraries combined with careful MDP formulation lead to learning quadruped locomotion in only 20 minutes in the real world. We evaluate our approach on several indoor and outdoor terrains that are known to be challenging for classical, model-based controllers and observe that the robot consistently learns a walking gait on all of these terrains. Finally, we evaluate our design decisions in a simulated environment. We provide videos of all real-world training and code to reproduce our results on our website: https://sites.google.com/berkeley.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126519754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Self-Supervised Lidar Place Recognition in Overhead Imagery Using Unpaired Data 基于非配对数据的自监督激光雷达位置识别
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.098
T. Y. Tang, D. Martini
—As much as place recognition is crucial for navi- gation, mapping and collecting training ground truth, namely sensor data pairs across different locations, are costly and time- consuming. This paper tackles these by learning lidar place recognition on public overhead imagery and in a self-supervised fashion, with no need for paired lidar and overhead imagery data. We learn the cross-modal data comparison between lidar and overhead imagery with a multi-step framework. First, images are transformed into synthetic lidar data and a latent projection is learned. Next, we discover pseudo pairs of lidar and satellite data from unpaired and asynchronous sequences, and use them for training a final embedding space projection in a cross-modality place recognition framework. We train and test our approach on real data from various environments and show performances approaching a supervised method using paired data.
-尽管位置识别对导航至关重要,但绘制和收集训练场真相(即不同位置的传感器数据对)既昂贵又耗时。本文通过在公共架空图像上学习激光雷达位置识别并以自我监督的方式解决这些问题,而不需要配对的激光雷达和架空图像数据。我们用多步框架学习了激光雷达与架空图像的跨模态数据比较。首先,将图像转换为合成激光雷达数据,并学习潜在投影。接下来,我们从未配对和异步序列中发现激光雷达和卫星数据的伪对,并使用它们在交叉模态位置识别框架中训练最终的嵌入空间投影。我们在来自不同环境的真实数据上训练和测试了我们的方法,并展示了使用成对数据接近监督方法的性能。
{"title":"Self-Supervised Lidar Place Recognition in Overhead Imagery Using Unpaired Data","authors":"T. Y. Tang, D. Martini","doi":"10.15607/RSS.2023.XIX.098","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.098","url":null,"abstract":"—As much as place recognition is crucial for navi- gation, mapping and collecting training ground truth, namely sensor data pairs across different locations, are costly and time- consuming. This paper tackles these by learning lidar place recognition on public overhead imagery and in a self-supervised fashion, with no need for paired lidar and overhead imagery data. We learn the cross-modal data comparison between lidar and overhead imagery with a multi-step framework. First, images are transformed into synthetic lidar data and a latent projection is learned. Next, we discover pseudo pairs of lidar and satellite data from unpaired and asynchronous sequences, and use them for training a final embedding space projection in a cross-modality place recognition framework. We train and test our approach on real data from various environments and show performances approaching a supervised method using paired data.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130569004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Sampling-Based Approach for Heterogeneous Coalition Scheduling with Temporal Uncertainty 一种基于抽样的时间不确定性异构联盟调度方法
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.107
Andrew Messing, Jacopo Banfi, M. Stadler, Ethan Stump, H. Ravichandar, N. Roy, S. Hutchinson
—Scheduling algorithms for real-world heterogeneous multi-robot teams must be able to reason about temporal uncertainty in the world model in order to create plans that are tolerant to the risk of unexpected delays. To this end, we present a novel sampling-based risk-aware approach for solving Heterogeneous Coalition Scheduling with Temporal Uncertainty (HCSTU) problems, which does not require any assumptions regarding the specific underlying cause of the temporal uncertainty or the specific duration distributions. Our approach computes a schedule which obeys the temporal constraints of a small number of heuristically-selected sample scenarios by solving a Mixed-Integer Linear Program, along with an upper bound on the schedule execution time. Then, it uses a hypothesis testing method, the Sequential Probability Ratio Test, to provide a probabilistic guarantee that the upper bound on the execu- tion time will be respected for a user-specified risk tolerance. With extensive experiments, we demonstrate that our approach empirically respects the risk tolerance, and generates solutions of comparable or better quality than state-of-the-art approaches while being an order of magnitude faster to compute on average. Finally, we demonstrate how robust schedules generated by our approach can be incorporated as solutions to subproblems within the broader Simultaneous Task Allocation and Planning with Spatiotemporal Constraints problem to both guide and expedite the search for solutions of higher quality and lower risk.
-现实世界异构多机器人团队的调度算法必须能够推理世界模型中的时间不确定性,以便创建能够容忍意外延迟风险的计划。为此,我们提出了一种新的基于抽样的风险意识方法来解决具有时间不确定性的异构联盟调度(HCSTU)问题,该方法不需要对时间不确定性的特定潜在原因或特定持续时间分布进行任何假设。我们的方法通过求解一个混合整数线性规划来计算一个服从少量启发式选择的示例场景的时间约束的调度,以及调度执行时间的上界。然后,它使用假设检验方法,即序列概率比检验,提供了一个概率保证,即执行时间的上界将符合用户指定的风险容忍度。通过广泛的实验,我们证明了我们的方法在经验上尊重风险容忍度,并产生与最先进的方法相当或更好质量的解决方案,同时平均计算速度快一个数量级。最后,我们展示了由我们的方法生成的健壮时间表如何被纳入到更广泛的具有时空约束的同步任务分配和规划问题中的子问题的解决方案中,以指导和加速寻找更高质量和更低风险的解决方案。
{"title":"A Sampling-Based Approach for Heterogeneous Coalition Scheduling with Temporal Uncertainty","authors":"Andrew Messing, Jacopo Banfi, M. Stadler, Ethan Stump, H. Ravichandar, N. Roy, S. Hutchinson","doi":"10.15607/RSS.2023.XIX.107","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.107","url":null,"abstract":"—Scheduling algorithms for real-world heterogeneous multi-robot teams must be able to reason about temporal uncertainty in the world model in order to create plans that are tolerant to the risk of unexpected delays. To this end, we present a novel sampling-based risk-aware approach for solving Heterogeneous Coalition Scheduling with Temporal Uncertainty (HCSTU) problems, which does not require any assumptions regarding the specific underlying cause of the temporal uncertainty or the specific duration distributions. Our approach computes a schedule which obeys the temporal constraints of a small number of heuristically-selected sample scenarios by solving a Mixed-Integer Linear Program, along with an upper bound on the schedule execution time. Then, it uses a hypothesis testing method, the Sequential Probability Ratio Test, to provide a probabilistic guarantee that the upper bound on the execu- tion time will be respected for a user-specified risk tolerance. With extensive experiments, we demonstrate that our approach empirically respects the risk tolerance, and generates solutions of comparable or better quality than state-of-the-art approaches while being an order of magnitude faster to compute on average. Finally, we demonstrate how robust schedules generated by our approach can be incorporated as solutions to subproblems within the broader Simultaneous Task Allocation and Planning with Spatiotemporal Constraints problem to both guide and expedite the search for solutions of higher quality and lower risk.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"3 s4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113957527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SAR: Generalization of Physiological Dexterity via Synergistic Action Representation SAR:通过协同动作表征的生理灵巧的泛化
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.007
C. Berg, V. Caggiano, Vikash Kumar
—Learning effective continuous control policies in high-dimensional systems, including musculoskeletal agents, remains a significant challenge. Over the course of biological evolution, organisms have developed robust mechanisms for over-coming this complexity to learn highly sophisticated strategies for motor control. What accounts for this robust behavioral flexibility? Modular control via muscle synergies, i.e. coordinated muscle co-contractions, is considered to be one putative mechanism that enables organisms to learn muscle control in a simplified and generalizable action space. Drawing inspiration from this evolved motor control strategy, we use a physiologically accurate hand model to investigate whether leveraging a Synergistic Action Representation ( SAR ) acquired from simpler manipulation tasks improves learning and generalization on more complex tasks. We find that SAR -exploiting policies trained on a complex, 100- object randomized reorientation task significantly outperformed ( > 70 % success) baseline approaches ( < 20 % success). Notably, SAR -exploiting policies were also found to zero-shot generalize to thousands of unseen objects with out-of-domain size variations, while policies that did not adopt SAR failed to generalize. SAR also enabled significantly improved transfer learning on real-world objects. Finally, using a robotic manipulation task set and a full-body humanoid locomotion task, we establish the generality of SAR on broader high-dimensional control problems, achieving SOTA performance with an order of magnitude improved sample efficiency. To the best of our knowledge, this investigation is the first of its kind to present an end-to-end pipeline for discovering synergies and using this representation to learn high-dimensional continuous control across a wide diversity of tasks
-在高维系统(包括肌肉骨骼因子)中学习有效的连续控制策略仍然是一个重大挑战。在生物进化的过程中,生物已经发展出强大的机制来克服这种复杂性,以学习高度复杂的运动控制策略。是什么导致了这种强大的行为灵活性?通过肌肉协同作用的模块化控制,即协调的肌肉共同收缩,被认为是一种假定的机制,使生物体能够在简化和可推广的动作空间中学习肌肉控制。从这种进化的运动控制策略中获得灵感,我们使用生理上准确的手部模型来研究从更简单的操作任务中获得的协同动作表示(SAR)是否能提高更复杂任务的学习和泛化。我们发现,在复杂的100个对象随机重定向任务上训练的SAR利用策略明显优于基线方法(成功率> 70%)(成功率< 20%)。值得注意的是,利用SAR的策略也被发现可以零射击泛化到数千个具有域外大小变化的看不见的物体,而不采用SAR的策略则无法泛化。SAR还显著改善了对现实世界对象的迁移学习。最后,利用机器人操作任务集和全身人形运动任务,我们在更广泛的高维控制问题上建立了SAR的通用性,实现了SOTA性能,并将样本效率提高了一个数量级。据我们所知,这项研究是同类研究中首次提出端到端管道来发现协同作用,并使用这种表示来学习跨各种任务的高维连续控制
{"title":"SAR: Generalization of Physiological Dexterity via Synergistic Action Representation","authors":"C. Berg, V. Caggiano, Vikash Kumar","doi":"10.15607/RSS.2023.XIX.007","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.007","url":null,"abstract":"—Learning effective continuous control policies in high-dimensional systems, including musculoskeletal agents, remains a significant challenge. Over the course of biological evolution, organisms have developed robust mechanisms for over-coming this complexity to learn highly sophisticated strategies for motor control. What accounts for this robust behavioral flexibility? Modular control via muscle synergies, i.e. coordinated muscle co-contractions, is considered to be one putative mechanism that enables organisms to learn muscle control in a simplified and generalizable action space. Drawing inspiration from this evolved motor control strategy, we use a physiologically accurate hand model to investigate whether leveraging a Synergistic Action Representation ( SAR ) acquired from simpler manipulation tasks improves learning and generalization on more complex tasks. We find that SAR -exploiting policies trained on a complex, 100- object randomized reorientation task significantly outperformed ( > 70 % success) baseline approaches ( < 20 % success). Notably, SAR -exploiting policies were also found to zero-shot generalize to thousands of unseen objects with out-of-domain size variations, while policies that did not adopt SAR failed to generalize. SAR also enabled significantly improved transfer learning on real-world objects. Finally, using a robotic manipulation task set and a full-body humanoid locomotion task, we establish the generality of SAR on broader high-dimensional control problems, achieving SOTA performance with an order of magnitude improved sample efficiency. To the best of our knowledge, this investigation is the first of its kind to present an end-to-end pipeline for discovering synergies and using this representation to learn high-dimensional continuous control across a wide diversity of tasks","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126155431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adaptive Tracking Control of Dielectric Elastomer Soft Actuators with Viscoelastic Hysteresis Compensation 粘弹性滞后补偿的介电弹性体软执行器自适应跟踪控制
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.093
Yunhua Zhao, L. Wen
—This paper proposes a new adaptive control method with viscoelastic hysteresis compensation for high-precision tracking control of dielectric elastomer actuators (DEAs). A direct inverse feedforward compensator is constructed by using a modified Prandtl-Ishlinskii model for compensating hysteresis nonlinearities. The dynamics effects of DEAs and disturbances are coped with the adaptive inverse controller using filtered-x normalized least mean square algorithm. A series of real-time tracking experiments are carried out on a DEA made of commercial acrylic elastomers. The proposed control method achieves accurate tracking of various trajectories with the relative root-mean-square tracking error ranging from 1.37% to a maximum of 4.37% over the whole operating frequency range, and outperforms previously proposed methods in terms of accuracy. The excellent tracking results demonstrate the effectiveness of the developed control method for dielectric elastomer artificial muscles based soft actuators.
针对介电弹性体作动器的高精度跟踪控制,提出了一种粘弹性滞后补偿的自适应控制方法。利用改进的Prandtl-Ishlinskii模型构造了直接逆前馈补偿器,用于补偿滞后非线性。采用滤波-x归一化最小均方算法的自适应逆控制器来处理dea和扰动的动态影响。在商用丙烯酸弹性体的DEA上进行了一系列实时跟踪实验。在整个工作频率范围内,该控制方法实现了对各种轨迹的精确跟踪,相对均方根跟踪误差在1.37%到最大4.37%之间,在精度上优于已有方法。良好的跟踪效果证明了所开发的介电弹性体人造肌肉软执行器控制方法的有效性。
{"title":"Adaptive Tracking Control of Dielectric Elastomer Soft Actuators with Viscoelastic Hysteresis Compensation","authors":"Yunhua Zhao, L. Wen","doi":"10.15607/RSS.2023.XIX.093","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.093","url":null,"abstract":"—This paper proposes a new adaptive control method with viscoelastic hysteresis compensation for high-precision tracking control of dielectric elastomer actuators (DEAs). A direct inverse feedforward compensator is constructed by using a modified Prandtl-Ishlinskii model for compensating hysteresis nonlinearities. The dynamics effects of DEAs and disturbances are coped with the adaptive inverse controller using filtered-x normalized least mean square algorithm. A series of real-time tracking experiments are carried out on a DEA made of commercial acrylic elastomers. The proposed control method achieves accurate tracking of various trajectories with the relative root-mean-square tracking error ranging from 1.37% to a maximum of 4.37% over the whole operating frequency range, and outperforms previously proposed methods in terms of accuracy. The excellent tracking results demonstrate the effectiveness of the developed control method for dielectric elastomer artificial muscles based soft actuators.","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129464726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects ROSE:面向物体通用处理的旋转挤压机器人抓取器
Pub Date : 2023-07-10 DOI: 10.15607/RSS.2023.XIX.090
S. T. Bui, Shinya Kawano, V. A. Ho
—Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects’ shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other hand, integrating soft materials into the gripper’s design may tolerate the above uncertainties and reduce complexity in control. In this paper, we introduce ROSE , a novel soft gripper that can embrace the object and squeeze it by buckling a funnel-liked thin-walled soft membrane around the object by simple rotation of the base. Thanks to this design, ROSE hand can adapt to a wide range of objects that can fit in the funnel and handle with gentle gripping force. Regardless of this, ROSE can generate a high lift force (up to 33kgf) while significantly reducing the normal pressure on the gripped objects. In our experiment, a 198g ROSE can be integrated into a robot arm with a single actuation and successfully lift various types of objects, even after 400,000 trials. The embracing mechanism helps reduce the dependence of friction between the object and the membrane, as ROSE could pick up a chicken egg submerged inside an olive oil tank. We also report a feasible design for equipping the ROSE hand with tactile sensing while appealing to the scalability of the design to fit a wide range of objects. Video: https://youtu.be/E1wAI09LaoY
-机器人手/抓取器现在不局限于生产线;相反,它们被广泛应用于杂乱的环境,如餐馆、农场和仓库。在这种情况下,他们需要处理被抓物体的形状、姿势、表面和材料特性的高度不确定性,这需要复杂的感知和决策过程的整合。另一方面,将软材料集成到夹持器的设计中可以容忍上述不确定性并降低控制的复杂性。在本文中,我们介绍了一种新型的软夹持器ROSE,它可以通过简单的旋转底座,使物体周围的漏斗状薄壁软膜弯曲,从而抱住物体并挤压物体。由于这种设计,ROSE手可以适应各种各样的物体,这些物体可以放入漏斗中,并以温和的握力处理。尽管如此,ROSE可以产生高升力(高达33kgf),同时显著降低被抓物体的正常压力。在我们的实验中,一个198克的ROSE可以通过一个单一的驱动集成到一个机器人手臂中,并且可以成功地举起各种类型的物体,即使经过40万次试验。拥抱机制有助于减少物体与膜之间的摩擦依赖,就像ROSE可以捡起浸在橄榄油罐里的鸡蛋一样。我们还报告了一个可行的设计,为ROSE手配备触觉感应,同时吸引设计的可扩展性,以适应广泛的对象。的视频:https://youtu.be/E1wAI09LaoY
{"title":"ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects","authors":"S. T. Bui, Shinya Kawano, V. A. Ho","doi":"10.15607/RSS.2023.XIX.090","DOIUrl":"https://doi.org/10.15607/RSS.2023.XIX.090","url":null,"abstract":"—Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects’ shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other hand, integrating soft materials into the gripper’s design may tolerate the above uncertainties and reduce complexity in control. In this paper, we introduce ROSE , a novel soft gripper that can embrace the object and squeeze it by buckling a funnel-liked thin-walled soft membrane around the object by simple rotation of the base. Thanks to this design, ROSE hand can adapt to a wide range of objects that can fit in the funnel and handle with gentle gripping force. Regardless of this, ROSE can generate a high lift force (up to 33kgf) while significantly reducing the normal pressure on the gripped objects. In our experiment, a 198g ROSE can be integrated into a robot arm with a single actuation and successfully lift various types of objects, even after 400,000 trials. The embracing mechanism helps reduce the dependence of friction between the object and the membrane, as ROSE could pick up a chicken egg submerged inside an olive oil tank. We also report a feasible design for equipping the ROSE hand with tactile sensing while appealing to the scalability of the design to fit a wide range of objects. Video: https://youtu.be/E1wAI09LaoY","PeriodicalId":248720,"journal":{"name":"Robotics: Science and Systems XIX","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116933733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Robotics: Science and Systems XIX
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1