首页 > 最新文献

International Journal of Robotics Research最新文献

英文 中文
Frequency modulation of body waves to improve performance of sidewinding robots 体波频率调制提高侧绕机器人性能
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-08-12 DOI: 10.1177/02783649211037715
Baxi Chong, Tianyu Wang, Jennifer M. Rieser, Bo Lin, Abdul Kaba, Grigoriy Blekherman, H. Choset, D. Goldman
Sidewinding is a form of locomotion executed by certain snakes and has been reconstructed in limbless robots; the gait is beneficial because it is effective in diverse terrestrial environments. Sidewinding gaits are generated by coordination of horizontal and vertical traveling waves of body undulation: the horizontal wave largely sets the direction of sidewinding with respect to the body frame while the vertical traveling wave largely determines the contact pattern between the body and the environment. When the locomotor’s center of mass leaves the supporting polygon formed by the contact pattern, undesirable locomotor behaviors (such as unwanted turning or unstable oscillation of the body) can occur. In this article, we develop an approach to generate desired translation and turning by modulating the vertical wave. These modulations alter the distribution of body–environment contact patches and can stabilize configurations that were previously statically unstable. The approach first identifies the spatial frequency of the vertical wave that statically stabilizes the locomotor for a given horizontal wave. Then, using geometric mechanics tools, we design the coordination between body waves that produces the desired translation or rotation. We demonstrate the effectiveness of our technique in numerical simulations and on experiments with a 16-joint limbless robot locomoting on flat hard ground. Our scheme broadens the range of movements and behaviors accessible to sidewinding locomotors at low speeds, which can lead to limbless systems capable of traversing diverse terrain stably and/or rapidly.
侧绕是由某些蛇执行的一种运动形式,已在无肢机器人中重建;步态是有益的,因为它在不同的陆地环境中是有效的。侧旋步态是由身体起伏的水平和垂直行波协调产生的:水平波在很大程度上决定了侧旋相对于身体框架的方向,而垂直行波在很大程度上将决定了身体与环境之间的接触模式。当运动的质心离开由接触模式形成的支撑多边形时,可能会发生不希望的运动行为(如身体不必要的转动或不稳定的振荡)。在本文中,我们开发了一种通过调制垂直波来产生所需平移和转向的方法。这些调节改变了身体-环境接触贴片的分布,并可以稳定以前静态不稳定的配置。该方法首先识别垂直波的空间频率,该频率对于给定的水平波静态稳定运动。然后,使用几何力学工具,我们设计体波之间的协调,以产生所需的平移或旋转。我们在数值模拟和16关节无肢机器人在平坦坚硬地面上运动的实验中证明了我们技术的有效性。我们的方案扩大了侧绕机车在低速下可以进行的运动和行为的范围,这可以导致无肢系统能够稳定和/或快速穿越不同的地形。
{"title":"Frequency modulation of body waves to improve performance of sidewinding robots","authors":"Baxi Chong, Tianyu Wang, Jennifer M. Rieser, Bo Lin, Abdul Kaba, Grigoriy Blekherman, H. Choset, D. Goldman","doi":"10.1177/02783649211037715","DOIUrl":"https://doi.org/10.1177/02783649211037715","url":null,"abstract":"Sidewinding is a form of locomotion executed by certain snakes and has been reconstructed in limbless robots; the gait is beneficial because it is effective in diverse terrestrial environments. Sidewinding gaits are generated by coordination of horizontal and vertical traveling waves of body undulation: the horizontal wave largely sets the direction of sidewinding with respect to the body frame while the vertical traveling wave largely determines the contact pattern between the body and the environment. When the locomotor’s center of mass leaves the supporting polygon formed by the contact pattern, undesirable locomotor behaviors (such as unwanted turning or unstable oscillation of the body) can occur. In this article, we develop an approach to generate desired translation and turning by modulating the vertical wave. These modulations alter the distribution of body–environment contact patches and can stabilize configurations that were previously statically unstable. The approach first identifies the spatial frequency of the vertical wave that statically stabilizes the locomotor for a given horizontal wave. Then, using geometric mechanics tools, we design the coordination between body waves that produces the desired translation or rotation. We demonstrate the effectiveness of our technique in numerical simulations and on experiments with a 16-joint limbless robot locomoting on flat hard ground. Our scheme broadens the range of movements and behaviors accessible to sidewinding locomotors at low speeds, which can lead to limbless systems capable of traversing diverse terrain stably and/or rapidly.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44905344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Sequential robot imitation learning from observations 从观察中学习顺序机器人模仿
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-08-06 DOI: 10.1177/02783649211032721
A. Tanwani, Andy Yan, Jonathan Lee, S. Calinon, Ken Goldberg
This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.
本文提出了一个在机器人模仿学习演示中学习序列结构的框架。我们首先提出了一组任务参数化的隐半马尔可夫模型,该模型从演示的轨迹中提取不变的分段(也称为子目标或选项),并使用线性二次跟踪控制器最优地跟踪模型中的采样状态序列。然后,我们将概念扩展到从视觉观察中学习不变片段,这些片段被排列在一起用于机器人模仿。我们提出了Motion2Vec,它通过最小化暹罗网络中的度量学习损失来学习深度嵌入空间:来自同一动作片段的图像被拉到一起,同时被推离其他片段的随机采样图像,并且时间对比损失用于保持图像的时间顺序。使用递归神经网络对训练后的嵌入进行分割,然后用于解码机器人的末端执行器姿态。我们首先展示了它在Baxter机器人的拾取和放置任务中的应用,同时仅从四个动觉演示中避免了移动障碍,然后从JIGSAWS数据集的公开缝合视频中模仿了最先进的85缝合任务。5%的分割准确率和0。测试装置上每次观测的位置误差为94厘米。
{"title":"Sequential robot imitation learning from observations","authors":"A. Tanwani, Andy Yan, Jonathan Lee, S. Calinon, Ken Goldberg","doi":"10.1177/02783649211032721","DOIUrl":"https://doi.org/10.1177/02783649211032721","url":null,"abstract":"This paper presents a framework to learn the sequential structure in the demonstrations for robot imitation learning. We first present a family of task-parameterized hidden semi-Markov models that extracts invariant segments (also called sub-goals or options) from demonstrated trajectories, and optimally follows the sampled sequence of states from the model with a linear quadratic tracking controller. We then extend the concept to learning invariant segments from visual observations that are sequenced together for robot imitation. We present Motion2Vec that learns a deep embedding space by minimizing a metric learning loss in a Siamese network: images from the same action segment are pulled together while being pushed away from randomly sampled images of other segments, and a time contrastive loss is used to preserve the temporal ordering of the images. The trained embeddings are segmented with a recurrent neural network, and subsequently used for decoding the end-effector pose of the robot. We first show its application to a pick-and-place task with the Baxter robot while avoiding a moving obstacle from four kinesthetic demonstrations only, followed by suturing task imitation from publicly available suturing videos of the JIGSAWS dataset with state-of-the-art 85 . 5 % segmentation accuracy and 0 . 94 cm error in position per observation on the test set.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/02783649211032721","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43109234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design of multirotor aerial vehicles: A taxonomy based on input allocation 多旋翼飞行器设计:一种基于输入分配的分类法
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-08-01 DOI: 10.1177/02783649211025998
Mahmoud Hamandi, Federico Usai, Quentin Sablé, Nicolas Staub, M. Tognon, A. Franchi
This paper reviews the effect of multirotor aerial vehicle designs on their abilities in terms of tasks and system properties. We propose a general taxonomy to characterize and describe multirotor aerial vehicles and their designs, which we apply exhaustively on the vast literature available. Thanks to the systematic characterization of the designs, we exhibit groups of designs having the same abilities in terms of achievable tasks and system properties. In particular, we organize the literature review based on the number of atomic actuation units and we discuss global properties arising from their choice and spatial distribution in the mechanical designs. Finally, we provide a discussion on the common traits of the designs found in the literature and the main open and future problems.
本文从任务和系统特性的角度综述了多旋翼飞行器设计对其能力的影响。我们提出了一个通用的分类法来表征和描述多旋翼飞行器及其设计,我们在大量可用的文献中详尽地应用了该分类法。由于设计的系统化特征,我们展示了在可实现任务和系统特性方面具有相同能力的设计组。特别是,我们根据原子驱动单元的数量组织了文献综述,并讨论了机械设计中原子驱动单元选择和空间分布产生的全局特性。最后,我们讨论了文献中发现的设计的共同特点,以及开放和未来的主要问题。
{"title":"Design of multirotor aerial vehicles: A taxonomy based on input allocation","authors":"Mahmoud Hamandi, Federico Usai, Quentin Sablé, Nicolas Staub, M. Tognon, A. Franchi","doi":"10.1177/02783649211025998","DOIUrl":"https://doi.org/10.1177/02783649211025998","url":null,"abstract":"This paper reviews the effect of multirotor aerial vehicle designs on their abilities in terms of tasks and system properties. We propose a general taxonomy to characterize and describe multirotor aerial vehicles and their designs, which we apply exhaustively on the vast literature available. Thanks to the systematic characterization of the designs, we exhibit groups of designs having the same abilities in terms of achievable tasks and system properties. In particular, we organize the literature review based on the number of atomic actuation units and we discuss global properties arising from their choice and spatial distribution in the mechanical designs. Finally, we provide a discussion on the common traits of the designs found in the literature and the main open and future problems.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/02783649211025998","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45057574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics 贝叶斯控制器融合:在机器人深度强化学习中利用控制先验
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-07-21 DOI: 10.1177/02783649231167210
Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, N. Sunderhauf
We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable but suboptimal control priors exist for many tasks, but RL from scratch remains unsafe and data-inefficient. By fusing uncertainty-aware distributional outputs from each system, BCF arbitrates control between them, exploiting their respective strengths. We study BCF on two real-world robotics tasks involving navigation in a vast and long-horizon environment, and a complex reaching task that involves manipulability maximisation. For both these domains, simple handcrafted controllers exist that can solve the task at hand in a risk-averse manner but do not necessarily exhibit the optimal solution given limitations in analytical modelling, controller miscalibration and task variation. As exploration is naturally guided by the prior in the early stages of training, BCF accelerates learning, while substantially improving beyond the performance of the control prior, as the policy gains more experience. More importantly, given the risk-aversity of the control prior, BCF ensures safe exploration and deployment, where the control prior naturally dominates the action distribution in states unknown to the policy. We additionally show BCF’s applicability to the zero-shot sim-to-real setting and its ability to deal with out-of-distribution states in the real world. BCF is a promising approach towards combining the complementary strengths of deep RL and traditional robotic control, surpassing what either can achieve independently. The code and supplementary video material are made publicly available at https://krishanrana.github.io/bcf.
我们提出了贝叶斯控制器融合(BCF):一种混合控制策略,结合了传统手工制作控制器和无模型深度强化学习(RL)的优势。BCF在机器人领域蓬勃发展,许多任务都存在可靠但次优的控制先验,但从头开始的强化学习仍然不安全且数据效率低下。通过融合每个系统的不确定性感知分布输出,BCF在它们之间进行仲裁控制,利用它们各自的优势。我们在两个现实世界的机器人任务上研究了BCF,其中包括在广阔和长期环境中的导航,以及涉及可操作性最大化的复杂到达任务。对于这两个领域,存在简单的手工制作控制器,可以以规避风险的方式解决手头的任务,但由于分析建模,控制器错误校准和任务变化的限制,不一定表现出最佳解决方案。由于在训练的早期阶段,探索自然是由先验引导的,因此BCF加速了学习,同时随着策略获得更多的经验,大大提高了控制先验的性能。更重要的是,考虑到控制先验的风险厌恶性,BCF确保了安全的探索和部署,其中控制先验自然支配着策略未知状态下的行动分布。此外,我们还展示了BCF对零射击模拟到真实设置的适用性,以及它在现实世界中处理分布外状态的能力。BCF是一种很有前途的方法,可以将深度强化学习和传统机器人控制的互补优势结合起来,超越任何一种单独实现的能力。代码和补充视频资料可在https://krishanrana.github.io/bcf上公开获取。
{"title":"Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics","authors":"Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, N. Sunderhauf","doi":"10.1177/02783649231167210","DOIUrl":"https://doi.org/10.1177/02783649231167210","url":null,"abstract":"We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable but suboptimal control priors exist for many tasks, but RL from scratch remains unsafe and data-inefficient. By fusing uncertainty-aware distributional outputs from each system, BCF arbitrates control between them, exploiting their respective strengths. We study BCF on two real-world robotics tasks involving navigation in a vast and long-horizon environment, and a complex reaching task that involves manipulability maximisation. For both these domains, simple handcrafted controllers exist that can solve the task at hand in a risk-averse manner but do not necessarily exhibit the optimal solution given limitations in analytical modelling, controller miscalibration and task variation. As exploration is naturally guided by the prior in the early stages of training, BCF accelerates learning, while substantially improving beyond the performance of the control prior, as the policy gains more experience. More importantly, given the risk-aversity of the control prior, BCF ensures safe exploration and deployment, where the control prior naturally dominates the action distribution in states unknown to the policy. We additionally show BCF’s applicability to the zero-shot sim-to-real setting and its ability to deal with out-of-distribution states in the real world. BCF is a promising approach towards combining the complementary strengths of deep RL and traditional robotic control, surpassing what either can achieve independently. The code and supplementary video material are made publicly available at https://krishanrana.github.io/bcf.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65097934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Motion planning by learning the solution manifold in trajectory optimization 学习轨迹优化解流形的运动规划
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-07-13 DOI: 10.1177/02783649211044405
Takayuki Osa
The objective function used in trajectory optimization is often non-convex and can have an infinite set of local optima. In such cases, there are diverse solutions to perform a given task. Although there are a few methods to find multiple solutions for motion planning, they are limited to generating a finite set of solutions. To address this issue, we present an optimization method that learns an infinite set of solutions in trajectory optimization. In our framework, diverse solutions are obtained by learning latent representations of solutions. Our approach can be interpreted as training a deep generative model of collision-free trajectories for motion planning. The experimental results indicate that the trained model represents an infinite set of homotopic solutions for motion planning problems.
轨迹优化中使用的目标函数通常是非凸的,并且可以具有无限组局部最优解。在这种情况下,有不同的解决方案来执行给定的任务。尽管有几种方法可以找到运动规划的多个解,但它们仅限于生成有限的解集。为了解决这个问题,我们提出了一种在轨迹优化中学习无限组解的优化方法。在我们的框架中,通过学习解的潜在表示来获得不同的解。我们的方法可以被解释为训练用于运动规划的无碰撞轨迹的深度生成模型。实验结果表明,训练后的模型代表了运动规划问题的一组无穷多的同位解。
{"title":"Motion planning by learning the solution manifold in trajectory optimization","authors":"Takayuki Osa","doi":"10.1177/02783649211044405","DOIUrl":"https://doi.org/10.1177/02783649211044405","url":null,"abstract":"The objective function used in trajectory optimization is often non-convex and can have an infinite set of local optima. In such cases, there are diverse solutions to perform a given task. Although there are a few methods to find multiple solutions for motion planning, they are limited to generating a finite set of solutions. To address this issue, we present an optimization method that learns an infinite set of solutions in trajectory optimization. In our framework, diverse solutions are obtained by learning latent representations of solutions. Our approach can be interpreted as training a deep generative model of collision-free trajectories for motion planning. The experimental results indicate that the trained model represents an infinite set of homotopic solutions for motion planning problems.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48450370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Physical interaction as communication: Learning robot objectives online from human corrections 作为交流的物理交互:从人类纠正中在线学习机器人目标
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-07-06 DOI: 10.1177/02783649211050958
Dylan P. Losey, Andrea V. Bajcsy, M. O'Malley, A. Dragan
When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.
当机器人在人类旁边执行任务时,物理交互是不可避免的:人类可能会推、拉、扭或引导机器人。目前的技术将这些相互作用视为干扰,机器人应该拒绝或避免。在最好的情况下,这些机器人在人类互动时能安全地做出反应;但在人类放手之后,这些机器人就会恢复原来的行为。我们认识到物理人机交互(pHRI)通常是有意的:人类故意干预,因为机器人没有正确地完成任务。在本文中,我们认为,当pHRI是有意的,它也可以提供信息:机器人可以利用交互来学习如何完成当前任务的其余部分,即使在人离开之后。我们将pHRI形式化为一个动态系统,其中人类在脑海中有一个目标函数,他们希望机器人优化,但机器人不能直接访问这个目标的参数:它们是人类内部的。在我们提出的框架内,人类的互动变成了对真实目标的观察。我们引入近似值来实时学习和响应pHRI。我们认识到并非所有的人类纠正都是完美的:用户经常与机器人进行嘈杂的交互,因此我们通过减少意外学习来提高机器人从pHRI学习的效率。最后,我们对机器人机械手进行了仿真和用户研究,以比较我们提出的方法与最先进的方法。我们的研究结果表明,从pHRI中学习可以提高任务绩效和人类满意度。
{"title":"Physical interaction as communication: Learning robot objectives online from human corrections","authors":"Dylan P. Losey, Andrea V. Bajcsy, M. O'Malley, A. Dragan","doi":"10.1177/02783649211050958","DOIUrl":"https://doi.org/10.1177/02783649211050958","url":null,"abstract":"When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42136688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Simultaneous Localisation and Map Building: The Kidnapped Way 同时本地化和地图构建:被绑架的方式
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-06-24 DOI: 10.26180/14835033.V1
D. Spero
Linear thermoplastic polycarbonates derived from: (i) a carbonate precursor; and (ii) at least one dihydric phenol represented by the general formula wherein R is independently selected from hydrogen and lower alkyl radicals, X is selected from monocyclic cycloalkylidene radicals containing from 8 to about 16 ring carbon atoms, and n and n' are independently selected from whole numbers having a value of from 0 to 2 inclusive, with the proviso that the sum of n plus n' is at least one.
线性热塑性聚碳酸酯来源于:(i)碳酸盐前驱体;(ii)至少一种用通式表示的二羟基苯酚,其中R独立取自氢和较低的烷基自由基,X独立取自含有8 ~约16个环碳原子的单环烷基烯自由基,n和n’独立取自值为0 ~ 2的整数,但n + n’的和至少为1。
{"title":"Simultaneous Localisation and Map Building: The Kidnapped Way","authors":"D. Spero","doi":"10.26180/14835033.V1","DOIUrl":"https://doi.org/10.26180/14835033.V1","url":null,"abstract":"Linear thermoplastic polycarbonates derived from: (i) a carbonate precursor; and (ii) at least one dihydric phenol represented by the general formula wherein R is independently selected from hydrogen and lower alkyl radicals, X is selected from monocyclic cycloalkylidene radicals containing from 8 to about 16 ring carbon atoms, and n and n' are independently selected from whole numbers having a value of from 0 to 2 inclusive, with the proviso that the sum of n plus n' is at least one.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44788552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Contact-initiated shared control strategies for four-arm supernumerary manipulation with foot interfaces 接触启动的共享控制策略,用于带足部接口的四臂多余操作
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-06-23 DOI: 10.1177/02783649211017642
Walid Amanhoud, Jacob Hernandez Sanchez, M. Bouri, A. Billard
In industrial or surgical settings, to achieve many tasks successfully, at least two people are needed. To this end, robotic assistance could be used to enable a single person to perform such tasks alone, with the help of robots through direct, shared, or autonomous control. We are interested in four-arm manipulation scenarios, where both feet are used to control two robotic arms via bi-pedal haptic interfaces. The robotic arms complement the tasks of the biological arms, for instance, in supporting and moving an object while working on it (using both hands). To reduce fatigue, cognitive workload, and to ease the execution of the foot manipulation, we propose two types of assistance that can be enabled upon contact with the object (i.e., based on the interaction forces): autonomous-contact force generation and auto-coordination of the robotic arms. The latter relates to controlling both arms with a single foot, once the object is grasped. We designed four (shared) control strategies that are derived from the combinations (absence/presence) of both assistance modalities, and we compared them through a user study (with 12 participants) on a four-arm manipulation task. The results show that force assistance positively improves human–robot fluency in the four-arm task, the ease of use and usefulness; it also reduces the fatigue. Finally, to make the dual-assistance approach the preferred and most successful among the proposed control strategies, delegating the grasping force to the robotic arms is a crucial factor when controlling them both with a single foot.
在工业或外科环境中,要成功完成许多任务,至少需要两个人。为此,机器人辅助可以使一个人单独完成这些任务,通过机器人的直接、共享或自主控制的帮助。我们感兴趣的是四臂操作场景,即双脚通过双踏板触觉接口来控制两个机械臂。机器人手臂是生物手臂的补充,例如,在操作物体时(使用双手)支撑和移动物体。为了减少疲劳,认知负荷,并简化足部操作的执行,我们提出了两种类型的辅助,可以在与物体接触时启用(即基于相互作用力):自主接触力生成和机械臂的自动协调。后者涉及到一旦抓住物体,用一只脚控制双臂。我们设计了四种(共享的)控制策略,这些策略来源于两种辅助方式的组合(缺席/在场),并通过一项用户研究(12名参与者)对四臂操作任务进行了比较。结果表明:力辅助对人机四臂任务的流畅性、易用性和实用性有积极的促进作用;它还能减少疲劳。最后,为了使双辅助方法在所提出的控制策略中成为首选和最成功的方法,在单脚控制机械臂时,将抓取力分配给机械臂是一个关键因素。
{"title":"Contact-initiated shared control strategies for four-arm supernumerary manipulation with foot interfaces","authors":"Walid Amanhoud, Jacob Hernandez Sanchez, M. Bouri, A. Billard","doi":"10.1177/02783649211017642","DOIUrl":"https://doi.org/10.1177/02783649211017642","url":null,"abstract":"In industrial or surgical settings, to achieve many tasks successfully, at least two people are needed. To this end, robotic assistance could be used to enable a single person to perform such tasks alone, with the help of robots through direct, shared, or autonomous control. We are interested in four-arm manipulation scenarios, where both feet are used to control two robotic arms via bi-pedal haptic interfaces. The robotic arms complement the tasks of the biological arms, for instance, in supporting and moving an object while working on it (using both hands). To reduce fatigue, cognitive workload, and to ease the execution of the foot manipulation, we propose two types of assistance that can be enabled upon contact with the object (i.e., based on the interaction forces): autonomous-contact force generation and auto-coordination of the robotic arms. The latter relates to controlling both arms with a single foot, once the object is grasped. We designed four (shared) control strategies that are derived from the combinations (absence/presence) of both assistance modalities, and we compared them through a user study (with 12 participants) on a four-arm manipulation task. The results show that force assistance positively improves human–robot fluency in the four-arm task, the ease of use and usefulness; it also reduces the fatigue. Finally, to make the dual-assistance approach the preferred and most successful among the proposed control strategies, delegating the grasping force to the robotic arms is a crucial factor when controlling them both with a single foot.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/02783649211017642","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48283101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Task space adaptation via the learning of gait controllers of magnetic soft millirobots 基于软磁机器人步态控制器学习的任务空间自适应
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-06-16 DOI: 10.1177/02783649211021869
S. Demir, Utku Çulha, A. C. Karacakol, Abdon Pena‐Francesch, Sebastian Trimpe, M. Sitti
Untethered small-scale soft robots have promising applications in minimally invasive surgery, targeted drug delivery, and bioengineering applications as they can directly and non-invasively access confined and hard-to-reach spaces in the human body. For such potential biomedical applications, the adaptivity of the robot control is essential to ensure the continuity of the operations, as task environment conditions show dynamic variations that can alter the robot’s motion and task performance. The applicability of the conventional modeling and control methods is further limited for soft robots at the small-scale owing to their kinematics with virtually infinite degrees of freedom, inherent stochastic variability during fabrication, and changing dynamics during real-world interactions. To address the controller adaptation challenge to dynamically changing task environments, we propose using a probabilistic learning approach for a millimeter-scale magnetic walking soft robot using Bayesian optimization (BO) and Gaussian processes (GPs). Our approach provides a data-efficient learning scheme by finding the gait controller parameters while optimizing the stride length of the walking soft millirobot using a small number of physical experiments. To demonstrate the controller adaptation, we test the walking gait of the robot in task environments with different surface adhesion and roughness, and medium viscosity, which aims to represent the possible conditions for future robotic tasks inside the human body. We further utilize the transfer of the learned GP parameters among different task spaces and robots and compare their efficacy on the improvement of data-efficient controller learning.
无人值守的小型软机器人在微创手术、靶向药物输送和生物工程应用中有着很好的应用前景,因为它们可以直接无创地进入人体内有限且难以到达的空间。对于此类潜在的生物医学应用,机器人控制的自适应性对于确保操作的连续性至关重要,因为任务环境条件显示出动态变化,可以改变机器人的运动和任务性能。传统建模和控制方法在小规模软机器人中的适用性进一步受到限制,因为它们的运动学具有几乎无限的自由度、制造过程中固有的随机可变性以及真实世界交互过程中不断变化的动力学。为了解决控制器对动态变化任务环境的自适应挑战,我们建议使用贝叶斯优化(BO)和高斯过程(GPs)对毫米级磁性步行软机器人使用概率学习方法。我们的方法通过找到步态控制器参数,同时使用少量物理实验优化步行软毫米机器人的步长,提供了一种数据高效的学习方案。为了证明控制器的自适应性,我们测试了机器人在具有不同表面附着力和粗糙度以及介质粘度的任务环境中的行走步态,旨在代表未来机器人在人体内执行任务的可能条件。我们进一步利用了学习的GP参数在不同任务空间和机器人之间的传递,并比较了它们在改进数据高效控制器学习方面的功效。
{"title":"Task space adaptation via the learning of gait controllers of magnetic soft millirobots","authors":"S. Demir, Utku Çulha, A. C. Karacakol, Abdon Pena‐Francesch, Sebastian Trimpe, M. Sitti","doi":"10.1177/02783649211021869","DOIUrl":"https://doi.org/10.1177/02783649211021869","url":null,"abstract":"Untethered small-scale soft robots have promising applications in minimally invasive surgery, targeted drug delivery, and bioengineering applications as they can directly and non-invasively access confined and hard-to-reach spaces in the human body. For such potential biomedical applications, the adaptivity of the robot control is essential to ensure the continuity of the operations, as task environment conditions show dynamic variations that can alter the robot’s motion and task performance. The applicability of the conventional modeling and control methods is further limited for soft robots at the small-scale owing to their kinematics with virtually infinite degrees of freedom, inherent stochastic variability during fabrication, and changing dynamics during real-world interactions. To address the controller adaptation challenge to dynamically changing task environments, we propose using a probabilistic learning approach for a millimeter-scale magnetic walking soft robot using Bayesian optimization (BO) and Gaussian processes (GPs). Our approach provides a data-efficient learning scheme by finding the gait controller parameters while optimizing the stride length of the walking soft millirobot using a small number of physical experiments. To demonstrate the controller adaptation, we test the walking gait of the robot in task environments with different surface adhesion and roughness, and medium viscosity, which aims to represent the possible conditions for future robotic tasks inside the human body. We further utilize the transfer of the learned GP parameters among different task spaces and robots and compare their efficacy on the improvement of data-efficient controller learning.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/02783649211021869","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49638544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
GKNet: Grasp keypoint network for grasp candidates detection GKNet:抓取关键点网络,用于抓取候选对象的检测
IF 9.2 1区 计算机科学 Q1 Mathematics Pub Date : 2021-06-16 DOI: 10.1177/02783649211069569
Ruinian Xu, Fu-Jen Chu, P. Vela
Contemporary grasp detection approaches employ deep learning to achieve robustness to sensor and object model uncertainty. The two dominant approaches design either grasp-quality scoring or anchor-based grasp recognition networks. This paper presents a different approach to grasp detection by treating it as keypoint detection in image-space. The deep network detects each grasp candidate as a pair of keypoints, convertible to the grasp representation g = {x,y,w,θ} T , rather than a triplet or quartet of corner points. Decreasing the detection difficulty by grouping keypoints into pairs boosts performance. To promote capturing dependencies between keypoints, a non-local module is incorporated into the network design. A final filtering strategy based on discrete and continuous orientation prediction removes false correspondences and further improves grasp detection performance. GKNet, the approach presented here, achieves a good balance between accuracy and speed on the Cornell and the abridged Jacquard datasets (96.9% and 98.39% at 41.67 and 23.26 fps). Follow-up experiments on a manipulator evaluate GKNet using four types of grasping experiments reflecting different nuisance sources: static grasping, dynamic grasping, grasping at varied camera angles, and bin picking. GKNet outperforms reference baselines in static and dynamic grasping experiments while showing robustness to varied camera viewpoints and moderate clutter. The results confirm the hypothesis that grasp keypoints are an effective output representation for deep grasp networks that provide robustness to expected nuisance factors.
当前的抓握检测方法采用深度学习来实现对传感器和目标模型不确定性的鲁棒性。两种主要的方法要么设计抓握质量评分,要么设计基于锚点的抓握识别网络。本文提出了一种不同的抓取检测方法,将其视为图像空间中的关键点检测。深度网络将每个抓点候选检测为一对关键点,可转换为抓点表示g = {x,y,w,θ} T,而不是角点的三重或四重奏。通过将关键点分组成对来降低检测难度,从而提高性能。为了促进捕获关键点之间的依赖关系,在网络设计中加入了一个非本地模块。最后一种基于离散和连续方向预测的滤波策略消除了错误对应,进一步提高了抓取检测性能。本文提出的GKNet方法在Cornell和缩短的Jacquard数据集上实现了准确率和速度之间的良好平衡(分别为96.9%和98.39%,分别为41.67和23.26 fps)。在机械臂上的后续实验中,通过四种类型的抓取实验来评估GKNet,这些实验反映了不同的干扰源:静态抓取、动态抓取、不同相机角度的抓取和拾取垃圾箱。GKNet在静态和动态抓取实验中优于参考基线,同时显示出对不同相机视点和适度杂波的鲁棒性。结果证实了一个假设,即抓取关键点是深度抓取网络的有效输出表示,对预期的干扰因素提供鲁棒性。
{"title":"GKNet: Grasp keypoint network for grasp candidates detection","authors":"Ruinian Xu, Fu-Jen Chu, P. Vela","doi":"10.1177/02783649211069569","DOIUrl":"https://doi.org/10.1177/02783649211069569","url":null,"abstract":"Contemporary grasp detection approaches employ deep learning to achieve robustness to sensor and object model uncertainty. The two dominant approaches design either grasp-quality scoring or anchor-based grasp recognition networks. This paper presents a different approach to grasp detection by treating it as keypoint detection in image-space. The deep network detects each grasp candidate as a pair of keypoints, convertible to the grasp representation g = {x,y,w,θ} T , rather than a triplet or quartet of corner points. Decreasing the detection difficulty by grouping keypoints into pairs boosts performance. To promote capturing dependencies between keypoints, a non-local module is incorporated into the network design. A final filtering strategy based on discrete and continuous orientation prediction removes false correspondences and further improves grasp detection performance. GKNet, the approach presented here, achieves a good balance between accuracy and speed on the Cornell and the abridged Jacquard datasets (96.9% and 98.39% at 41.67 and 23.26 fps). Follow-up experiments on a manipulator evaluate GKNet using four types of grasping experiments reflecting different nuisance sources: static grasping, dynamic grasping, grasping at varied camera angles, and bin picking. GKNet outperforms reference baselines in static and dynamic grasping experiments while showing robustness to varied camera viewpoints and moderate clutter. The results confirm the hypothesis that grasp keypoints are an effective output representation for deep grasp networks that provide robustness to expected nuisance factors.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":null,"pages":null},"PeriodicalIF":9.2,"publicationDate":"2021-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47035577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
期刊
International Journal of Robotics Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1