Certified polyhedral decompositions of collision-free configuration space
Pub Date: 2023-11-03 DOI: 10.1177/02783649231201437
Hongkai Dai, Alexandre Amice, Peter Werner, Annan Zhang, Russ Tedrake
Understanding the geometry of collision-free configuration space (C-free) in the presence of Cartesian-space obstacles is an essential ingredient for collision-free motion planning. While it is possible to check for collisions at a point using standard algorithms, to date no practical method exists for computing C-free regions with rigorous certificates due to the complexity of mapping Cartesian-space obstacles through the kinematics. In this work, we present the first, to our knowledge, rigorous method for approximately decomposing a rational parameterization of C-free into certified polyhedral regions. Our method, called C-Iris (C-space Iterative Regional Inflation by Semidefinite programming), generates large, convex polytopes in a rational parameterization of the configuration space which are rigorously certified to be collision-free. Such regions have been shown to be useful for both optimization-based and randomized motion planning. Based on convex optimization, our method works in arbitrary dimensions, makes assumptions only about the convexity of the obstacles in the 3D Cartesian space, and is fast enough to scale to realistic problems in manipulation. We demonstrate our algorithm's ability to fill a non-trivial amount of collision-free C-space in several 2-DOF examples where the C-space can be visualized, as well as the scalability of our algorithm on a 7-DOF KUKA iiwa, a 6-DOF UR3e, and 12-DOF bimanual manipulators. An implementation of our algorithm is open-sourced in Drake. We furthermore provide examples of our algorithm in interactive Python notebooks.
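In the C-Iris line of work, the rational parameterization comes from the tangent half-angle substitution for revolute joints; as a brief illustration of why this renders the kinematics rational (shown for intuition, not as the paper's full construction):

```latex
s_i = \tan\frac{\theta_i}{2}, \qquad
\cos\theta_i = \frac{1 - s_i^2}{1 + s_i^2}, \qquad
\sin\theta_i = \frac{2 s_i}{1 + s_i^2}.
```

Under this change of variables, forward kinematics becomes a rational function of s, so the condition that a robot body stays on one side of a separating plane becomes a polynomial nonnegativity condition, which a sum-of-squares (semidefinite) program can certify over an entire polytope.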
{"title":"Certified polyhedral decompositions of collision-free configuration space","authors":"Hongkai Dai, Alexandre Amice, Peter Werner, Annan Zhang, Russ Tedrake","doi":"10.1177/02783649231201437","DOIUrl":"https://doi.org/10.1177/02783649231201437","url":null,"abstract":"Understanding the geometry of collision-free configuration space (C-free) in the presence of Cartesian-space obstacles is an essential ingredient for collision-free motion planning. While it is possible to check for collisions at a point using standard algorithms, to date no practical method exists for computing C-free regions with rigorous certificates due to the complexity of mapping Cartesian-space obstacles through the kinematics. In this work, we present the first to our knowledge rigorous method for approximately decomposing a rational parametrization of C-free into certified polyhedral regions. Our method, called C-Iris (C-space Iterative Regional Inflation by Semidefinite programming), generates large, convex polytopes in a rational parameterization of the configuration space which are rigorously certified to be collision-free. Such regions have been shown to be useful for both optimization-based and randomized motion planning. Based on convex optimization, our method works in arbitrary dimensions, only makes assumptions about the convexity of the obstacles in the 3D Cartesian space, and is fast enough to scale to realistic problems in manipulation. We demonstrate our algorithm’s ability to fill a non-trivial amount of collision-free C-space in several 2-DOF examples where the C-space can be visualized, as well as the scalability of our algorithm on a 7-DOF KUKA iiwa, a 6-DOF UR3e, and 12-DOF bimanual manipulators. An implementation of our algorithm is open-sourced in Drake . We furthermore provide examples of our algorithm in interactive Python notebooks .","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"5 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135873889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nonverbal social behavior generation for social robots using end-to-end learning
Pub Date: 2023-11-02 DOI: 10.1177/02783649231207974
Woo-Ri Ko, Minsu Jang, Jaeyeon Lee, Jaehong Kim
Social robots facilitate improved human–robot interactions through nonverbal behaviors such as handshakes or hugs. However, the traditional methods, which rely on precoded motions, are predictable and can detract from the perception of robots as interactive agents. To address this issue, we have introduced a Seq2Seq-based neural network model that learns social behaviors from human–human interactions in an end-to-end manner. To mitigate the risk of invalid pose sequences during long-term behavior generation, we incorporated a generative adversarial network (GAN). This proposed method was tested using the humanoid robot, Pepper, in a simulated environment. Given the challenges in assessing the success of social behavior generation, we devised novel metrics to quantify the discrepancy between the generated and ground-truth behaviors. Our analysis reveals the impact of different networks on behavior generation performance and compares the efficacy of learning multiple behaviors versus a single behavior. We anticipate that our method will find application in various sectors, including home service, guide, delivery, educational, and virtual robots, thereby enhancing user interaction and enjoyment.
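To make the architecture concrete, here is a minimal sketch of a GRU-based Seq2Seq generator paired with a sequence discriminator, in the spirit of the approach above; all module names and dimensions are illustrative assumptions, not the authors' implementation:

```python
# Minimal sketch (assumed names and sizes, not the authors' implementation):
# a GRU-based Seq2Seq generator mapping an observed human pose sequence to a
# robot behavior sequence, plus a GAN discriminator that penalizes
# implausible generated pose sequences during long roll-outs.
import torch
import torch.nn as nn

POSE_DIM = 24  # illustrative size of one pose vector

class Seq2SeqGenerator(nn.Module):
    def __init__(self, pose_dim=POSE_DIM, hidden=256):
        super().__init__()
        self.encoder = nn.GRU(pose_dim, hidden, batch_first=True)
        self.decoder = nn.GRU(pose_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, pose_dim)

    def forward(self, src, tgt_len):
        _, h = self.encoder(src)           # summarize the observed behavior
        step = src[:, -1:, :]              # seed the decoder with the last pose
        outputs = []
        for _ in range(tgt_len):           # autoregressive roll-out
            dec, h = self.decoder(step, h)
            step = self.out(dec)
            outputs.append(step)
        return torch.cat(outputs, dim=1)   # (batch, tgt_len, pose_dim)

class SequenceDiscriminator(nn.Module):
    def __init__(self, pose_dim=POSE_DIM, hidden=256):
        super().__init__()
        self.rnn = nn.GRU(pose_dim, hidden, batch_first=True)
        self.score = nn.Linear(hidden, 1)

    def forward(self, seq):
        _, h = self.rnn(seq)
        return self.score(h[-1])           # one real/fake logit per sequence

# The generator loss would combine reconstruction against the demonstrated
# behavior with the adversarial term, e.g. mse(fake, real) + bce(D(fake), 1).
```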
{"title":"Nonverbal social behavior generation for social robots using end-to-end learning","authors":"Woo-Ri Ko, Minsu Jang, Jaeyeon Lee, Jaehong Kim","doi":"10.1177/02783649231207974","DOIUrl":"https://doi.org/10.1177/02783649231207974","url":null,"abstract":"Social robots facilitate improved human–robot interactions through nonverbal behaviors such as handshakes or hugs. However, the traditional methods, which rely on precoded motions, are predictable and can detract from the perception of robots as interactive agents. To address this issue, we have introduced a Seq2Seq-based neural network model that learns social behaviors from human–human interactions in an end-to-end manner. To mitigate the risk of invalid pose sequences during long-term behavior generation, we incorporated a generative adversarial network (GAN). This proposed method was tested using the humanoid robot, Pepper, in a simulated environment. Given the challenges in assessing the success of social behavior generation, we devised novel metrics to quantify the discrepancy between the generated and ground-truth behaviors. Our analysis reveals the impact of different networks on behavior generation performance and compares the efficacy of learning multiple behaviors versus a single behavior. We anticipate that our method will find application in various sectors, including home service, guide, delivery, educational, and virtual robots, thereby enhancing user interaction and enjoyment.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135875571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Quatro++: Robust global registration exploiting ground segmentation for loop closing in LiDAR SLAM
Pub Date: 2023-11-02 DOI: 10.1177/02783649231207654
Hyungtae Lim, Beomsoo Kim, Daebeom Kim, Eungchang Mason Lee, Hyun Myung
Global registration is a fundamental task that estimates the relative pose between two viewpoints of 3D point clouds. However, there are two issues that degrade the performance of global registration in LiDAR SLAM: one is the sparsity issue and the other is degeneracy. The sparsity issue is caused by the sparse characteristics of the 3D point cloud measurements in a mechanically spinning LiDAR sensor. The degeneracy issue sometimes occurs because the outlier-rejection methods reject too many correspondences, leaving fewer than three inliers. These two issues become more severe as the pose discrepancy between the two viewpoints of 3D point clouds grows. To tackle these problems, we propose a robust global registration framework, called Quatro++. Extending our previous work, which focused solely on the global registration itself, we address robust global registration in terms of loop closing in LiDAR SLAM. To this end, ground segmentation is exploited to achieve robust global registration. Through the experiments, we demonstrate that our proposed method shows a higher success rate than the state-of-the-art global registration methods, overcoming the sparsity and degeneracy issues. In addition, we show that ground segmentation significantly helps to increase the success rate for ground vehicles. Finally, we apply our proposed method to the loop closing module in LiDAR SLAM and confirm that the quality of the loop constraints is improved, showing more precise mapping results. Therefore, the experimental evidence corroborates the suitability of our method as an initial alignment in the loop closing. Our code is available at https://quatro-plusplus.github.io.
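As a toy illustration of the preprocessing step the paper builds on (deliberately simplified, and not Quatro++'s segmentation algorithm, which must handle sloped and uneven terrain):

```python
# Toy illustration only (not Quatro++'s segmentation): split a LiDAR cloud
# into ground / non-ground points, the kind of preprocessing the paper
# exploits before robust registration.
import numpy as np

def split_ground(cloud_xyz, ground_z=0.0, tol=0.15):
    """cloud_xyz: (N, 3) array. Points within tol meters of an assumed flat
    ground plane at height ground_z are labeled ground; real pipelines fit
    the plane rather than assuming it."""
    mask = np.abs(cloud_xyz[:, 2] - ground_z) < tol
    return cloud_xyz[mask], cloud_xyz[~mask]

# Example with synthetic points: roughly half near z = 0, half above it.
pts = np.random.rand(1000, 3) * [10.0, 10.0, 2.0] - [5.0, 5.0, 0.2]
ground, nonground = split_ground(pts)
```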
{"title":"Quatro++: Robust global registration exploiting ground segmentation for loop closing in LiDAR SLAM","authors":"Hyungtae Lim, Beomsoo Kim, Daebeom Kim, Eungchang Mason Lee, Hyun Myung","doi":"10.1177/02783649231207654","DOIUrl":"https://doi.org/10.1177/02783649231207654","url":null,"abstract":"Global registration is a fundamental task that estimates the relative pose between two viewpoints of 3D point clouds. However, there are two issues that degrade the performance of global registration in LiDAR SLAM: one is the sparsity issue and the other is degeneracy. The sparsity issue is caused by the sparse characteristics of the 3D point cloud measurements in a mechanically spinning LiDAR sensor. The degeneracy issue sometimes occurs because the outlier-rejection methods reject too many correspondences, leaving less than three inliers. These two issues have become more severe as the pose discrepancy between the two viewpoints of 3D point clouds becomes greater. To tackle these problems, we propose a robust global registration framework, called Quatro++. Extending our previous work that solely focused on the global registration itself, we address the robust global registration in terms of the loop closing in LiDAR SLAM. To this end, ground segmentation is exploited to achieve robust global registration. Through the experiments, we demonstrate that our proposed method shows a higher succeiasdsss rate than the state-of-the-art global registration methods, overcoming the sparsity and degeneracy issues. In addition, we show that ground segmentation asdasd asignificantly helps to idfdfsncrease the success rate for the ground vehicles. Finally, we apply our proposed method to the loop clossdasdlksajing modulasdse in LiDAR SLAM and confirm that the quality of the loop constraints is improved, showing more precise mapping results. Therefore, the experimental evidence corroborated the suitabilitiasdasdy of our method as an initial alignment in the loop closing. Our code is available at https://quatro-plusplus.github.io .","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"22 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135973131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A structured prediction approach for robot imitation learning
Pub Date: 2023-11-01 DOI: 10.1177/02783649231204656
Anqing Duan, Iason Batzianoulis, Raffaello Camoriano, Lorenzo Rosasco, Daniele Pucci, Aude Billard
We propose a structured prediction approach for robot imitation learning from demonstrations. Among various tools for robot imitation learning, supervised learning has been observed to have a prominent role. Structured prediction is a form of supervised learning that enables learning models to operate on output spaces with complex structures. Through the lens of structured prediction, we show how robots can learn to imitate trajectories belonging to not only Euclidean spaces but also Riemannian manifolds. Exploiting ideas from information theory, we propose a class of loss functions based on the f-divergence to measure the information loss between the demonstrated and reproduced probabilistic trajectories. Different types of f-divergence will result in different policies, which we call imitation modes. Furthermore, our approach enables the incorporation of spatial and temporal trajectory modulation, which is necessary for robots to be adaptive to changes in working conditions. We benchmark our algorithm against state-of-the-art methods in terms of trajectory reproduction and adaptation. The quantitative evaluation shows that our approach outperforms other algorithms regarding both accuracy and efficiency. We also report real-world experimental results on learning manifold trajectories in a polishing task with a KUKA LWR robot arm, illustrating the effectiveness of our algorithmic framework.
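For reference, the f-divergence family used for the loss above: for a convex function f with f(1) = 0, and distributions P (demonstrated) and Q (reproduced),

```latex
D_f(P \,\|\, Q) = \int q(x)\, f\!\left(\frac{p(x)}{q(x)}\right) dx .
```

Choosing f(t) = t log t yields the forward KL divergence and f(t) = -log t the reverse KL; since each choice penalizes mismatches between the trajectory distributions differently, each induces a different policy, that is, a different imitation mode.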
{"title":"A structured prediction approach for robot imitation learning","authors":"Anqing Duan, Iason Batzianoulis, Raffaello Camoriano, Lorenzo Rosasco, Daniele Pucci, Aude Billard","doi":"10.1177/02783649231204656","DOIUrl":"https://doi.org/10.1177/02783649231204656","url":null,"abstract":"We propose a structured prediction approach for robot imitation learning from demonstrations. Among various tools for robot imitation learning, supervised learning has been observed to have a prominent role. Structured prediction is a form of supervised learning that enables learning models to operate on output spaces with complex structures. Through the lens of structured prediction, we show how robots can learn to imitate trajectories belonging to not only Euclidean spaces but also Riemannian manifolds. Exploiting ideas from information theory, we propose a class of loss functions based on the f-divergence to measure the information loss between the demonstrated and reproduced probabilistic trajectories. Different types of f-divergence will result in different policies, which we call imitation modes. Furthermore, our approach enables the incorporation of spatial and temporal trajectory modulation, which is necessary for robots to be adaptive to the change in working conditions. We benchmark our algorithm against state-of-the-art methods in terms of trajectory reproduction and adaptation. The quantitative evaluation shows that our approach outperforms other algorithms regarding both accuracy and efficiency. We also report real-world experimental results on learning manifold trajectories in a polishing task with a KUKA LWR robot arm, illustrating the effectiveness of our algorithmic framework.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"97 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135372905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Multi-robot, multi-sensor exploration of multifarious environments with full mission aerial autonomy
Pub Date: 2023-10-31 DOI: 10.1177/02783649231203342
Graeme Best, Rohit Garg, John Keller, Geoffrey A. Hollinger, Sebastian Scherer
We present a coordinated autonomy pipeline for multi-sensor exploration of confined environments. We simultaneously address four broad challenges that are typically overlooked in prior work: (a) make effective use of both range and vision sensing modalities, (b) perform this exploration across a wide range of environments, (c) be resilient to adverse events, and (d) execute this onboard teams of physical robots. Our solution centers around a behavior tree architecture, which adaptively switches between various behaviors involving coordinated exploration and responding to adverse events. Our exploration strategy exploits the benefits of both visual and range sensors with a generalized frontier-based exploration algorithm and an OpenVDB-based map processing pipeline. Our local planner utilizes a dynamically feasible trajectory library and a GPU-based Euclidean distance transform map to allow fast and safe navigation through both tight doorways and expansive spaces. The autonomy pipeline is evaluated with an extensive set of field experiments, with teams of up to three robots that fly up to 3 m/s and distances exceeding 1 km in confined spaces. We provide a summary of various field experiments and detail resilient behaviors that arose: maneuvering narrow doorways, adapting to unexpected environment changes, and emergency landing. Experiments are also detailed from the DARPA Subterranean Challenge, where our proposed autonomy pipeline contributed to us winning the “Most Sectors Explored” award. We provide an extended discussion of lessons learned, release software as open source, and present a video that illustrates our extensive field trials.
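Since the pipeline centers on a behavior tree, here is a minimal generic sketch of the two composite node types such an architecture typically builds on (illustrative Python, not the authors' framework):

```python
# Generic behavior-tree primitives (illustrative, not the authors' framework).
# A Sequence succeeds only if all children succeed; a Fallback tries children
# in order until one succeeds, which is what lets an autonomy stack drop from
# nominal exploration into recovery behaviors such as emergency landing.
SUCCESS, FAILURE, RUNNING = "SUCCESS", "FAILURE", "RUNNING"

class Sequence:
    def __init__(self, children):
        self.children = children

    def tick(self):
        for child in self.children:
            status = child.tick()
            if status != SUCCESS:
                return status          # propagate FAILURE or RUNNING
        return SUCCESS

class Fallback:
    def __init__(self, children):
        self.children = children

    def tick(self):
        for child in self.children:
            status = child.tick()
            if status != FAILURE:
                return status          # first non-failing child wins
        return FAILURE

# e.g. root = Fallback([Sequence([CommsOK(), ExploreFrontier()]), EmergencyLand()])
```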
{"title":"Multi-robot, multi-sensor exploration of multifarious environments with full mission aerial autonomy","authors":"Graeme Best, Rohit Garg, John Keller, Geoffrey A. Hollinger, Sebastian Scherer","doi":"10.1177/02783649231203342","DOIUrl":"https://doi.org/10.1177/02783649231203342","url":null,"abstract":"We present a coordinated autonomy pipeline for multi-sensor exploration of confined environments. We simultaneously address four broad challenges that are typically overlooked in prior work: (a) make effective use of both range and vision sensing modalities, (b) perform this exploration across a wide range of environments, (c) be resilient to adverse events, and (d) execute this onboard teams of physical robots. Our solution centers around a behavior tree architecture, which adaptively switches between various behaviors involving coordinated exploration and responding to adverse events. Our exploration strategy exploits the benefits of both visual and range sensors with a generalized frontier-based exploration algorithm and an OpenVDB-based map processing pipeline. Our local planner utilizes a dynamically feasible trajectory library and a GPU-based Euclidean distance transform map to allow fast and safe navigation through both tight doorways and expansive spaces. The autonomy pipeline is evaluated with an extensive set of field experiments, with teams of up to three robots that fly up to 3 m/s and distances exceeding 1 km in confined spaces. We provide a summary of various field experiments and detail resilient behaviors that arose: maneuvering narrow doorways, adapting to unexpected environment changes, and emergency landing. Experiments are also detailed from the DARPA Subterranean Challenge, where our proposed autonomy pipeline contributed to us winning the “Most Sectors Explored” award. We provide an extended discussion of lessons learned, release software as open source, and present a video that illustrates our extensive field trials.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"12 21","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135813224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
DUEL: Depth visUal Ego-motion Learning for autonomous robot obstacle avoidance
Pub Date: 2023-10-30 DOI: 10.1177/02783649231210325
Naiyao Wang, Bo Zhang, Haixu Chi, Hua Wang, Seán McLoone, Hongbo Liu
Reliable obstacle avoidance, which is essential for safe autonomous robot interaction with the real world, raises various challenges, such as difficulties with obstacle perception and latent factor cognition, that impact multi-modal obstacle avoidance. In this paper, we propose a Depth visUal Ego-motion Learning (DUEL) model, consisting of a cognitive generation network, a policy decision network, and a potential partition network, to learn autonomous obstacle avoidance from expert policies. The DUEL model takes advantage of binocular vision to perceive scene depth. This serves as the input to the cognitive generation network, which generates obstacle avoidance policies by maximizing its causal entropy. The policy decision network then optimizes the generation of the policies with reference to expert policies. The generated obstacle avoidance policies are simultaneously transferred to the potential partition network to capture the latent factors contained within expert policies and perform multi-modal obstacle avoidance. These three core networks iteratively optimize the multi-modal policies relying on causal entropy and mutual information theorems, which are proven theoretically. Experimental comparisons with state-of-the-art models on 7 metrics demonstrate the effectiveness of the DUEL model. It achieves the best performance with an average ADE (Average Displacement Error) of 0.29 and average FDE (Final Displacement Error) of 0.55 across five different scenarios. Results show that the DUEL model can maintain an average obstacle avoidance success rate of 97% in both simulated and real-world scenarios with multiple obstacles, demonstrating its success at capturing latent factors from expert policies. Our source codes are available at https://github.com/ACoTAI/DUEL.
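For reference, ADE and FDE are the standard displacement metrics: given a predicted trajectory \hat{x}_1, ..., \hat{x}_T and ground truth x_1, ..., x_T,

```latex
\mathrm{ADE} = \frac{1}{T}\sum_{t=1}^{T} \bigl\lVert \hat{x}_t - x_t \bigr\rVert_2,
\qquad
\mathrm{FDE} = \bigl\lVert \hat{x}_T - x_T \bigr\rVert_2 .
```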
{"title":"DUEL: Depth visUal Ego-motion Learning for autonomous robot obstacle avoidance","authors":"Naiyao Wang, Bo Zhang, Haixu Chi, Hua Wang, Seán McLoone, Hongbo Liu","doi":"10.1177/02783649231210325","DOIUrl":"https://doi.org/10.1177/02783649231210325","url":null,"abstract":"Reliable obstacle avoidance, which is essential for safe autonomous robot interaction with the real world, raises various challenges such as difficulties with obstacle perception and latent factor cognition impacting multi-modal obstacle avoidance. In this paper, we propose a Depth visUal Ego-motion Learning (DUEL) model, consisting of a cognitive generation network, a policy decision network and a potential partition network, to learn autonomous obstacle avoidance from expert policies. The DUEL model takes advantage of binocular vision to perceive scene depth. This serves as the input to the cognitive generation network which generates obstacle avoidance policies by maximizing its causal entropy. The policy decision network then optimizes the generation of the policies referring to expert policies. The generated obstacle avoidance policies are simultaneously transferred to the potential partition network to capture the latent factors contained within expert policies and perform multi-modal obstacle avoidance. These three core networks iteratively optimize the multi-modal policies relying on causal entropy and mutual information theorems, which are proven theoretically. Experimental comparisons with state-of-the-art models on 7 metrics demonstrate the effectiveness of the DUEL model. It achieves the best performance with an average ADE (Average Displacement Error) of 0.29 and average FDE (Final Displacement Error) of 0.55 across five different scenarios. Results show that the DUEL model can maintain an average obstacle avoidance success rate of 97% for both simulated and real world scenarios with multiple obstacles, demonstrating its success at capturing latent factors from expert policies. Our source codes are available at https://github.com/ACoTAI/DUEL .","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"47 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136019443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The matroid team surviving orienteers problem and its variants: Constrained routing of heterogeneous teams with risky traversal
Pub Date: 2023-10-30 DOI: 10.1177/02783649231210326
Stefan Jorgensen, Marco Pavone
Consider deploying a team of robots in order to visit sites in a risky environment (i.e., where a robot might be lost during a traversal), subject to team-based operational constraints such as limits on team composition, traffic throughputs, and launch constraints. We formalize this problem using a graph to represent the environment, enforcing probabilistic survival constraints for each robot, and using a matroid (which generalizes linear independence to sets) to capture the team-based operational constraints. The resulting “Matroid Team Surviving Orienteers” (MTSO) problem has broad applications for robotics such as informative path planning, resource delivery, and search and rescue. We demonstrate that the objective for the MTSO problem has submodular structure, which leads us to develop two polynomial time algorithms which are guaranteed to find a solution with value within a constant factor of the optimum. The second of our algorithms is an extension of the accelerated continuous greedy algorithm, and can be applied to much broader classes of constraints while maintaining bounds on suboptimality. In addition to in-depth analysis, we demonstrate the efficiency of our approaches by applying them to a scenario where a team of robots must gather information while avoiding dangers in the Coral Triangle and characterize scaling and parameter selection using a synthetic dataset.
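For context, the submodular structure invoked above is the diminishing-returns property: a set function f is submodular if, for all sets A ⊆ B and elements e ∉ B,

```latex
f(A \cup \{e\}) - f(A) \;\ge\; f(B \cup \{e\}) - f(B).
```

Classical results for maximizing a monotone submodular function under a matroid constraint give plain greedy a 1/2 guarantee and continuous greedy a (1 - 1/e) guarantee; the specific constant factors proved in the paper build on this machinery.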
The effects of selected object features on a pick-and-place task: A human multimodal dataset
Pub Date: 2023-10-30 DOI: 10.1177/02783649231210965
Linda Lastrico, Valerio Belcamino, Alessandro Carfì, Alessia Vignolo, Alessandra Sciutti, Fulvio Mastrogiovanni, Francesco Rea
We propose a dataset to study the influence of object-specific characteristics on human pick-and-place movements and to compare the quality of the motion kinematics extracted by various sensors. This dataset is also suitable for promoting a broader discussion of general learning problems in the hand-object interaction domain, such as intention recognition or motion generation, with applications in the robotics field. The dataset consists of recordings of 15 subjects performing 80 repetitions of a pick-and-place action under various experimental conditions, for a total of 1200 pick-and-places. The data were collected with a multimodal setup composed of multiple cameras observing the actions from different perspectives, a motion capture system, and a wrist-worn inertial measurement unit. All the objects manipulated in the experiments are identical in shape, size, and appearance but differ in weight and liquid filling, which influences the carefulness required for their handling.
{"title":"The effects of selected object features on a pick-and-place task: A human multimodal dataset","authors":"Linda Lastrico, Valerio Belcamino, Alessandro Carfì, Alessia Vignolo, Alessandra Sciutti, Fulvio Mastrogiovanni, Francesco Rea","doi":"10.1177/02783649231210965","DOIUrl":"https://doi.org/10.1177/02783649231210965","url":null,"abstract":"We propose a dataset to study the influence of object-specific characteristics on human pick-and-place movements and compare the quality of the motion kinematics extracted by various sensors. This dataset is also suitable for promoting a broader discussion on general learning problems in the hand-object interaction domain, such as intention recognition or motion generation with applications in the Robotics field. The dataset consists of the recordings of 15 subjects performing 80 repetitions of a pick-and-place action under various experimental conditions, for a total of 1200 pick-and-places. The data has been collected thanks to a multimodal setup composed of multiple cameras, observing the actions from different perspectives, a motion capture system, and a wrist-worn inertial measurement unit. All the objects manipulated in the experiments are identical in shape, size, and appearance but differ in weight and liquid filling, which influences the carefulness required for their handling.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136068402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MAgro dataset: A dataset for simultaneous localization and mapping in agricultural environments
Pub Date: 2023-10-25 DOI: 10.1177/02783649231210011
Mercedes Marzoa Tanco, Guillermo Trinidad Barnech, Federico Andrade, Javier Baliosian, Martin LLofriu, JM Di Martino, Gonzalo Tejera
The agricultural industry is being transformed thanks to recent innovations in computer vision and deep learning. However, the lack of specific datasets collected in natural agricultural environments is, arguably, the main bottleneck for novel discoveries and benchmarking. The present work provides a novel dataset, MAgro, and a framework to expand data collection. We present the first version of the MAgro Dataset V1.0, consisting of nine ROS bags (and the corresponding raw data) containing data collected in apple and pear crops. Data were gathered by repeating a fixed trajectory on different days under different illumination and weather conditions. To support the evaluation of loop closure algorithms, the trajectories are designed to have loop closures, revisiting some places from different viewpoints. We use a Clearpath Jackal robot equipped with stereo cameras pointing to the front and left side, a 3D LiDAR, three inertial measurement units (IMUs), and wheel encoders. Additionally, we provide calibrated RTK GPS data that can be used as ground truth. Our dataset is openly available, and it will be updated to add more data and variability. Finally, we tested two existing state-of-the-art algorithms for vision- and point-cloud-based localization and mapping on our novel dataset to validate its usability.
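Since the data ship as ROS bags, a minimal sketch of reading one with the standard ROS 1 rosbag Python API follows; the filename and topic names are placeholders, not the dataset's actual topic list:

```python
# Reading one of the provided bags with the standard ROS 1 rosbag Python API.
# The filename and topic names below are placeholders; run `rosbag info` on
# each bag to list its actual topics.
import rosbag

with rosbag.Bag("magro_apple_run1.bag") as bag:  # hypothetical filename
    for topic, msg, t in bag.read_messages(
            topics=["/points_raw", "/imu/data", "/gps/fix"]):  # placeholders
        # msg is the deserialized ROS message; t is the receipt timestamp
        print(t.to_sec(), topic, type(msg).__name__)
```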
{"title":"MAgro dataset: A dataset for simultaneous localization and mapping in agricultural environments","authors":"Mercedes Marzoa Tanco, Guillermo Trinidad Barnech, Federico Andrade, Javier Baliosian, Martin LLofriu, JM Di Martino, Gonzalo Tejera","doi":"10.1177/02783649231210011","DOIUrl":"https://doi.org/10.1177/02783649231210011","url":null,"abstract":"The agricultural industry is being transformed, thanks to recent innovations in computer vision and deep learning. However, the lack of specific datasets collected in natural agricultural environments is, arguably, the main bottleneck for novel discoveries and benchmarking. The present work provides a novel dataset, Magro, and a framework to expand data collection. We present the first version of the Magro Dataset V1.0, consisting of nine ROS bags (and the corresponding raw data) containing data collected in apple and pear crops. Data were gathered, repeating a fixed trajectory on different days under different illumination and weather conditions. To support the evaluation of loop closure algorithms, the trajectories are designed to have loop closures, revisiting some places from different viewpoints. We use a Clearpath’s Jackal robot equipped with stereo cameras pointing to the front and left side, a 3D LIDAR, three inertial measurement units (IMU), and wheel encoders. Additionally, we provide calibrated RTK GPS data that can be used as ground truth. Our dataset is openly available, and it will be updated to have more data and variability. Finally, we tested two existing state-of-the-art algorithms for vision and point cloud-based localization and mapping on our novel dataset to validate the dataset’s usability.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134973524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Impact-aware task-space quadratic-programming control
Pub Date: 2023-10-16 DOI: 10.1177/02783649231198558
Yuquan Wang, Niels Dehio, Arnaud Tanguy, Abderrahmane Kheddar
Robots usually establish contacts at rigid surfaces with near-zero relative velocities. Otherwise, impact-induced energy propagates in the robot's linkage and may cause irreversible damage to the hardware. Moreover, abrupt changes in task-space contact velocity and peak impact forces also result in abrupt changes in robot joint velocities and torques, which can compromise controllers' stability, especially for those based on smooth models. In reality, several tasks require establishing contact with moderately high velocity. We propose to enhance task-space multi-objective controllers formulated as a quadratic program to be resilient to frictional impacts in three dimensions. We devise new constraints and reformulate the usual ones to be robust to the abrupt joint-state changes mentioned earlier. The impact event becomes a controlled process once the optimal control search space is aware of: (1) the hardware-affordable impact bounds and (2) an analytically computed feasible set (polyhedra) that constrains post-impact critical states. Prior to and near the targeted contact spot, we assume, at each control cycle, that the impact will occur at the next iteration. This one-step preview makes our controller robust to impact time and location. To assess our approach, we tested its resilience to moderate impacts with the Panda manipulator and achieved swift grabbing tasks with the HRP-4 humanoid robot.
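For orientation, a generic task-space QP has the following shape; this is the standard template rather than the paper's exact formulation, and the notation (J for the task Jacobian, M for the mass matrix, J_c and f_c for the contact Jacobian and force) is ours:

```latex
\begin{aligned}
\min_{\ddot{q},\,\tau}\quad & \bigl\lVert J\ddot{q} + \dot{J}\dot{q} - \ddot{x}^{*} \bigr\rVert^{2} \\
\text{s.t.}\quad & M(q)\,\ddot{q} + h(q,\dot{q}) = \tau + J_c^{\top} f_c, \\
& \tau_{\min} \le \tau \le \tau_{\max}.
\end{aligned}
```

The paper's impact-aware constraints restrict this same search space so that predicted post-impact jumps in joint velocity and torque remain inside hardware-affordable bounds.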
{"title":"Impact-aware task-space quadratic-programming control","authors":"Yuquan Wang, Niels Dehio, Arnaud Tanguy, Abderrahmane Kheddar","doi":"10.1177/02783649231198558","DOIUrl":"https://doi.org/10.1177/02783649231198558","url":null,"abstract":"Robots usually establish contacts at rigid surfaces with near-zero relative velocities. Otherwise, impact-induced energy propagates in the robot’s linkage and may cause irreversible damage to the hardware. Moreover, abrupt changes in task-space contact velocity and peak impact forces also result in abrupt changes in robot joint velocities and torques; which can compromise controllers’ stability, especially for those based on smooth models. In reality, several tasks would require establishing contact with moderately high velocity. We propose to enhance task-space multi-objective controllers formulated as a quadratic program to be resilient to frictional impacts in three dimensions. We devise new constraints and reformulate the usual ones to be robust to the abrupt joint state changes mentioned earlier. The impact event becomes a controlled process once the optimal control search space is aware of: (1) the hardware-affordable impact bounds and (2) analytically computed feasible set (polyhedra) that constrain post-impact critical states. Prior to and nearby the targeted contact spot, we assume, at each control cycle, that the impact will occur at the next iteration. This somewhat one-step preview makes our controller robust to impact time and location. To assess our approach, we experimented its resilience to moderate impacts with the Panda manipulator and achieved swift grabbing tasks with the HRP-4 humanoid robot.","PeriodicalId":54942,"journal":{"name":"International Journal of Robotics Research","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136079275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}