首页 > 最新文献

Autonomous Robots最新文献

英文 中文
Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control 仿人机器人导航的模仿行为强化学习:同步规划与控制
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-17 DOI: 10.1007/s10514-024-10160-w
Xiaoying Wang, Tong Zhang

Humanoid robots have strong adaptability to complex environments and possess human-like flexibility, enabling them to perform precise farming and harvesting tasks in varying depths of terrains. They serve as essential tools for agricultural intelligence. In this article, a novel method was proposed to improve the robustness of autonomous navigation for humanoid robots, which intercommunicates the data fusion of the footprint planning and control levels. In particular, a deep reinforcement learning model - Proximal Policy Optimization (PPO) that has been fine-tuned is introduced into this layer, before which heuristic trajectory was generated based on imitation learning. In the RL period, the KL divergence between the agent’s policy and imitative expert policy as a value penalty is added to the advantage function. As a proof of concept, our navigation policy is trained in a robotic simulator and then successfully applied to the physical robot GTX for indoor multi-mode navigation. The experimental results conclude that incorporating imitation learning imparts anthropomorphic attributes to robots and facilitates the generation of seamless footstep patterns. There is a significant improvement in ZMP trajectory in y-direction from the center by 21.56% is noticed. Additionally, this method improves dynamic locomotion stability, the body attitude angle falling between less than ± 5.5(^circ ) compared to ± 48.4(^circ ) with traditional algorithm. In general, navigation error is below 5 cm, which we verified in the experiments. It is thought that the outcome of the proposed framework presented in this article can provide a reference for researchers studying autonomous navigation applications of humanoid robots on uneven ground.

仿人机器人对复杂环境有很强的适应能力,具有类似人类的灵活性,能够在不同深度的地形中执行精确的耕作和收割任务。它们是农业智能的重要工具。本文提出了一种提高仿人机器人自主导航鲁棒性的新方法,该方法将足迹规划和控制层面的数据融合起来。特别是,在这一层中引入了经过微调的深度强化学习模型--近端策略优化(PPO),在此之前,基于模仿学习生成启发式轨迹。在 RL 阶段,代理策略与模仿专家策略之间的 KL 发散作为一种价值惩罚被添加到优势函数中。作为概念验证,我们在机器人模拟器中训练了导航策略,并将其成功应用于物理机器人 GTX 的室内多模式导航。实验结果表明,模仿学习赋予了机器人拟人属性,并有助于生成无缝脚步模式。ZMP轨迹在从中心开始的Y方向上有明显改善,改善幅度达21.56%。此外,该方法还提高了动态运动的稳定性,与传统算法的± 48.4(^circ )相比,该方法的身体姿态角小于± 5.5(^circ )。一般来说,导航误差低于 5 厘米,这一点我们在实验中得到了验证。本文提出的框架成果可以为研究仿人机器人在不平整地面上的自主导航应用提供参考。
{"title":"Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control","authors":"Xiaoying Wang,&nbsp;Tong Zhang","doi":"10.1007/s10514-024-10160-w","DOIUrl":"10.1007/s10514-024-10160-w","url":null,"abstract":"<div><p>Humanoid robots have strong adaptability to complex environments and possess human-like flexibility, enabling them to perform precise farming and harvesting tasks in varying depths of terrains. They serve as essential tools for agricultural intelligence. In this article, a novel method was proposed to improve the robustness of autonomous navigation for humanoid robots, which intercommunicates the data fusion of the footprint planning and control levels. In particular, a deep reinforcement learning model - Proximal Policy Optimization (PPO) that has been fine-tuned is introduced into this layer, before which heuristic trajectory was generated based on imitation learning. In the RL period, the KL divergence between the agent’s policy and imitative expert policy as a value penalty is added to the advantage function. As a proof of concept, our navigation policy is trained in a robotic simulator and then successfully applied to the physical robot <i>GTX</i> for indoor multi-mode navigation. The experimental results conclude that incorporating imitation learning imparts anthropomorphic attributes to robots and facilitates the generation of seamless footstep patterns. There is a significant improvement in ZMP trajectory in y-direction from the center by 21.56% is noticed. Additionally, this method improves dynamic locomotion stability, the body attitude angle falling between less than ± 5.5<span>(^circ )</span> compared to ± 48.4<span>(^circ )</span> with traditional algorithm. In general, navigation error is below 5 cm, which we verified in the experiments. It is thought that the outcome of the proposed framework presented in this article can provide a reference for researchers studying autonomous navigation applications of humanoid robots on uneven ground.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 2-3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140608698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Terrain traversability prediction through self-supervised learning and unsupervised domain adaptation on synthetic data 通过合成数据上的自监督学习和无监督域适应进行地形可穿越性预测
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-03-30 DOI: 10.1007/s10514-024-10158-4
Giuseppe Vecchio, Simone Palazzo, Dario C. Guastella, Daniela Giordano, Giovanni Muscato, Concetto Spampinato

Terrain traversability estimation is a fundamental task for supporting robot navigation on uneven surfaces. Recent learning-based approaches for predicting traversability from RGB images have shown promising results, but require manual annotation of a large number of images for training. To address this limitation, we present a method for traversability estimation on unlabeled videos that combines dataset synthesis, self-supervision and unsupervised domain adaptation. We pose the traversability estimation as a vector regression task over vertical bands of the observed frame. The model is pre-trained through self-supervision to reduce the distribution shift between synthetic and real data and encourage shared feature learning. Then, supervised training on synthetic videos is carried out, while employing an unsupervised domain adaptation loss to improve its generalization capabilities on real scenes. Experimental results show that our approach is on par with standard supervised training, and effectively supports robot navigation without the need of manual annotations. Training code and synthetic dataset will be publicly released at: https://github.com/perceivelab/traversability-synth.

地形可穿越性估算是支持机器人在不平路面上导航的一项基本任务。最近基于学习的 RGB 图像可穿越性预测方法取得了可喜的成果,但需要对大量图像进行人工标注训练。为了解决这一局限性,我们提出了一种在无标注视频上进行可穿越性估算的方法,该方法结合了数据集合成、自监督和无监督领域适应。我们将可穿越性估算看作是对观察到的帧的垂直带进行向量回归的任务。通过自我监督对模型进行预训练,以减少合成数据和真实数据之间的分布偏移,并鼓励共享特征学习。然后,在合成视频上进行监督训练,同时采用无监督域适应损失来提高其在真实场景上的泛化能力。实验结果表明,我们的方法与标准的监督训练不相上下,无需人工标注即可有效支持机器人导航。训练代码和合成数据集将在以下网站公开发布:https://github.com/perceivelab/traversability-synth。
{"title":"Terrain traversability prediction through self-supervised learning and unsupervised domain adaptation on synthetic data","authors":"Giuseppe Vecchio,&nbsp;Simone Palazzo,&nbsp;Dario C. Guastella,&nbsp;Daniela Giordano,&nbsp;Giovanni Muscato,&nbsp;Concetto Spampinato","doi":"10.1007/s10514-024-10158-4","DOIUrl":"10.1007/s10514-024-10158-4","url":null,"abstract":"<div><p>Terrain traversability estimation is a fundamental task for supporting robot navigation on uneven surfaces. Recent learning-based approaches for predicting traversability from RGB images have shown promising results, but require manual annotation of a large number of images for training. To address this limitation, we present a method for traversability estimation on unlabeled videos that combines dataset synthesis, self-supervision and unsupervised domain adaptation. We pose the traversability estimation as a vector regression task over vertical bands of the observed frame. The model is pre-trained through self-supervision to reduce the distribution shift between synthetic and real data and encourage shared feature learning. Then, supervised training on synthetic videos is carried out, while employing an unsupervised domain adaptation loss to improve its generalization capabilities on real scenes. Experimental results show that our approach is on par with standard supervised training, and effectively supports robot navigation without the need of manual annotations. Training code and synthetic dataset will be publicly released at: https://github.com/perceivelab/traversability-synth.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 2-3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10158-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140364755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maximal coverage problems with routing constraints using cross-entropy Monte Carlo tree search 利用交叉熵蒙特卡洛树搜索解决具有路由限制的最大覆盖问题
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-30 DOI: 10.1007/s10514-024-10156-6
Pao-Te Lin, Kuo-Shih Tseng

Spatial search, and environmental monitoring are key technologies in robotics. These problems can be reformulated as maximal coverage problems with routing constraints, which are NP-hard problems. The generalized cost-benefit algorithm (GCB) can solve these problems with theoretical guarantees. To achieve better performance, evolutionary algorithms (EA) boost its performance via more samples. However, it is hard to know the terminal conditions of EA to outperform GCB. To solve these problems with theoretical guarantees and terminal conditions, in this research, the cross-entropy based Monte Carlo Tree Search algorithm (CE-MCTS) is proposed. It consists of three parts: the EA for sampling the branches, the upper confidence bound policy for selections, and the estimation of distribution algorithm for simulations. The experiments demonstrate that the CE-MCTS outperforms benchmark approaches (e.g., GCB, EAMC) in spatial search problems.

空间搜索和环境监测是机器人技术中的关键技术。这些问题可以被重新表述为带有路由约束的最大覆盖问题,是 NP 难问题。广义成本收益算法(GCB)可以在理论上保证解决这些问题。为了获得更好的性能,进化算法(EA)通过增加样本来提高性能。然而,我们很难知道 EA 优于 GCB 的最终条件。为了解决这些具有理论保证和终端条件的问题,本研究提出了基于交叉熵的蒙特卡洛树搜索算法(CE-MCTS)。该算法由三部分组成:用于分支采样的 EA、用于选择的置信上限策略和用于模拟的分布估计算法。实验证明,在空间搜索问题上,CE-MCTS 优于基准方法(如 GCB、EAMC)。
{"title":"Maximal coverage problems with routing constraints using cross-entropy Monte Carlo tree search","authors":"Pao-Te Lin,&nbsp;Kuo-Shih Tseng","doi":"10.1007/s10514-024-10156-6","DOIUrl":"10.1007/s10514-024-10156-6","url":null,"abstract":"<div><p>Spatial search, and environmental monitoring are key technologies in robotics. These problems can be reformulated as maximal coverage problems with routing constraints, which are NP-hard problems. The generalized cost-benefit algorithm (GCB) can solve these problems with theoretical guarantees. To achieve better performance, evolutionary algorithms (EA) boost its performance via more samples. However, it is hard to know the terminal conditions of EA to outperform GCB. To solve these problems with theoretical guarantees and terminal conditions, in this research, the cross-entropy based Monte Carlo Tree Search algorithm (CE-MCTS) is proposed. It consists of three parts: the EA for sampling the branches, the upper confidence bound policy for selections, and the estimation of distribution algorithm for simulations. The experiments demonstrate that the CE-MCTS outperforms benchmark approaches (e.g., GCB, EAMC) in spatial search problems.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139646697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collocation methods for second and higher order systems 二阶和高阶系统的搭配方法
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-28 DOI: 10.1007/s10514-023-10155-z
Siro Moreno-Martín, Lluís Ros, Enric Celaya

It is often unnoticed that the predominant way to use collocation methods is fundamentally flawed when applied to optimal control in robotics. Such methods assume that the system dynamics is given by a first order ODE, whereas robots are often governed by a second or higher order ODE involving configuration variables and their time derivatives. To apply a collocation method, therefore, the usual practice is to resort to the well known procedure of casting an Mth order ODE into M first order ones. This manipulation, which in the continuous domain is perfectly valid, leads to inconsistencies when the problem is discretized. Since the configuration variables and their time derivatives are approximated with polynomials of the same degree, their differential dependencies cannot be fulfilled, and the actual dynamics is not satisfied, not even at the collocation points. This paper draws attention to this problem, and develops improved versions of the trapezoidal and Hermite–Simpson collocation methods that do not present these inconsistencies. In many cases, the new methods reduce the dynamics transcription error in one order of magnitude, or even more, without noticeably increasing the cost of computing the solutions.

人们往往没有注意到,在应用于机器人优化控制时,主要的搭配方法存在根本性缺陷。这种方法假定系统动力学由一阶 ODE 给出,而机器人通常受二阶或更高阶的 ODE 控制,其中涉及配置变量及其时间导数。因此,要应用配位法,通常的做法是采用众所周知的将 M 阶 ODE 转化为 M 阶一阶 ODE 的程序。这种操作方法在连续域中完全有效,但在问题离散化时却会导致不一致。由于配置变量及其时间导数是用同阶多项式逼近的,因此无法满足它们的微分依赖关系,也就无法满足实际的动力学要求,甚至在配置点上也是如此。本文提请注意这一问题,并开发了梯形和赫米特-辛普森配准方法的改进版本,这些方法不会出现这些不一致问题。在许多情况下,新方法将动力学转录误差减少了一个数量级,甚至更多,而计算求解的成本却没有明显增加。
{"title":"Collocation methods for second and higher order systems","authors":"Siro Moreno-Martín,&nbsp;Lluís Ros,&nbsp;Enric Celaya","doi":"10.1007/s10514-023-10155-z","DOIUrl":"10.1007/s10514-023-10155-z","url":null,"abstract":"<div><p>It is often unnoticed that the predominant way to use collocation methods is fundamentally flawed when applied to optimal control in robotics. Such methods assume that the system dynamics is given by a first order ODE, whereas robots are often governed by a second or higher order ODE involving configuration variables and their time derivatives. To apply a collocation method, therefore, the usual practice is to resort to the well known procedure of casting an <i>M</i>th order ODE into <i>M</i> first order ones. This manipulation, which in the continuous domain is perfectly valid, leads to inconsistencies when the problem is discretized. Since the configuration variables and their time derivatives are approximated with polynomials of the same degree, their differential dependencies cannot be fulfilled, and the actual dynamics is not satisfied, not even at the collocation points. This paper draws attention to this problem, and develops improved versions of the trapezoidal and Hermite–Simpson collocation methods that do not present these inconsistencies. In many cases, the new methods reduce the dynamics transcription error in one order of magnitude, or even more, without noticeably increasing the cost of computing the solutions.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10155-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139579306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting the hospital by integrating mobile robotic assistance systems: a comprehensive classification of the risks to be addressed 通过整合移动机器人辅助系统促进医院发展:应对风险的全面分类
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-24 DOI: 10.1007/s10514-023-10154-0
Lukas Bernhard, Patrik Schwingenschlögl, Jörg Hofmann, Dirk Wilhelm, Alois Knoll

Mobile service robots are a promising technology for supporting workflows throughout the hospital. Combined with an understanding of the environment and the current situation, such systems have the potential to become invaluable tools for overcoming personal shortages and streamlining healthcare workflows. However, few robotic systems have actually been translated to practical application so far, which is due to many challenges centered around the strict and unique requirements imposed by the different hospital environments, which have not yet been collected and analyzed in a structured manner. To address this need, we now present a comprehensive classification of different dimensions of risk to be considered when designing mobile service robots for the hospital. Our classification consists of six risk categories – environmental complexity, hygienic requirements, interaction with persons and objects, workflow flexibility and autonomy – for each of which a scale with distinct risk levels is provided. This concept, for the first time allows for a precise classification of mobile service robots for the hospital, which can prove useful for certification and admission procedures as well as for defining architectural and safety requirements throughout the design process of such robots.

移动服务机器人是一项前景广阔的技术,可为整个医院的工作流程提供支持。结合对环境和现状的了解,这类系统有可能成为克服人员短缺和简化医疗保健工作流程的宝贵工具。然而,迄今为止,很少有机器人系统真正投入实际应用,这是由于不同医院环境所提出的严格而独特的要求带来了许多挑战,而这些挑战尚未以结构化的方式加以收集和分析。为了满足这一需求,我们现在对设计医院移动服务机器人时需要考虑的不同风险维度进行全面分类。我们的分类包括六个风险类别--环境复杂性、卫生要求、与人和物体的互动、工作流程灵活性和自主性--并为每个类别提供了具有不同风险等级的量表。这一概念首次对医院用移动服务机器人进行了精确分类,可用于认证和入院程序,以及在此类机器人的整个设计过程中确定建筑和安全要求。
{"title":"Boosting the hospital by integrating mobile robotic assistance systems: a comprehensive classification of the risks to be addressed","authors":"Lukas Bernhard,&nbsp;Patrik Schwingenschlögl,&nbsp;Jörg Hofmann,&nbsp;Dirk Wilhelm,&nbsp;Alois Knoll","doi":"10.1007/s10514-023-10154-0","DOIUrl":"10.1007/s10514-023-10154-0","url":null,"abstract":"<div><p>Mobile service robots are a promising technology for supporting workflows throughout the hospital. Combined with an understanding of the environment and the current situation, such systems have the potential to become invaluable tools for overcoming personal shortages and streamlining healthcare workflows. However, few robotic systems have actually been translated to practical application so far, which is due to many challenges centered around the strict and unique requirements imposed by the different hospital environments, which have not yet been collected and analyzed in a structured manner. To address this need, we now present a comprehensive classification of different dimensions of risk to be considered when designing mobile service robots for the hospital. Our classification consists of six risk categories – environmental complexity, hygienic requirements, interaction with persons and objects, workflow flexibility and autonomy – for each of which a scale with distinct risk levels is provided. This concept, for the first time allows for a precise classification of mobile service robots for the hospital, which can prove useful for certification and admission procedures as well as for defining architectural and safety requirements throughout the design process of such robots.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10154-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139558975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic task allocation approaches for coordinated exploration of Subterranean environments 地下环境协同勘探的动态任务分配方法
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-23 DOI: 10.1007/s10514-023-10142-4
Matthew O’Brien, Jason Williams, Shengkang Chen, Alex Pitt, Ronald Arkin, Navinda Kottege

This paper presents the methods used by team CSIRO Data61 for multi-agent coordination and exploration in the DARPA Subterranean (SubT) Challenge. The SubT competition involved a single operator sending teams of robots to rapidly explore underground environments with severe navigation and communication challenges. Coordination was framed as a multi-robot task allocation (MRTA) problem to allow for a seamless integration of exploration with other required tasks. Methods for extending a consensus-based task allocation approach for an online and highly dynamic mission are discussed. Exploration tasks were generated from frontiers in a map of traversable space, and graph-based heuristics applied to guide the selection of exploration tasks. Results from simulation, field testing, and the final competition are presented. Team CSIRO Data61 tied for most points scored and achieved second place during the final SubT event.

本文介绍了CSIRO Data61团队在DARPA地下(SubT)挑战赛中用于多智能体协调和探索的方法。SubT竞赛涉及单个操作员派遣机器人团队快速探索具有严峻导航和通信挑战的地下环境。协调被框架为一个多机器人任务分配(MRTA)问题,以允许探索与其他所需任务的无缝集成。讨论了将基于共识的任务分配方法扩展到在线高动态任务的方法。从可穿越空间地图的边界生成探索任务,并应用基于图的启发式方法指导探索任务的选择。给出了仿真、现场测试和决赛的结果。CSIRO Data61队在最后的SubT赛事中获得了最多的得分并获得了第二名。
{"title":"Dynamic task allocation approaches for coordinated exploration of Subterranean environments","authors":"Matthew O’Brien,&nbsp;Jason Williams,&nbsp;Shengkang Chen,&nbsp;Alex Pitt,&nbsp;Ronald Arkin,&nbsp;Navinda Kottege","doi":"10.1007/s10514-023-10142-4","DOIUrl":"10.1007/s10514-023-10142-4","url":null,"abstract":"<div><p>This paper presents the methods used by team CSIRO Data61 for multi-agent coordination and exploration in the DARPA Subterranean (SubT) Challenge. The SubT competition involved a single operator sending teams of robots to rapidly explore underground environments with severe navigation and communication challenges. Coordination was framed as a multi-robot task allocation (MRTA) problem to allow for a seamless integration of exploration with other required tasks. Methods for extending a consensus-based task allocation approach for an online and highly dynamic mission are discussed. Exploration tasks were generated from frontiers in a map of traversable space, and graph-based heuristics applied to guide the selection of exploration tasks. Results from simulation, field testing, and the final competition are presented. Team CSIRO Data61 tied for most points scored and achieved second place during the final SubT event.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1559 - 1577"},"PeriodicalIF":3.5,"publicationDate":"2023-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
AuRo special issue on large language models in robotics guest editorial AuRo关于机器人中的大型语言模型的特刊客座编辑
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-17 DOI: 10.1007/s10514-023-10153-1
{"title":"AuRo special issue on large language models in robotics guest editorial","authors":"","doi":"10.1007/s10514-023-10153-1","DOIUrl":"10.1007/s10514-023-10153-1","url":null,"abstract":"","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"979 - 980"},"PeriodicalIF":3.5,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TidyBot: personalized robot assistance with large language models TidyBot:具有大型语言模型的个性化机器人辅助
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-16 DOI: 10.1007/s10514-023-10139-z
Jimmy Wu, Rika Antonova, Adam Kan, Marion Lepert, Andy Zeng, Shuran Song, Jeannette Bohg, Szymon Rusinkiewicz, Thomas Funkhouser

For a robot to personalize physical assistance effectively, it must learn user preferences that can be generally reapplied to future scenarios. In this work, we investigate personalization of household cleanup with robots that can tidy up rooms by picking up objects and putting them away. A key challenge is determining the proper place to put each object, as people’s preferences can vary greatly depending on personal taste or cultural background. For instance, one person may prefer storing shirts in the drawer, while another may prefer them on the shelf. We aim to build systems that can learn such preferences from just a handful of examples via prior interactions with a particular person. We show that robots can combine language-based planning and perception with the few-shot summarization capabilities of large language models to infer generalized user preferences that are broadly applicable to future interactions. This approach enables fast adaptation and achieves 91.2% accuracy on unseen objects in our benchmark dataset. We also demonstrate our approach on a real-world mobile manipulator called TidyBot, which successfully puts away 85.0% of objects in real-world test scenarios.

为了让机器人有效地个性化物理辅助,它必须了解用户的偏好,这些偏好通常可以在未来的场景中重新应用。在这项工作中,我们研究了家庭清洁的个性化,机器人可以通过捡起物体并把它们放好来清理房间。一个关键的挑战是确定每件物品的合适放置位置,因为人们的偏好可能因个人品味或文化背景而有很大差异。例如,一个人可能喜欢把衬衫放在抽屉里,而另一个人可能喜欢把它们放在架子上。我们的目标是建立一个系统,可以通过与特定的人之前的互动,从少数例子中学习这种偏好。我们表明,机器人可以将基于语言的规划和感知与大型语言模型的少量汇总能力相结合,以推断广泛适用于未来交互的广义用户偏好。该方法实现了快速自适应,并在基准数据集中对未见对象实现了91.2%的准确率。我们还在一个名为TidyBot的真实世界的移动机械手上展示了我们的方法,它在真实世界的测试场景中成功地收起了85.0%的物体。
{"title":"TidyBot: personalized robot assistance with large language models","authors":"Jimmy Wu,&nbsp;Rika Antonova,&nbsp;Adam Kan,&nbsp;Marion Lepert,&nbsp;Andy Zeng,&nbsp;Shuran Song,&nbsp;Jeannette Bohg,&nbsp;Szymon Rusinkiewicz,&nbsp;Thomas Funkhouser","doi":"10.1007/s10514-023-10139-z","DOIUrl":"10.1007/s10514-023-10139-z","url":null,"abstract":"<div><p>For a robot to personalize physical assistance effectively, it must learn user preferences that can be generally reapplied to future scenarios. In this work, we investigate personalization of household cleanup with robots that can tidy up rooms by picking up objects and putting them away. A key challenge is determining the proper place to put each object, as people’s preferences can vary greatly depending on personal taste or cultural background. For instance, one person may prefer storing shirts in the drawer, while another may prefer them on the shelf. We aim to build systems that can learn such preferences from just a handful of examples via prior interactions with a particular person. We show that robots can combine language-based planning and perception with the few-shot summarization capabilities of large language models to infer generalized user preferences that are broadly applicable to future interactions. This approach enables fast adaptation and achieves 91.2% accuracy on unseen objects in our benchmark dataset. We also demonstrate our approach on a real-world mobile manipulator called TidyBot, which successfully puts away 85.0% of objects in real-world test scenarios.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1087 - 1102"},"PeriodicalIF":3.5,"publicationDate":"2023-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Learning to summarize and answer questions about a virtual robot’s past actions 学习总结和回答关于虚拟机器人过去行为的问题
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-16 DOI: 10.1007/s10514-023-10134-4
Chad DeChant, Iretiayo Akinola, Daniel Bauer

When robots perform long action sequences, users will want to easily and reliably find out what they have done. We therefore demonstrate the task of learning to summarize and answer questions about a robot agent’s past actions using natural language alone. A single system with a large language model at its core is trained to both summarize and answer questions about action sequences given ego-centric video frames of a virtual robot and a question prompt. To enable training of question answering, we develop a method to automatically generate English-language questions and answers about objects, actions, and the temporal order in which actions occurred during episodes of robot action in the virtual environment. Training one model to both summarize and answer questions enables zero-shot transfer of representations of objects learned through question answering to improved action summarization.

当机器人执行长动作序列时,用户会想要轻松可靠地找出它们做了什么。因此,我们展示了学习的任务,即仅使用自然语言来总结和回答关于机器人代理过去行为的问题。一个以大型语言模型为核心的单一系统被训练来总结和回答关于动作序列的问题,给定一个虚拟机器人的以自我为中心的视频帧和一个问题提示。为了实现问题回答的训练,我们开发了一种方法来自动生成关于对象、动作和虚拟环境中机器人动作期间动作发生的时间顺序的英语问题和答案。训练一个模型来总结和回答问题,可以将通过回答问题学习到的对象的表示零概率转移到改进的动作总结。
{"title":"Learning to summarize and answer questions about a virtual robot’s past actions","authors":"Chad DeChant,&nbsp;Iretiayo Akinola,&nbsp;Daniel Bauer","doi":"10.1007/s10514-023-10134-4","DOIUrl":"10.1007/s10514-023-10134-4","url":null,"abstract":"<div><p>When robots perform long action sequences, users will want to easily and reliably find out what they have done. We therefore demonstrate the task of learning to summarize and answer questions about a robot agent’s past actions using natural language alone. A single system with a large language model at its core is trained to both summarize and answer questions about action sequences given ego-centric video frames of a virtual robot and a question prompt. To enable training of question answering, we develop a method to automatically generate English-language questions and answers about objects, actions, and the temporal order in which actions occurred during episodes of robot action in the virtual environment. Training one model to both summarize and answer questions enables zero-shot transfer of representations of objects learned through question answering to improved action summarization. \u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1103 - 1118"},"PeriodicalIF":3.5,"publicationDate":"2023-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10134-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Text2Motion: from natural language instructions to feasible plans Text2Motion:从自然语言指令到可行的计划
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-14 DOI: 10.1007/s10514-023-10131-7
Kevin Lin, Christopher Agia, Toki Migimatsu, Marco Pavone, Jeannette Bohg

We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task planning with Large Language Models. Whereas previous language-based planners only consider the feasibility of individual skills, Text2Motion actively resolves geometric dependencies spanning skill sequences by performing geometric feasibility planning during its search. We evaluate our method on a suite of problems that require long-horizon reasoning, interpretation of abstract goals, and handling of partial affordance perception. Our experiments show that Text2Motion can solve these challenging problems with a success rate of 82%, while prior state-of-the-art language-based planning methods only achieve 13%. Text2Motion thus provides promising generalization characteristics to semantically diverse sequential manipulation tasks with geometric dependencies between skills. Qualitative results are made available at https://sites.google.com/stanford.edu/text2motion.

我们提出Text2Motion,一个基于语言的规划框架,使机器人能够解决需要长期推理的顺序操作任务。给定一个自然语言指令,我们的框架构建了一个任务级和动作级计划,该计划被验证以达到推断的符号目标。Text2Motion使用在技能库的q函数中编码的可行性启发式来指导大型语言模型的任务规划。以前基于语言的规划器只考虑单个技能的可行性,而Text2Motion通过在搜索过程中执行几何可行性规划,主动解决跨越技能序列的几何依赖性。我们在一系列问题上评估了我们的方法,这些问题需要长期的推理,抽象目标的解释,以及部分可视性感知的处理。我们的实验表明,Text2Motion可以以82%的成功率解决这些具有挑战性的问题,而之前最先进的基于语言的规划方法只有13%的成功率。因此,Text2Motion为技能之间具有几何依赖性的语义多样的顺序操作任务提供了有希望的泛化特征。定性结果可在https://sites.google.com/stanford.edu/text2motion查阅。
{"title":"Text2Motion: from natural language instructions to feasible plans","authors":"Kevin Lin,&nbsp;Christopher Agia,&nbsp;Toki Migimatsu,&nbsp;Marco Pavone,&nbsp;Jeannette Bohg","doi":"10.1007/s10514-023-10131-7","DOIUrl":"10.1007/s10514-023-10131-7","url":null,"abstract":"<div><p>We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task planning with Large Language Models. Whereas previous language-based planners only consider the feasibility of individual skills, Text2Motion actively resolves geometric dependencies spanning skill sequences by performing geometric feasibility planning during its search. We evaluate our method on a suite of problems that require long-horizon reasoning, interpretation of abstract goals, and handling of partial affordance perception. Our experiments show that Text2Motion can solve these challenging problems with a success rate of 82%, while prior state-of-the-art language-based planning methods only achieve 13%. Text2Motion thus provides promising generalization characteristics to semantically diverse sequential manipulation tasks with geometric dependencies between skills. Qualitative results are made available at https://sites.google.com/stanford.edu/text2motion.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1345 - 1365"},"PeriodicalIF":3.5,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134954182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 73
期刊
Autonomous Robots
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1