Autonomous Robots最新文献

英文中文

Editorial - Robotics: Science and Systems 2022 社论 - 机器人：科学与系统 2022

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-05-03 DOI: 10.1007/s10514-024-10161-9

引用次数: 0

Adaptive hybrid local–global sampling for fast informed sampling-based optimal path planning 通过自适应知情采样加速基于采样的最优路径规划

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-04-20 DOI: 10.1007/s10514-024-10157-5

Marco Faroni, Nicola Pedrocchi, Manuel Beschi

This paper improves the performance of RRT(^*)-like sampling-based path planners by combining admissible informed sampling and local sampling (i.e., sampling the neighborhood of the current solution). An adaptive strategy regulates the trade-off between exploration (admissible informed sampling) and exploitation (local sampling) based on online rewards from previous samples. The paper demonstrates that the algorithm is asymptotically optimal and has a better convergence rate than state-of-the-art path planners (e.g., Informed-RRT(^*)) in several simulated and real-world scenarios. An open-source, ROS-compatible implementation of the algorithm is publicly available.

本文通过结合可允许的知情采样和局部采样（即对当前解的邻域进行采样），提高了类似采样的路径规划器的性能。一种自适应策略会根据之前采样的在线回报来调节探索（允许的知情采样）和利用（局部采样）之间的权衡。论文证明，在多个模拟和实际场景中，所产生的算法是渐进最优的，而且收敛速度优于最先进的路径规划器（例如，Informed-RRT/(^*)）。该算法的开源、兼容 ROS 的实现已公开发布。

引用次数: 0

Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control 仿人机器人导航的模仿行为强化学习：同步规划与控制

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-04-17 DOI: 10.1007/s10514-024-10160-w

Xiaoying Wang, Tong Zhang

Humanoid robots have strong adaptability to complex environments and possess human-like flexibility, enabling them to perform precise farming and harvesting tasks in varying depths of terrains. They serve as essential tools for agricultural intelligence. In this article, a novel method was proposed to improve the robustness of autonomous navigation for humanoid robots, which intercommunicates the data fusion of the footprint planning and control levels. In particular, a deep reinforcement learning model - Proximal Policy Optimization (PPO) that has been fine-tuned is introduced into this layer, before which heuristic trajectory was generated based on imitation learning. In the RL period, the KL divergence between the agent’s policy and imitative expert policy as a value penalty is added to the advantage function. As a proof of concept, our navigation policy is trained in a robotic simulator and then successfully applied to the physical robot GTX for indoor multi-mode navigation. The experimental results conclude that incorporating imitation learning imparts anthropomorphic attributes to robots and facilitates the generation of seamless footstep patterns. There is a significant improvement in ZMP trajectory in y-direction from the center by 21.56% is noticed. Additionally, this method improves dynamic locomotion stability, the body attitude angle falling between less than ± 5.5(^circ ) compared to ± 48.4(^circ ) with traditional algorithm. In general, navigation error is below 5 cm, which we verified in the experiments. It is thought that the outcome of the proposed framework presented in this article can provide a reference for researchers studying autonomous navigation applications of humanoid robots on uneven ground.

仿人机器人对复杂环境有很强的适应能力，具有类似人类的灵活性，能够在不同深度的地形中执行精确的耕作和收割任务。它们是农业智能的重要工具。本文提出了一种提高仿人机器人自主导航鲁棒性的新方法，该方法将足迹规划和控制层面的数据融合起来。特别是，在这一层中引入了经过微调的深度强化学习模型--近端策略优化（PPO），在此之前，基于模仿学习生成启发式轨迹。在 RL 阶段，代理策略与模仿专家策略之间的 KL 发散作为一种价值惩罚被添加到优势函数中。作为概念验证，我们在机器人模拟器中训练了导航策略，并将其成功应用于物理机器人 GTX 的室内多模式导航。实验结果表明，模仿学习赋予了机器人拟人属性，并有助于生成无缝脚步模式。ZMP轨迹在从中心开始的Y方向上有明显改善，改善幅度达21.56%。此外，该方法还提高了动态运动的稳定性，与传统算法的± 48.4（^circ ）相比，该方法的身体姿态角小于± 5.5（^circ ）。一般来说，导航误差低于 5 厘米，这一点我们在实验中得到了验证。本文提出的框架成果可以为研究仿人机器人在不平整地面上的自主导航应用提供参考。

{"title":"Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control","authors":"Xiaoying Wang, Tong Zhang","doi":"10.1007/s10514-024-10160-w","DOIUrl":"10.1007/s10514-024-10160-w","url":null,"abstract":"<div><p>Humanoid robots have strong adaptability to complex environments and possess human-like flexibility, enabling them to perform precise farming and harvesting tasks in varying depths of terrains. They serve as essential tools for agricultural intelligence. In this article, a novel method was proposed to improve the robustness of autonomous navigation for humanoid robots, which intercommunicates the data fusion of the footprint planning and control levels. In particular, a deep reinforcement learning model - Proximal Policy Optimization (PPO) that has been fine-tuned is introduced into this layer, before which heuristic trajectory was generated based on imitation learning. In the RL period, the KL divergence between the agent’s policy and imitative expert policy as a value penalty is added to the advantage function. As a proof of concept, our navigation policy is trained in a robotic simulator and then successfully applied to the physical robot <i>GTX</i> for indoor multi-mode navigation. The experimental results conclude that incorporating imitation learning imparts anthropomorphic attributes to robots and facilitates the generation of seamless footstep patterns. There is a significant improvement in ZMP trajectory in y-direction from the center by 21.56% is noticed. Additionally, this method improves dynamic locomotion stability, the body attitude angle falling between less than ± 5.5<span>(^circ )</span> compared to ± 48.4<span>(^circ )</span> with traditional algorithm. In general, navigation error is below 5 cm, which we verified in the experiments. It is thought that the outcome of the proposed framework presented in this article can provide a reference for researchers studying autonomous navigation applications of humanoid robots on uneven ground.\u0000</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 2-3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140608698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Terrain traversability prediction through self-supervised learning and unsupervised domain adaptation on synthetic data 通过合成数据上的自监督学习和无监督域适应进行地形可穿越性预测

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-03-30 DOI: 10.1007/s10514-024-10158-4

Giuseppe Vecchio, Simone Palazzo, Dario C. Guastella, Daniela Giordano, Giovanni Muscato, Concetto Spampinato

Terrain traversability estimation is a fundamental task for supporting robot navigation on uneven surfaces. Recent learning-based approaches for predicting traversability from RGB images have shown promising results, but require manual annotation of a large number of images for training. To address this limitation, we present a method for traversability estimation on unlabeled videos that combines dataset synthesis, self-supervision and unsupervised domain adaptation. We pose the traversability estimation as a vector regression task over vertical bands of the observed frame. The model is pre-trained through self-supervision to reduce the distribution shift between synthetic and real data and encourage shared feature learning. Then, supervised training on synthetic videos is carried out, while employing an unsupervised domain adaptation loss to improve its generalization capabilities on real scenes. Experimental results show that our approach is on par with standard supervised training, and effectively supports robot navigation without the need of manual annotations. Training code and synthetic dataset will be publicly released at: https://github.com/perceivelab/traversability-synth.

地形可穿越性估算是支持机器人在不平路面上导航的一项基本任务。最近基于学习的 RGB 图像可穿越性预测方法取得了可喜的成果，但需要对大量图像进行人工标注训练。为了解决这一局限性，我们提出了一种在无标注视频上进行可穿越性估算的方法，该方法结合了数据集合成、自监督和无监督领域适应。我们将可穿越性估算看作是对观察到的帧的垂直带进行向量回归的任务。通过自我监督对模型进行预训练，以减少合成数据和真实数据之间的分布偏移，并鼓励共享特征学习。然后，在合成视频上进行监督训练，同时采用无监督域适应损失来提高其在真实场景上的泛化能力。实验结果表明，我们的方法与标准的监督训练不相上下，无需人工标注即可有效支持机器人导航。训练代码和合成数据集将在以下网站公开发布：https://github.com/perceivelab/traversability-synth。

{"title":"Terrain traversability prediction through self-supervised learning and unsupervised domain adaptation on synthetic data","authors":"Giuseppe Vecchio, Simone Palazzo, Dario C. Guastella, Daniela Giordano, Giovanni Muscato, Concetto Spampinato","doi":"10.1007/s10514-024-10158-4","DOIUrl":"10.1007/s10514-024-10158-4","url":null,"abstract":"<div><p>Terrain traversability estimation is a fundamental task for supporting robot navigation on uneven surfaces. Recent learning-based approaches for predicting traversability from RGB images have shown promising results, but require manual annotation of a large number of images for training. To address this limitation, we present a method for traversability estimation on unlabeled videos that combines dataset synthesis, self-supervision and unsupervised domain adaptation. We pose the traversability estimation as a vector regression task over vertical bands of the observed frame. The model is pre-trained through self-supervision to reduce the distribution shift between synthetic and real data and encourage shared feature learning. Then, supervised training on synthetic videos is carried out, while employing an unsupervised domain adaptation loss to improve its generalization capabilities on real scenes. Experimental results show that our approach is on par with standard supervised training, and effectively supports robot navigation without the need of manual annotations. Training code and synthetic dataset will be publicly released at: https://github.com/perceivelab/traversability-synth.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 2-3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-024-10158-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140364755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Maximal coverage problems with routing constraints using cross-entropy Monte Carlo tree search 利用交叉熵蒙特卡洛树搜索解决具有路由限制的最大覆盖问题

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-01-30 DOI: 10.1007/s10514-024-10156-6

Pao-Te Lin, Kuo-Shih Tseng

Spatial search, and environmental monitoring are key technologies in robotics. These problems can be reformulated as maximal coverage problems with routing constraints, which are NP-hard problems. The generalized cost-benefit algorithm (GCB) can solve these problems with theoretical guarantees. To achieve better performance, evolutionary algorithms (EA) boost its performance via more samples. However, it is hard to know the terminal conditions of EA to outperform GCB. To solve these problems with theoretical guarantees and terminal conditions, in this research, the cross-entropy based Monte Carlo Tree Search algorithm (CE-MCTS) is proposed. It consists of three parts: the EA for sampling the branches, the upper confidence bound policy for selections, and the estimation of distribution algorithm for simulations. The experiments demonstrate that the CE-MCTS outperforms benchmark approaches (e.g., GCB, EAMC) in spatial search problems.

空间搜索和环境监测是机器人技术中的关键技术。这些问题可以被重新表述为带有路由约束的最大覆盖问题，是 NP 难问题。广义成本收益算法（GCB）可以在理论上保证解决这些问题。为了获得更好的性能，进化算法（EA）通过增加样本来提高性能。然而，我们很难知道 EA 优于 GCB 的最终条件。为了解决这些具有理论保证和终端条件的问题，本研究提出了基于交叉熵的蒙特卡洛树搜索算法（CE-MCTS）。该算法由三部分组成：用于分支采样的 EA、用于选择的置信上限策略和用于模拟的分布估计算法。实验证明，在空间搜索问题上，CE-MCTS 优于基准方法（如 GCB、EAMC）。

引用次数: 0

Collocation methods for second and higher order systems 二阶和高阶系统的搭配方法

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-01-28 DOI: 10.1007/s10514-023-10155-z

Siro Moreno-Martín, Lluís Ros, Enric Celaya

It is often unnoticed that the predominant way to use collocation methods is fundamentally flawed when applied to optimal control in robotics. Such methods assume that the system dynamics is given by a first order ODE, whereas robots are often governed by a second or higher order ODE involving configuration variables and their time derivatives. To apply a collocation method, therefore, the usual practice is to resort to the well known procedure of casting an Mth order ODE into M first order ones. This manipulation, which in the continuous domain is perfectly valid, leads to inconsistencies when the problem is discretized. Since the configuration variables and their time derivatives are approximated with polynomials of the same degree, their differential dependencies cannot be fulfilled, and the actual dynamics is not satisfied, not even at the collocation points. This paper draws attention to this problem, and develops improved versions of the trapezoidal and Hermite–Simpson collocation methods that do not present these inconsistencies. In many cases, the new methods reduce the dynamics transcription error in one order of magnitude, or even more, without noticeably increasing the cost of computing the solutions.

人们往往没有注意到，在应用于机器人优化控制时，主要的搭配方法存在根本性缺陷。这种方法假定系统动力学由一阶 ODE 给出，而机器人通常受二阶或更高阶的 ODE 控制，其中涉及配置变量及其时间导数。因此，要应用配位法，通常的做法是采用众所周知的将 M 阶 ODE 转化为 M 阶一阶 ODE 的程序。这种操作方法在连续域中完全有效，但在问题离散化时却会导致不一致。由于配置变量及其时间导数是用同阶多项式逼近的，因此无法满足它们的微分依赖关系，也就无法满足实际的动力学要求，甚至在配置点上也是如此。本文提请注意这一问题，并开发了梯形和赫米特-辛普森配准方法的改进版本，这些方法不会出现这些不一致问题。在许多情况下，新方法将动力学转录误差减少了一个数量级，甚至更多，而计算求解的成本却没有明显增加。

{"title":"Collocation methods for second and higher order systems","authors":"Siro Moreno-Martín, Lluís Ros, Enric Celaya","doi":"10.1007/s10514-023-10155-z","DOIUrl":"10.1007/s10514-023-10155-z","url":null,"abstract":"<div><p>It is often unnoticed that the predominant way to use collocation methods is fundamentally flawed when applied to optimal control in robotics. Such methods assume that the system dynamics is given by a first order ODE, whereas robots are often governed by a second or higher order ODE involving configuration variables and their time derivatives. To apply a collocation method, therefore, the usual practice is to resort to the well known procedure of casting an <i>M</i>th order ODE into <i>M</i> first order ones. This manipulation, which in the continuous domain is perfectly valid, leads to inconsistencies when the problem is discretized. Since the configuration variables and their time derivatives are approximated with polynomials of the same degree, their differential dependencies cannot be fulfilled, and the actual dynamics is not satisfied, not even at the collocation points. This paper draws attention to this problem, and develops improved versions of the trapezoidal and Hermite–Simpson collocation methods that do not present these inconsistencies. In many cases, the new methods reduce the dynamics transcription error in one order of magnitude, or even more, without noticeably increasing the cost of computing the solutions.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10155-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139579306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Boosting the hospital by integrating mobile robotic assistance systems: a comprehensive classification of the risks to be addressed 通过整合移动机器人辅助系统促进医院发展：应对风险的全面分类

IF 3.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2024-01-24 DOI: 10.1007/s10514-023-10154-0

Lukas Bernhard, Patrik Schwingenschlögl, Jörg Hofmann, Dirk Wilhelm, Alois Knoll

Mobile service robots are a promising technology for supporting workflows throughout the hospital. Combined with an understanding of the environment and the current situation, such systems have the potential to become invaluable tools for overcoming personal shortages and streamlining healthcare workflows. However, few robotic systems have actually been translated to practical application so far, which is due to many challenges centered around the strict and unique requirements imposed by the different hospital environments, which have not yet been collected and analyzed in a structured manner. To address this need, we now present a comprehensive classification of different dimensions of risk to be considered when designing mobile service robots for the hospital. Our classification consists of six risk categories – environmental complexity, hygienic requirements, interaction with persons and objects, workflow flexibility and autonomy – for each of which a scale with distinct risk levels is provided. This concept, for the first time allows for a precise classification of mobile service robots for the hospital, which can prove useful for certification and admission procedures as well as for defining architectural and safety requirements throughout the design process of such robots.

移动服务机器人是一项前景广阔的技术，可为整个医院的工作流程提供支持。结合对环境和现状的了解，这类系统有可能成为克服人员短缺和简化医疗保健工作流程的宝贵工具。然而，迄今为止，很少有机器人系统真正投入实际应用，这是由于不同医院环境所提出的严格而独特的要求带来了许多挑战，而这些挑战尚未以结构化的方式加以收集和分析。为了满足这一需求，我们现在对设计医院移动服务机器人时需要考虑的不同风险维度进行全面分类。我们的分类包括六个风险类别--环境复杂性、卫生要求、与人和物体的互动、工作流程灵活性和自主性--并为每个类别提供了具有不同风险等级的量表。这一概念首次对医院用移动服务机器人进行了精确分类，可用于认证和入院程序，以及在此类机器人的整个设计过程中确定建筑和安全要求。

{"title":"Boosting the hospital by integrating mobile robotic assistance systems: a comprehensive classification of the risks to be addressed","authors":"Lukas Bernhard, Patrik Schwingenschlögl, Jörg Hofmann, Dirk Wilhelm, Alois Knoll","doi":"10.1007/s10514-023-10154-0","DOIUrl":"10.1007/s10514-023-10154-0","url":null,"abstract":"<div><p>Mobile service robots are a promising technology for supporting workflows throughout the hospital. Combined with an understanding of the environment and the current situation, such systems have the potential to become invaluable tools for overcoming personal shortages and streamlining healthcare workflows. However, few robotic systems have actually been translated to practical application so far, which is due to many challenges centered around the strict and unique requirements imposed by the different hospital environments, which have not yet been collected and analyzed in a structured manner. To address this need, we now present a comprehensive classification of different dimensions of risk to be considered when designing mobile service robots for the hospital. Our classification consists of six risk categories – environmental complexity, hygienic requirements, interaction with persons and objects, workflow flexibility and autonomy – for each of which a scale with distinct risk levels is provided. This concept, for the first time allows for a precise classification of mobile service robots for the hospital, which can prove useful for certification and admission procedures as well as for defining architectural and safety requirements throughout the design process of such robots.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"48 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10514-023-10154-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139558975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamic task allocation approaches for coordinated exploration of Subterranean environments 地下环境协同勘探的动态任务分配方法

IF 3.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2023-11-23 DOI: 10.1007/s10514-023-10142-4

Matthew O’Brien, Jason Williams, Shengkang Chen, Alex Pitt, Ronald Arkin, Navinda Kottege

This paper presents the methods used by team CSIRO Data61 for multi-agent coordination and exploration in the DARPA Subterranean (SubT) Challenge. The SubT competition involved a single operator sending teams of robots to rapidly explore underground environments with severe navigation and communication challenges. Coordination was framed as a multi-robot task allocation (MRTA) problem to allow for a seamless integration of exploration with other required tasks. Methods for extending a consensus-based task allocation approach for an online and highly dynamic mission are discussed. Exploration tasks were generated from frontiers in a map of traversable space, and graph-based heuristics applied to guide the selection of exploration tasks. Results from simulation, field testing, and the final competition are presented. Team CSIRO Data61 tied for most points scored and achieved second place during the final SubT event.

本文介绍了CSIRO Data61团队在DARPA地下(SubT)挑战赛中用于多智能体协调和探索的方法。SubT竞赛涉及单个操作员派遣机器人团队快速探索具有严峻导航和通信挑战的地下环境。协调被框架为一个多机器人任务分配(MRTA)问题，以允许探索与其他所需任务的无缝集成。讨论了将基于共识的任务分配方法扩展到在线高动态任务的方法。从可穿越空间地图的边界生成探索任务，并应用基于图的启发式方法指导探索任务的选择。给出了仿真、现场测试和决赛的结果。CSIRO Data61队在最后的SubT赛事中获得了最多的得分并获得了第二名。

引用次数: 1

AuRo special issue on large language models in robotics guest editorial AuRo关于机器人中的大型语言模型的特刊客座编辑

IF 3.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2023-11-17 DOI: 10.1007/s10514-023-10153-1

引用次数: 0

TidyBot: personalized robot assistance with large language models TidyBot:具有大型语言模型的个性化机器人辅助

IF 3.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Autonomous Robots

Pub Date : 2023-11-16 DOI: 10.1007/s10514-023-10139-z

Jimmy Wu, Rika Antonova, Adam Kan, Marion Lepert, Andy Zeng, Shuran Song, Jeannette Bohg, Szymon Rusinkiewicz, Thomas Funkhouser

For a robot to personalize physical assistance effectively, it must learn user preferences that can be generally reapplied to future scenarios. In this work, we investigate personalization of household cleanup with robots that can tidy up rooms by picking up objects and putting them away. A key challenge is determining the proper place to put each object, as people’s preferences can vary greatly depending on personal taste or cultural background. For instance, one person may prefer storing shirts in the drawer, while another may prefer them on the shelf. We aim to build systems that can learn such preferences from just a handful of examples via prior interactions with a particular person. We show that robots can combine language-based planning and perception with the few-shot summarization capabilities of large language models to infer generalized user preferences that are broadly applicable to future interactions. This approach enables fast adaptation and achieves 91.2% accuracy on unseen objects in our benchmark dataset. We also demonstrate our approach on a real-world mobile manipulator called TidyBot, which successfully puts away 85.0% of objects in real-world test scenarios.

为了让机器人有效地个性化物理辅助，它必须了解用户的偏好，这些偏好通常可以在未来的场景中重新应用。在这项工作中，我们研究了家庭清洁的个性化，机器人可以通过捡起物体并把它们放好来清理房间。一个关键的挑战是确定每件物品的合适放置位置，因为人们的偏好可能因个人品味或文化背景而有很大差异。例如，一个人可能喜欢把衬衫放在抽屉里，而另一个人可能喜欢把它们放在架子上。我们的目标是建立一个系统，可以通过与特定的人之前的互动，从少数例子中学习这种偏好。我们表明，机器人可以将基于语言的规划和感知与大型语言模型的少量汇总能力相结合，以推断广泛适用于未来交互的广义用户偏好。该方法实现了快速自适应，并在基准数据集中对未见对象实现了91.2%的准确率。我们还在一个名为TidyBot的真实世界的移动机械手上展示了我们的方法，它在真实世界的测试场景中成功地收起了85.0%的物体。

{"title":"TidyBot: personalized robot assistance with large language models","authors":"Jimmy Wu, Rika Antonova, Adam Kan, Marion Lepert, Andy Zeng, Shuran Song, Jeannette Bohg, Szymon Rusinkiewicz, Thomas Funkhouser","doi":"10.1007/s10514-023-10139-z","DOIUrl":"10.1007/s10514-023-10139-z","url":null,"abstract":"<div><p>For a robot to personalize physical assistance effectively, it must learn user preferences that can be generally reapplied to future scenarios. In this work, we investigate personalization of household cleanup with robots that can tidy up rooms by picking up objects and putting them away. A key challenge is determining the proper place to put each object, as people’s preferences can vary greatly depending on personal taste or cultural background. For instance, one person may prefer storing shirts in the drawer, while another may prefer them on the shelf. We aim to build systems that can learn such preferences from just a handful of examples via prior interactions with a particular person. We show that robots can combine language-based planning and perception with the few-shot summarization capabilities of large language models to infer generalized user preferences that are broadly applicable to future interactions. This approach enables fast adaptation and achieves 91.2% accuracy on unseen objects in our benchmark dataset. We also demonstrate our approach on a real-world mobile manipulator called TidyBot, which successfully puts away 85.0% of objects in real-world test scenarios.</p></div>","PeriodicalId":55409,"journal":{"name":"Autonomous Robots","volume":"47 8","pages":"1087 - 1102"},"PeriodicalIF":3.5,"publicationDate":"2023-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 68

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Autonomous Robots

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀