Human-in-the-loop Robotic Grasping Using BERT Scene Representation

Proceedings of COLING. International Conference on Computational Linguistics Pub Date : 2022-09-28 DOI:10.48550/arXiv.2209.14026

Yaoxian Song, Penglei Sun, Pengfei Fang, Linyi Yang, Yanghua Xiao, Yue Zhang

引用次数: 2

Abstract

Current NLP techniques have been greatly applied in different domains. In this paper, we propose a human-in-the-loop framework for robotic grasping in cluttered scenes, investigating a language interface to the grasping process, which allows the user to intervene by natural language commands. This framework is constructed on a state-of-the-art grasping baseline, where we substitute a scene-graph representation with a text representation of the scene using BERT. Experiments on both simulation and physical robot show that the proposed method outperforms conventional object-agnostic and scene-graph based methods in the literature. In addition, we find that with human intervention, performance can be significantly improved. Our dataset and code are available on our project website https://sites.google.com/view/hitl-grasping-bert.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于BERT场景表示的人在环机器人抓取

目前的自然语言处理技术在不同的领域得到了广泛的应用。在本文中，我们提出了一个用于混乱场景中机器人抓取的人在环框架，研究了抓取过程的语言接口，允许用户通过自然语言命令进行干预。该框架是在最先进的抓取基线上构建的，其中我们使用BERT用场景的文本表示代替场景图表示。在仿真和物理机器人上的实验表明，该方法优于传统的基于场景图和物体不可知的方法。此外，我们发现，通过人为干预，性能可以显著提高。我们的数据集和代码可在我们的项目网站https://sites.google.com/view/hitl-grasping-bert上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of COLING. International Conference on Computational Linguistics

自引率

0.00%

发文量