Scarecrows in Oz: The Use of Large Language Models in HRI

IF 4.2 Q2 ROBOTICS ACM Transactions on Human-Robot Interaction Pub Date : 2024-01-30 DOI:10.1145/3606261

Tom Williams, Cynthia Matuszek, Ross Mead, Nick Depalma

{"title":"Scarecrows in Oz: The Use of Large Language Models in HRI","authors":"Tom Williams, Cynthia Matuszek, Ross Mead, Nick Depalma","doi":"10.1145/3606261","DOIUrl":null,"url":null,"abstract":"\n The proliferation of Large Language Models (LLMs) presents both a critical design challenge and a remarkable opportunity for the field of Human–Robot Interaction (HRI). While the direct deployment of LLMs on interactive robots may be unsuitable for reasons of ethics, safety, and control, LLMs might nevertheless provide a promising baseline technique for many elements of HRI. Specifically, in this article, we argue for the use of LLMs as\n Scarecrows\n : “brainless,” straw-man black-box modules integrated into robot architectures for the purpose of quickly enabling full-pipeline solutions, much like the use of “Wizard of Oz” (WoZ) and other human-in-the-loop approaches. We explicitly acknowledge that these Scarecrows, rather than providing a satisfying or scientifically complete solution, incorporate a form of the wisdom of the crowd and, in at least some cases, will ultimately need to be replaced or supplemented by a robust and theoretically motivated solution. We provide examples of how Scarecrows could be used in language-capable robot architectures as useful placeholders and suggest initial reporting guidelines for authors, mirroring existing guidelines for the use and reporting of WoZ techniques.\n","PeriodicalId":36515,"journal":{"name":"ACM Transactions on Human-Robot Interaction","volume":null,"pages":null},"PeriodicalIF":4.2000,"publicationDate":"2024-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Human-Robot Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3606261","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 6

Abstract

The proliferation of Large Language Models (LLMs) presents both a critical design challenge and a remarkable opportunity for the field of Human–Robot Interaction (HRI). While the direct deployment of LLMs on interactive robots may be unsuitable for reasons of ethics, safety, and control, LLMs might nevertheless provide a promising baseline technique for many elements of HRI. Specifically, in this article, we argue for the use of LLMs as Scarecrows : “brainless,” straw-man black-box modules integrated into robot architectures for the purpose of quickly enabling full-pipeline solutions, much like the use of “Wizard of Oz” (WoZ) and other human-in-the-loop approaches. We explicitly acknowledge that these Scarecrows, rather than providing a satisfying or scientifically complete solution, incorporate a form of the wisdom of the crowd and, in at least some cases, will ultimately need to be replaced or supplemented by a robust and theoretically motivated solution. We provide examples of how Scarecrows could be used in language-capable robot architectures as useful placeholders and suggest initial reporting guidelines for authors, mirroring existing guidelines for the use and reporting of WoZ techniques.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

绿野仙踪中的稻草人：大型语言模型在人机交互中的应用

大型语言模型（LLMs）的大量涌现，对人机交互（HRI）领域来说，既是一个严峻的设计挑战，也是一个难得的机遇。虽然出于道德、安全和控制方面的考虑，在交互式机器人上直接部署 LLMs 可能并不合适，但 LLMs 仍有可能为 HRI 的许多要素提供一种前景广阔的基准技术。具体来说，在本文中，我们主张将 LLMs 用作 "稻草人"：将 "无脑 "的草人黑盒子模块集成到机器人架构中，目的是快速实现全管道解决方案，就像使用 "绿野仙踪"（WoZ）和其他 "人在回路中 "的方法一样。我们明确承认，这些 "稻草人 "并不能提供令人满意或科学上完整的解决方案，而是包含了某种形式的群众智慧，至少在某些情况下，最终需要由一个强大的、有理论依据的解决方案来取代或补充。我们举例说明了稻草人如何作为有用的占位符用于可使用语言的机器人架构，并为作者提出了初步的报告指南，这与现有的 WoZ 技术使用和报告指南如出一辙。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Human-Robot Interaction Computer Science-Artificial Intelligence

CiteScore

7.70

自引率

5.90%

发文量

期刊介绍： ACM Transactions on Human-Robot Interaction (THRI) is a prestigious Gold Open Access journal that aspires to lead the field of human-robot interaction as a top-tier, peer-reviewed, interdisciplinary publication. The journal prioritizes articles that significantly contribute to the current state of the art, enhance overall knowledge, have a broad appeal, and are accessible to a diverse audience. Submissions are expected to meet a high scholarly standard, and authors are encouraged to ensure their research is well-presented, advancing the understanding of human-robot interaction, adding cutting-edge or general insights to the field, or challenging current perspectives in this research domain. THRI warmly invites well-crafted paper submissions from a variety of disciplines, encompassing robotics, computer science, engineering, design, and the behavioral and social sciences. The scholarly articles published in THRI may cover a range of topics such as the nature of human interactions with robots and robotic technologies, methods to enhance or enable novel forms of interaction, and the societal or organizational impacts of these interactions. The editorial team is also keen on receiving proposals for special issues that focus on specific technical challenges or that apply human-robot interaction research to further areas like social computing, consumer behavior, health, and education.