A Pilot Evaluation of a Conversational Listener for Conversational User Interfaces
M. Aylett, Andrea Carmantini, Chris Pidcock, Eric Nichols, Randy Gomez
Proceedings of the 5th International Conference on Conversational User Interfaces, 2023-07-19
DOI: 10.1145/3571884.3605871
Citations: 1
Abstract
Current spoken conversational user interfaces (CUIs) are predominantly implemented using a sequential, utterance-based, two-party, speak-wait/speak-wait approach. Human-human conversation, in contrast, 1) is not sequential, featuring overlap, interruption, and back channels; 2) involves processing utterances before they are complete; and 3) is often multi-party. As part of Honda Research Institute's Haru project, a lightweight word-spotting speech recognition system (a conversational listener) was implemented to allow very fast turn-taking in simple voice interaction conditions. In this paper, we present a pilot evaluation of the conversational listener in a script-follower context (which allows a robot to act out a dialog with a user). We compare a disembodied version of the system with expressive synthesis to Alexa, with and without fast turn-taking. Qualitative results indicate that users were sensitive to turn-taking delay and to characterful speech synthesis.
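To make the core idea concrete, below is a minimal sketch of a word-spotting "conversational listener" driving a script follower. This is a hypothetical illustration, not the paper's implementation: the `ScriptTurn`, `spot`, and `run_script` names, the match threshold, and the simulated word stream are all assumptions. It shows how matching incoming words against the next expected scripted line lets the system take the turn as soon as enough of the line is spotted, instead of waiting for an end-of-utterance silence.

```python
"""Hypothetical sketch of a word-spotting conversational listener
for a script-follower dialog (not the paper's actual system)."""

from dataclasses import dataclass


@dataclass
class ScriptTurn:
    expected_user_line: str  # line the user is expected to say
    robot_response: str      # line the system speaks next


# Toy two-turn script (illustrative content only).
SCRIPT = [
    ScriptTurn("hello there", "Hi! Nice to meet you."),
    ScriptTurn("how are you today", "I'm doing great, thanks for asking."),
]


def spot(words_heard: list[str], expected: str, threshold: float = 0.6) -> bool:
    """Return True once enough of the expected line's words are heard.

    The 0.6 threshold is an assumed tuning parameter: it lets the
    system commit to a turn before the utterance is complete.
    """
    targets = set(expected.lower().split())
    hits = sum(1 for w in words_heard if w.lower() in targets)
    return hits / len(targets) >= threshold


def run_script(word_stream):
    """Consume a stream of recognized words one at a time; respond as
    soon as the current expected line is spotted (fast turn-taking)."""
    turn_idx, heard = 0, []
    for word in word_stream:
        if turn_idx >= len(SCRIPT):
            break
        heard.append(word)
        if spot(heard, SCRIPT[turn_idx].expected_user_line):
            print(f"[robot] {SCRIPT[turn_idx].robot_response}")
            turn_idx, heard = turn_idx + 1, []


if __name__ == "__main__":
    # Simulated incremental ASR output, delivered word by word.
    run_script("hello there um how are you today".split())
```

Note how the second response fires after "how are you" is heard, before "today" arrives: spotting a sufficient fraction of the expected line is what enables turn-taking latencies well below a conventional endpoint-detection pipeline.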