{"title":"The IBM natural conversation framework: a new paradigm for conversational UX design","authors":"Robert J. Moore, Sungeun An, G. Ren","doi":"10.1080/07370024.2022.2081571","DOIUrl":null,"url":null,"abstract":"User interfaces that take human conversation as their interaction metaphor work fundamentally differently than those that employ spatial metaphors, such as a desktop or a page. While the fundamental concept in visual interface design is the layout, the fundamental concept in conversational interface design is the sequence. Each provides for the overall structure of the user experience. In the past, user-interface designers have borrowed elements from the various areas of physical design. From industrial design, they have borrowed concepts such as, buttons, levers, wheels, and more from the print industry, they have borrowed the page, typography, iconography, illustration, and photography and more. These concepts from the physical world are then adapted to persistent, visual representations on a computer screen. Of course, virtual buttons are different from physical buttons and displayed words are different from printed words, but they evoke familiar ways of interacting with the real world that are then repurposed for a computer–user interface. And graphical user interface design, web design and mobile design are mature disciplines with shared standards and communities of practitioners. However, the spatial interaction metaphors of these areas of visual design largely do not apply to the design of conversational user interfaces (Moore & Arar, 2019; Moore et al., 2020; Murad et al., 2021; Yankelovich et al., 1995). Human conversational interaction consists primarily of sequences of words and embodied actions, not of layouts of visual elements. Buttons or pages cannot be represented as a stream of words produced by different parties. Conversational interfaces are more akin to command-line interaction, which involves sequences of specialized commands. The interaction conventions of visual design, graphical, web and mobile, were invented as an alternative to language-based interfaces and are not applicable to the design of conversational user experience. Where then can UX designers find inspiration when creating conversational interfaces with their sequences of natural-language utterances? In part, they can turn to Natural Language Processing (NLP), which provides mature methods for recognizing, classifying, and generating natural-language input and output (Chowdhury, 2003; Goldberg, 2017; Graves et al., 2013; López-Cózar et al., 2011; McTear et al., 2016; Reiter & Dale, 1997). These methods help designers understand what the user said and render realistic voice responses. However, NLP provides resources primarily for managing natural language, not for managing natural conversation. NLP addresses language use in any form: novels, poems, tweets, e-mails, conversations, etc. (Berg, 2015; Mitri, 2022; Peng et al., 2018; Zhang & Gao, 2017). Any bit of natural language, be it Spanish, English, Mandarin, etc., is analyzable with NLP.","PeriodicalId":56306,"journal":{"name":"Human-Computer Interaction","volume":null,"pages":null},"PeriodicalIF":4.5000,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human-Computer Interaction","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/07370024.2022.2081571","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 5
Abstract
User interfaces that take human conversation as their interaction metaphor work fundamentally differently than those that employ spatial metaphors, such as a desktop or a page. While the fundamental concept in visual interface design is the layout, the fundamental concept in conversational interface design is the sequence. Each provides for the overall structure of the user experience. In the past, user-interface designers have borrowed elements from the various areas of physical design. From industrial design, they have borrowed concepts such as, buttons, levers, wheels, and more from the print industry, they have borrowed the page, typography, iconography, illustration, and photography and more. These concepts from the physical world are then adapted to persistent, visual representations on a computer screen. Of course, virtual buttons are different from physical buttons and displayed words are different from printed words, but they evoke familiar ways of interacting with the real world that are then repurposed for a computer–user interface. And graphical user interface design, web design and mobile design are mature disciplines with shared standards and communities of practitioners. However, the spatial interaction metaphors of these areas of visual design largely do not apply to the design of conversational user interfaces (Moore & Arar, 2019; Moore et al., 2020; Murad et al., 2021; Yankelovich et al., 1995). Human conversational interaction consists primarily of sequences of words and embodied actions, not of layouts of visual elements. Buttons or pages cannot be represented as a stream of words produced by different parties. Conversational interfaces are more akin to command-line interaction, which involves sequences of specialized commands. The interaction conventions of visual design, graphical, web and mobile, were invented as an alternative to language-based interfaces and are not applicable to the design of conversational user experience. Where then can UX designers find inspiration when creating conversational interfaces with their sequences of natural-language utterances? In part, they can turn to Natural Language Processing (NLP), which provides mature methods for recognizing, classifying, and generating natural-language input and output (Chowdhury, 2003; Goldberg, 2017; Graves et al., 2013; López-Cózar et al., 2011; McTear et al., 2016; Reiter & Dale, 1997). These methods help designers understand what the user said and render realistic voice responses. However, NLP provides resources primarily for managing natural language, not for managing natural conversation. NLP addresses language use in any form: novels, poems, tweets, e-mails, conversations, etc. (Berg, 2015; Mitri, 2022; Peng et al., 2018; Zhang & Gao, 2017). Any bit of natural language, be it Spanish, English, Mandarin, etc., is analyzable with NLP.
将人类对话作为交互隐喻的用户界面与使用空间隐喻(如桌面或页面)的用户界面的工作方式根本不同。视觉界面设计的基本概念是布局,而对话界面设计的基本概念是顺序。每个都提供了用户体验的整体结构。在过去,用户界面设计师从物理设计的各个领域借鉴元素。从工业设计中,他们借鉴了诸如按钮、杠杆、轮子等概念,从印刷行业中,他们借鉴了页面、排版、图像、插图和摄影等等。然后,这些来自物理世界的概念被适应为计算机屏幕上持久的视觉表现。当然,虚拟按钮不同于物理按钮,显示的文字也不同于打印的文字,但它们唤起了与现实世界交互的熟悉方式,然后将其重新用于计算机用户界面。图形用户界面设计、网页设计和移动设计是成熟的学科,有共同的标准和从业者社区。然而,这些视觉设计领域的空间交互隐喻在很大程度上不适用于会话用户界面的设计(Moore & Arar, 2019;Moore et al., 2020;Murad et al., 2021;Yankelovich et al., 1995)。人类的对话互动主要由文字序列和具体的动作组成,而不是视觉元素的布局。按钮或页面不能表示为由不同方产生的单词流。会话接口更类似于命令行交互,它涉及一系列专门的命令。视觉设计、图形、网络和移动的交互约定是作为基于语言的界面的替代而发明的,并不适用于会话用户体验的设计。那么,用户体验设计师在用自然语言话语序列创建会话界面时,从哪里找到灵感呢?在某种程度上,他们可以转向自然语言处理(NLP),它为识别、分类和生成自然语言输入和输出提供了成熟的方法(Chowdhury, 2003;戈德堡,2017;Graves et al., 2013;López-Cózar等人,2011;McTear et al., 2016;Reiter & Dale出版社,1997)。这些方法可以帮助设计师理解用户所说的内容并呈现真实的声音响应。然而,NLP提供的资源主要用于管理自然语言,而不是用于管理自然对话。NLP处理任何形式的语言使用:小说、诗歌、推特、电子邮件、对话等(Berg, 2015;米特里,2022;Peng et al., 2018;Zhang & Gao, 2017)。任何自然语言,无论是西班牙语、英语、普通话等,都可以用NLP进行分析。
期刊介绍:
Human-Computer Interaction (HCI) is a multidisciplinary journal defining and reporting
on fundamental research in human-computer interaction. The goal of HCI is to be a journal
of the highest quality that combines the best research and design work to extend our
understanding of human-computer interaction. The target audience is the research
community with an interest in both the scientific implications and practical relevance of
how interactive computer systems should be designed and how they are actually used. HCI is
concerned with the theoretical, empirical, and methodological issues of interaction science
and system design as it affects the user.