Proceedings of the 28th International Conference on Intelligent User Interfaces最新文献_第6页

Improving Fairness in Adaptive Social Exergames via Shapley Bandits Shapley Bandits提高适应性社交游戏公平性

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-18 DOI: 10.1145/3581641.3584050

Robert C. Gray, Jennifer Villareale, Thomas Fox, Diane H Dallal, Santiago Ontan'on, D. Arigo, S. Jabbari, Jichen Zhu

Algorithmic fairness is an essential requirement as AI becomes integrated in society. In the case of social applications where AI distributes resources, algorithms often must make decisions that will benefit a subset of users, sometimes repeatedly or exclusively, while attempting to maximize specific outcomes. How should we design such systems to serve users more fairly? This paper explores this question in the case where a group of users works toward a shared goal in a social exergame called Step Heroes. We identify adverse outcomes in traditional multi-armed bandits (MABs) and formalize the Greedy Bandit Problem. We then propose a solution based on a new type of fairness-aware multi-armed bandit, Shapley Bandits. It uses the Shapley Value for increasing overall player participation and intervention adherence rather than the maximization of total group output, which is traditionally achieved by favoring only high-performing participants. We evaluate our approach via a user study (n=46). Our results indicate that our Shapley Bandits effectively mediates the Greedy Bandit Problem and achieves better user retention and motivation across the participants.

随着人工智能融入社会，算法公平性是一项基本要求。在人工智能分配资源的社交应用程序中，算法通常必须做出有利于一小部分用户的决策，有时是重复的或单独的，同时试图最大化特定结果。我们应该如何设计这样的系统来更公平地为用户服务?本文以一款名为《Step Heroes》的社交游戏中的一群用户为实现共同目标而努力为例来探讨这个问题。我们确定了传统多武装土匪(mab)的不良后果，并形式化了贪婪土匪问题。然后，我们提出了一种基于新型公平意识的多臂强盗Shapley匪徒的解决方案。它使用Shapley值来增加整体参与者的参与和干预依从性，而不是最大化总团队产出，这是传统上只支持高绩效参与者来实现的。我们通过一项用户研究(n=46)来评估我们的方法。我们的研究结果表明，我们的Shapley土匪有效地调解了贪婪土匪问题，并在参与者中获得了更好的用户留存率和动机。

{"title":"Improving Fairness in Adaptive Social Exergames via Shapley Bandits","authors":"Robert C. Gray, Jennifer Villareale, Thomas Fox, Diane H Dallal, Santiago Ontan'on, D. Arigo, S. Jabbari, Jichen Zhu","doi":"10.1145/3581641.3584050","DOIUrl":"https://doi.org/10.1145/3581641.3584050","url":null,"abstract":"Algorithmic fairness is an essential requirement as AI becomes integrated in society. In the case of social applications where AI distributes resources, algorithms often must make decisions that will benefit a subset of users, sometimes repeatedly or exclusively, while attempting to maximize specific outcomes. How should we design such systems to serve users more fairly? This paper explores this question in the case where a group of users works toward a shared goal in a social exergame called Step Heroes. We identify adverse outcomes in traditional multi-armed bandits (MABs) and formalize the Greedy Bandit Problem. We then propose a solution based on a new type of fairness-aware multi-armed bandit, Shapley Bandits. It uses the Shapley Value for increasing overall player participation and intervention adherence rather than the maximization of total group output, which is traditionally achieved by favoring only high-performing participants. We evaluate our approach via a user study (n=46). Our results indicate that our Shapley Bandits effectively mediates the Greedy Bandit Problem and achieves better user retention and motivation across the participants.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122712070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

SeeChart: Enabling Accessible Visualizations Through Interactive Natural Language Interface For People with Visual Impairments 参见图表:通过交互式自然语言界面为视障人士提供可访问的可视化

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-15 DOI: 10.1145/3581641.3584099

Md. Zubair Ibne Alam, Shehnaz Islam, Enamul Hoque

Web-based data visualizations have become very popular for exploring data and communicating insights. Newspapers, journals, and reports regularly publish visualizations to tell compelling stories with data. Unfortunately, most visualizations are inaccessible to readers with visual impairments. For many charts on the web, there are no accompanying alternative (alt) texts, and even if such texts exist they do not adequately describe important insights from charts. To address the problem, we first interviewed 15 blind users to understand their challenges and requirements for reading data visualizations. Based on the insights from these interviews, we developed SeeChart, an interactive tool that automatically deconstructs charts from web pages and then converts them to accessible visualizations for blind people by enabling them to hear the chart summary as well as to interact through data points using the keyboard. Our evaluation with 14 blind participants suggests the efficacy of SeeChart in understanding key insights from charts and fulfilling their information needs while reducing their required time and cognitive burden.

基于web的数据可视化在探索数据和交流见解方面变得非常流行。报纸、期刊和报告定期发布可视化，用数据讲述引人注目的故事。不幸的是，大多数可视化对有视觉障碍的读者来说是无法理解的。对于网络上的许多图表，没有附带的替代(alt)文本，即使存在这样的文本，它们也不能充分描述图表的重要见解。为了解决这个问题，我们首先采访了15位盲人用户，以了解他们在阅读数据可视化时面临的挑战和需求。基于这些访谈的见解，我们开发了SeeChart，这是一个交互式工具，可以自动从网页上解构图表，然后通过让盲人听到图表摘要以及使用键盘通过数据点进行交互，将它们转换为可访问的可视化。我们对14名盲人参与者的评估表明，SeeChart在理解图表中的关键见解和满足他们的信息需求方面是有效的，同时减少了他们所需的时间和认知负担。

{"title":"SeeChart: Enabling Accessible Visualizations Through Interactive Natural Language Interface For People with Visual Impairments","authors":"Md. Zubair Ibne Alam, Shehnaz Islam, Enamul Hoque","doi":"10.1145/3581641.3584099","DOIUrl":"https://doi.org/10.1145/3581641.3584099","url":null,"abstract":"Web-based data visualizations have become very popular for exploring data and communicating insights. Newspapers, journals, and reports regularly publish visualizations to tell compelling stories with data. Unfortunately, most visualizations are inaccessible to readers with visual impairments. For many charts on the web, there are no accompanying alternative (alt) texts, and even if such texts exist they do not adequately describe important insights from charts. To address the problem, we first interviewed 15 blind users to understand their challenges and requirements for reading data visualizations. Based on the insights from these interviews, we developed SeeChart, an interactive tool that automatically deconstructs charts from web pages and then converts them to accessible visualizations for blind people by enabling them to hear the chart summary as well as to interact through data points using the keyboard. Our evaluation with 14 blind participants suggests the efficacy of SeeChart in understanding key insights from charts and fulfilling their information needs while reducing their required time and cognitive burden.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125765651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

ScatterShot: Interactive In-context Example Curation for Text Transformation ScatterShot:文本转换的交互式上下文示例管理

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-14 DOI: 10.1145/3581641.3584059

Tongshuang Sherry Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro

The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when “enough” examples have been included even for known patterns. In this work, we present ScatterShot, an interactive system for building high-quality demonstration sets for in-context learning. ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set. In simulation studies on two text perturbation scenarios, ScatterShot sampling improves the resulting few-shot functions by 4-5 percentage points over random sampling, with less variance as more examples are added. In a user study, ScatterShot greatly helps users in covering different patterns in the input space and labeling in-context examples more efficiently, resulting in better in-context learning and less user effort.

像GPT-3这样的法学硕士的上下文学习能力允许注释者使用少量示例来定制法学硕士，以适应他们的特定任务。然而，在编写示例时，用户倾向于只包含最明显的模式，从而导致未指定的上下文函数在看不见的情况下不足。此外，即使对于已知的模式，也很难知道何时包含了“足够的”示例。在这项工作中，我们提出了ScatterShot，这是一个用于构建高质量演示集的交互式系统。ScatterShot迭代地将未标记的数据切片为任务特定的模式，以主动学习的方式从未充分探索或尚未饱和的切片中采样信息输入，并在LLM和当前示例集的帮助下帮助用户更有效地进行标记。在两种文本摄动场景的模拟研究中，ScatterShot抽样比随机抽样得到的few-shot函数提高了4-5个百分点，随着样本的增加，方差也越来越小。在用户研究中，ScatterShot极大地帮助用户覆盖输入空间中的不同模式，并更有效地标记上下文示例，从而实现更好的上下文学习，减少用户的工作量。

{"title":"ScatterShot: Interactive In-context Example Curation for Text Transformation","authors":"Tongshuang Sherry Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro","doi":"10.1145/3581641.3584059","DOIUrl":"https://doi.org/10.1145/3581641.3584059","url":null,"abstract":"The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when “enough” examples have been included even for known patterns. In this work, we present ScatterShot, an interactive system for building high-quality demonstration sets for in-context learning. ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set. In simulation studies on two text perturbation scenarios, ScatterShot sampling improves the resulting few-shot functions by 4-5 percentage points over random sampling, with less variance as more examples are added. In a user study, ScatterShot greatly helps users in covering different patterns in the input space and labeling in-context examples more efficiently, resulting in better in-context learning and less user effort.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125431184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Lessons from the Development of an Anomaly Detection Interface on the Mars Perseverance Rover using the ISHMAP Framework 利用ISHMAP框架开发火星毅力号火星车异常检测接口的经验教训

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-14 DOI: 10.1145/3581641.3584036

Austin P. Wright, P. Nemere, A. Galvin, Duen Horng Chau, Scott Davidoff

While anomaly detection stands among the most important and valuable problems across many scientific domains, anomaly detection research often focuses on AI methods that can lack the nuance and interpretability so critical to conducting scientific inquiry. We believe this exclusive focus on algorithms with a fixed framing ultimately blocks scientists from adopting even high-accuracy anomaly detection models in many scientific use cases. In this application paper we present the results of utilizing an alternative approach that situates the mathematical framing of machine learning based anomaly detection within a participatory design framework. In a collaboration with NASA scientists working with the PIXL instrument studying Martian planetary geochemistry as a part of the search for extra-terrestrial life; we report on over 18 months of in-context user research and co-design to define the key problems NASA scientists face when looking to detect and interpret spectral anomalies. We address these problems and develop a novel spectral anomaly detection toolkit for PIXL scientists that is highly accurate (93.4% test accuracy on detecting diffraction anomalies), while maintaining strong transparency to scientific interpretation. We also describe outcomes from a yearlong field deployment of the algorithm and associated interface, now used daily as a core component of the PIXL science team’s workflow, and directly situate the algorithm as a key contributor to discoveries around the potential habitability of Mars. Finally we introduce a new design framework which we developed through the course of this collaboration for co-creating anomaly detection algorithms: Iterative Semantic Heuristic Modeling of Anomalous Phenomena (ISHMAP), which provides a process for scientists and researchers to produce natively interpretable anomaly detection models. This work showcases an example of successfully bridging methodologies from AI and HCI within a scientific domain, and provides a resource in ISHMAP which may be used by other researchers and practitioners looking to partner with other scientific teams to achieve better science through more effective and interpretable anomaly detection tools.

虽然异常检测是许多科学领域中最重要和最有价值的问题之一，但异常检测研究通常集中在人工智能方法上，这些方法可能缺乏对进行科学探究至关重要的细微差别和可解释性。我们认为，这种对固定框架算法的独家关注最终会阻碍科学家在许多科学用例中采用高精度异常检测模型。在这篇应用论文中，我们展示了利用一种替代方法的结果，该方法将基于机器学习的异常检测的数学框架置于参与式设计框架中。与NASA的科学家合作，使用PIXL仪器研究火星行星地球化学，作为寻找地外生命的一部分;我们报告了超过18个月的上下文用户研究和共同设计，以定义NASA科学家在寻找检测和解释光谱异常时面临的关键问题。我们解决了这些问题，并为PIXL科学家开发了一种新的光谱异常检测工具包，该工具包具有很高的准确性(检测衍射异常的测试精度为93.4%)，同时保持了很强的科学解释透明度。我们还描述了该算法和相关界面长达一年的现场部署的结果，现在作为PIXL科学团队工作流程的核心组成部分每天使用，并直接将该算法定位为发现火星潜在宜居性的关键贡献者。最后，我们介绍了一个新的设计框架，这是我们在合作过程中开发的共同创建异常检测算法:异常现象迭代语义启发式建模(ISHMAP)，它为科学家和研究人员提供了一个生成本地可解释异常检测模型的过程。这项工作展示了在科学领域内成功桥接人工智能和HCI方法的一个例子，并提供了ISHMAP中的资源，可供其他研究人员和实践者使用，他们希望与其他科学团队合作，通过更有效和可解释的异常检测工具实现更好的科学。

{"title":"Lessons from the Development of an Anomaly Detection Interface on the Mars Perseverance Rover using the ISHMAP Framework","authors":"Austin P. Wright, P. Nemere, A. Galvin, Duen Horng Chau, Scott Davidoff","doi":"10.1145/3581641.3584036","DOIUrl":"https://doi.org/10.1145/3581641.3584036","url":null,"abstract":"While anomaly detection stands among the most important and valuable problems across many scientific domains, anomaly detection research often focuses on AI methods that can lack the nuance and interpretability so critical to conducting scientific inquiry. We believe this exclusive focus on algorithms with a fixed framing ultimately blocks scientists from adopting even high-accuracy anomaly detection models in many scientific use cases. In this application paper we present the results of utilizing an alternative approach that situates the mathematical framing of machine learning based anomaly detection within a participatory design framework. In a collaboration with NASA scientists working with the PIXL instrument studying Martian planetary geochemistry as a part of the search for extra-terrestrial life; we report on over 18 months of in-context user research and co-design to define the key problems NASA scientists face when looking to detect and interpret spectral anomalies. We address these problems and develop a novel spectral anomaly detection toolkit for PIXL scientists that is highly accurate (93.4% test accuracy on detecting diffraction anomalies), while maintaining strong transparency to scientific interpretation. We also describe outcomes from a yearlong field deployment of the algorithm and associated interface, now used daily as a core component of the PIXL science team’s workflow, and directly situate the algorithm as a key contributor to discoveries around the potential habitability of Mars. Finally we introduce a new design framework which we developed through the course of this collaboration for co-creating anomaly detection algorithms: Iterative Semantic Heuristic Modeling of Anomalous Phenomena (ISHMAP), which provides a process for scientists and researchers to produce natively interpretable anomaly detection models. This work showcases an example of successfully bridging methodologies from AI and HCI within a scientific domain, and provides a resource in ISHMAP which may be used by other researchers and practitioners looking to partner with other scientific teams to achieve better science through more effective and interpretable anomaly detection tools.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123218590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

The Programmer’s Assistant: Conversational Interaction with a Large Language Model for Software Development 程序员的助手:与软件开发的大型语言模型的会话交互

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-14 DOI: 10.1145/3581641.3584037

Steven I. Ross, Fernando Martinez, Stephanie Houde, Michael J. Muller, Justin D. Weisz

Large language models (LLMs) have recently been applied in software engineering to perform tasks such as translating code between programming languages, generating code from natural language, and autocompleting code as it is being written. When used within development tools, these systems typically treat each model invocation independently from all previous invocations, and only a specific limited functionality is exposed within the user interface. This approach to user interaction misses an opportunity for users to more deeply engage with the model by having the context of their previous interactions, as well as the context of their code, inform the model’s responses. We developed a prototype system – the Programmer’s Assistant – in order to explore the utility of conversational interactions grounded in code, as well as software engineers’ receptiveness to the idea of conversing with, rather than invoking, a code-fluent LLM. Through an evaluation with 42 participants with varied levels of programming experience, we found that our system was capable of conducting extended, multi-turn discussions, and that it enabled additional knowledge and capabilities beyond code generation to emerge from the LLM. Despite skeptical initial expectations for conversational programming assistance, participants were impressed by the breadth of the assistant’s capabilities, the quality of its responses, and its potential for improving their productivity. Our work demonstrates the unique potential of conversational interactions with LLMs for co-creative processes like software development.

大型语言模型(llm)最近被应用于软件工程中，以执行诸如在编程语言之间翻译代码、从自然语言生成代码以及在编写代码时自动完成代码等任务。当在开发工具中使用时，这些系统通常独立于所有以前的调用来处理每个模型调用，并且在用户界面中只公开特定的有限功能。这种用户交互的方法错过了一个让用户更深入地参与模型的机会，因为他们拥有之前交互的上下文，以及他们的代码的上下文，通知模型的响应。我们开发了一个原型系统——程序员助理——以探索基于代码的对话交互的效用，以及软件工程师对与代码流畅的LLM对话而不是调用的想法的接受程度。通过对42名具有不同编程经验水平的参与者的评估，我们发现我们的系统能够进行扩展的、多回合的讨论，并且它能够从LLM中获得代码生成之外的额外知识和能力。尽管最初对会话编程辅助的期望持怀疑态度，但参与者对助手的能力广度、响应质量以及提高工作效率的潜力印象深刻。我们的工作展示了与法学硕士对话互动在软件开发等共同创造过程中的独特潜力。

{"title":"The Programmer’s Assistant: Conversational Interaction with a Large Language Model for Software Development","authors":"Steven I. Ross, Fernando Martinez, Stephanie Houde, Michael J. Muller, Justin D. Weisz","doi":"10.1145/3581641.3584037","DOIUrl":"https://doi.org/10.1145/3581641.3584037","url":null,"abstract":"Large language models (LLMs) have recently been applied in software engineering to perform tasks such as translating code between programming languages, generating code from natural language, and autocompleting code as it is being written. When used within development tools, these systems typically treat each model invocation independently from all previous invocations, and only a specific limited functionality is exposed within the user interface. This approach to user interaction misses an opportunity for users to more deeply engage with the model by having the context of their previous interactions, as well as the context of their code, inform the model’s responses. We developed a prototype system – the Programmer’s Assistant – in order to explore the utility of conversational interactions grounded in code, as well as software engineers’ receptiveness to the idea of conversing with, rather than invoking, a code-fluent LLM. Through an evaluation with 42 participants with varied levels of programming experience, we found that our system was capable of conducting extended, multi-turn discussions, and that it enabled additional knowledge and capabilities beyond code generation to emerge from the LLM. Despite skeptical initial expectations for conversational programming assistance, participants were impressed by the breadth of the assistant’s capabilities, the quality of its responses, and its potential for improving their productivity. Our work demonstrates the unique potential of conversational interactions with LLMs for co-creative processes like software development.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126881409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 40

Mixed Multi-Model Semantic Interaction for Graph-based Narrative Visualizations 基于图的叙事可视化混合多模型语义交互

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-13 DOI: 10.1145/3581641.3584076

Brian Felipe Keith Norambuena, Tanushree Mitra, Chris North

Narrative sensemaking is an essential part of understanding sequential data. Narrative maps are a visual representation model that can assist analysts to understand narratives. In this work, we present a semantic interaction (SI) framework for narrative maps that can support analysts through their sensemaking process. In contrast to traditional SI systems which rely on dimensionality reduction and work on a projection space, our approach has an additional abstraction layer—the structure space—that builds upon the projection space and encodes the narrative in a discrete structure. This extra layer introduces additional challenges that must be addressed when integrating SI with the narrative extraction pipeline. We address these challenges by presenting the general concept of Mixed Multi-Model Semantic Interaction (3MSI)—an SI pipeline, where the highest-level model corresponds to an abstract discrete structure and the lower-level models are continuous. To evaluate the performance of our 3MSI models for narrative maps, we present a quantitative simulation-based evaluation and a qualitative evaluation with case studies and expert feedback. We find that our SI system can model the analysts’ intent and support incremental formalism for narrative maps.

叙事意义是理解序列数据的重要组成部分。叙事地图是一种视觉表现模型，可以帮助分析人员理解叙事。在这项工作中，我们提出了一个用于叙事地图的语义交互(SI)框架，可以通过他们的语义构建过程来支持分析师。与传统的SI系统依赖于降维并在投影空间上工作相比，我们的方法有一个额外的抽象层——结构空间——它建立在投影空间之上，并以离散结构对叙事进行编码。这个额外的层引入了在集成SI和叙事提取管道时必须解决的额外挑战。我们通过提出混合多模型语义交互(3MSI)的一般概念来解决这些挑战-一个SI管道，其中最高级别模型对应于抽象离散结构，较低级别模型是连续的。为了评估我们的3MSI叙事地图模型的性能，我们提出了基于定量模拟的评估和基于案例研究和专家反馈的定性评估。我们发现我们的SI系统可以模拟分析人员的意图，并支持叙事地图的增量形式主义。

{"title":"Mixed Multi-Model Semantic Interaction for Graph-based Narrative Visualizations","authors":"Brian Felipe Keith Norambuena, Tanushree Mitra, Chris North","doi":"10.1145/3581641.3584076","DOIUrl":"https://doi.org/10.1145/3581641.3584076","url":null,"abstract":"Narrative sensemaking is an essential part of understanding sequential data. Narrative maps are a visual representation model that can assist analysts to understand narratives. In this work, we present a semantic interaction (SI) framework for narrative maps that can support analysts through their sensemaking process. In contrast to traditional SI systems which rely on dimensionality reduction and work on a projection space, our approach has an additional abstraction layer—the structure space—that builds upon the projection space and encodes the narrative in a discrete structure. This extra layer introduces additional challenges that must be addressed when integrating SI with the narrative extraction pipeline. We address these challenges by presenting the general concept of Mixed Multi-Model Semantic Interaction (3MSI)—an SI pipeline, where the highest-level model corresponds to an abstract discrete structure and the lower-level models are continuous. To evaluate the performance of our 3MSI models for narrative maps, we present a quantitative simulation-based evaluation and a qualitative evaluation with case studies and expert feedback. We find that our SI system can model the analysts’ intent and support incremental formalism for narrative maps.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131719319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

The Impact of Expertise in the Loop for Exploring Machine Rationality 循环中的专业知识对探索机器合理性的影响

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-11 DOI: 10.1145/3581641.3584040

Changkun Ou, S. Mayer, A. Butz

Human-in-the-loop optimization utilizes human expertise to guide machine optimizers iteratively and search for an optimal solution in a solution space. While prior empirical studies mainly investigated novices, we analyzed the impact of the levels of expertise on the outcome quality and corresponding subjective satisfaction. We conducted a study (N=60) in text, photo, and 3D mesh optimization contexts. We found that novices can achieve an expert level of quality performance, but participants with higher expertise led to more optimization iteration with more explicit preference while keeping satisfaction low. In contrast, novices were more easily satisfied and terminated faster. Therefore, we identified that experts seek more diverse outcomes while the machine reaches optimal results, and the observed behavior can be used as a performance indicator for human-in-the-loop system designers to improve underlying models. We inform future research to be cautious about the impact of user expertise when designing human-in-the-loop systems.

人在环优化利用人的专业知识来指导机器优化器迭代并在解空间中搜索最优解。先前的实证研究主要针对新手，我们分析了专业水平对结果质量和相应主观满意度的影响。我们在文本、照片和3D网格优化上下文中进行了一项研究(N=60)。我们发现，新手可以达到专家水平的质量绩效，但高专业水平的参与者导致更多的优化迭代，更明确的偏好，同时保持较低的满意度。相比之下，新手更容易满足，终止更快。因此，我们确定专家在机器达到最佳结果时寻求更多样化的结果，并且观察到的行为可以用作人在环系统设计人员改进底层模型的性能指标。我们告知未来的研究在设计人在环系统时要谨慎对待用户专业知识的影响。

引用次数: 1

Appropriate Reliance on AI Advice: Conceptualization and the Effect of Explanations 适当依赖AI建议:概念化和解释的效果

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-04 DOI: 10.1145/3581641.3584066

M. Schemmer, Niklas Kühl, Carina Benz, Andrea Bartos, G. Satzger

AI advice is becoming increasingly popular, e.g., in investment and medical treatment decisions. As this advice is typically imperfect, decision-makers have to exert discretion as to whether actually follow that advice: they have to “appropriately” rely on correct and turn down incorrect advice. However, current research on appropriate reliance still lacks a common definition as well as an operational measurement concept. Additionally, no in-depth behavioral experiments have been conducted that help understand the factors influencing this behavior. In this paper, we propose Appropriateness of Reliance (AoR) as an underlying, quantifiable two-dimensional measurement concept. We develop a research model that analyzes the effect of providing explanations for AI advice. In an experiment with 200 participants, we demonstrate how these explanations influence the AoR, and, thus, the effectiveness of AI advice. Our work contributes fundamental concepts for the analysis of reliance behavior and the purposeful design of AI advisors.

人工智能建议正变得越来越受欢迎，例如在投资和医疗决策方面。由于这些建议通常是不完美的，决策者必须对是否真正遵循这些建议行使自由裁量权:他们必须“适当地”依赖正确的建议，拒绝不正确的建议。然而，目前关于适当依赖的研究仍然缺乏一个共同的定义和一个可操作的测量概念。此外，没有进行深入的行为实验来帮助理解影响这种行为的因素。在本文中，我们提出适当的信赖(AoR)作为一个潜在的，可量化的二维测量概念。我们开发了一个研究模型，分析了为人工智能建议提供解释的效果。在一个有200名参与者的实验中，我们展示了这些解释如何影响AoR，从而影响人工智能建议的有效性。我们的工作为分析依赖行为和人工智能顾问的有目的设计提供了基本概念。

引用次数: 7

VR-LENS: Super Learning-based Cybersickness Detection and Explainable AI-Guided Deployment in Virtual Reality VR-LENS:基于学习的超级晕机检测和虚拟现实中可解释的人工智能引导部署

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-02-03 DOI: 10.1145/3581641.3584044

Ripan Kumar Kundu, Osama Yahia Elsaid, P. Calyam, K. A. Hoque

Virtual reality (VR) systems are known for their susceptibility to cybersickness, which can seriously hinder users’ experience. Therefore, a plethora of recent research has proposed several automated methods based on machine learning (ML) and deep learning (DL) to detect cybersickness. However, these detection methods are perceived as computationally intensive and black-box methods. Thus, those techniques are neither trustworthy nor practical for deploying on standalone VR head-mounted displays (HMDs). This work presents an explainable artificial intelligence (XAI)-based framework VR-LENS for developing cybersickness detection ML models, explaining them, reducing their size, and deploying them in a Qualcomm Snapdragon 750G processor-based Samsung A52 device. Specifically, we first develop a novel super learning-based ensemble ML model for cybersickness detection. Next, we employ a post-hoc explanation method, such as SHapley Additive exPlanations (SHAP), Morris Sensitivity Analysis (MSA), Local Interpretable Model-Agnostic Explanations (LIME), and Partial Dependence Plot (PDP) to explain the expected results and identify the most dominant features. The super learner cybersickness model is then retrained using the identified dominant features. Our proposed method identified eye tracking, player position, and galvanic skin/heart rate response as the most dominant features for the integrated sensor, gameplay, and bio-physiological datasets. We also show that the proposed XAI-guided feature reduction significantly reduces the model training and inference time by 1.91X and 2.15X while maintaining baseline accuracy. For instance, using the integrated sensor dataset, our reduced super learner model outperforms the state-of-the-art works by classifying cybersickness into 4 classes (none, low, medium, and high) with an accuracy of and regressing (FMS 1–10) with a Root Mean Square Error (RMSE) of 0.03. Our proposed method can help researchers analyze, detect, and mitigate cybersickness in real time and deploy the super learner-based cybersickness detection model in standalone VR headsets.

众所周知，虚拟现实(VR)系统容易产生晕动症，这会严重影响用户的体验。因此，最近的大量研究提出了几种基于机器学习(ML)和深度学习(DL)的自动化方法来检测晕动症。然而，这些检测方法被认为是计算密集型和黑盒方法。因此，这些技术既不可靠，也不实用，无法部署在独立的VR头戴式显示器(hmd)上。这项工作提出了一个可解释的基于人工智能(XAI)的框架VR-LENS，用于开发晕动病检测ML模型，解释它们，减小它们的尺寸，并将它们部署在基于高通骁龙750G处理器的三星A52设备中。具体来说，我们首先开发了一种新的基于超级学习的集成ML模型，用于晕机检测。接下来，我们采用事后解释方法，如SHapley加性解释(SHAP)、Morris敏感性分析(MSA)、局部可解释模型不可知解释(LIME)和部分依赖图(PDP)来解释预期结果并确定最主要的特征。然后使用识别出的主导特征对超级学习者晕动症模型进行再训练。我们提出的方法确定了眼动追踪、玩家位置和皮肤电/心率反应是集成传感器、游戏玩法和生物生理数据集的最主要特征。我们还表明，在保持基线精度的情况下，提出的xai引导的特征缩减显着减少了1.91X和2.15X的模型训练和推理时间。例如，使用集成的传感器数据集，我们的简化超级学习器模型通过将晕动症分为4类(无，低，中，高)，并回归(FMS 1-10)，均方根误差(RMSE)为0.03，优于最先进的工作。我们提出的方法可以帮助研究人员实时分析、检测和减轻晕动病，并将基于超级学习者的晕动病检测模型部署在独立的VR头显中。

{"title":"VR-LENS: Super Learning-based Cybersickness Detection and Explainable AI-Guided Deployment in Virtual Reality","authors":"Ripan Kumar Kundu, Osama Yahia Elsaid, P. Calyam, K. A. Hoque","doi":"10.1145/3581641.3584044","DOIUrl":"https://doi.org/10.1145/3581641.3584044","url":null,"abstract":"Virtual reality (VR) systems are known for their susceptibility to cybersickness, which can seriously hinder users’ experience. Therefore, a plethora of recent research has proposed several automated methods based on machine learning (ML) and deep learning (DL) to detect cybersickness. However, these detection methods are perceived as computationally intensive and black-box methods. Thus, those techniques are neither trustworthy nor practical for deploying on standalone VR head-mounted displays (HMDs). This work presents an explainable artificial intelligence (XAI)-based framework VR-LENS for developing cybersickness detection ML models, explaining them, reducing their size, and deploying them in a Qualcomm Snapdragon 750G processor-based Samsung A52 device. Specifically, we first develop a novel super learning-based ensemble ML model for cybersickness detection. Next, we employ a post-hoc explanation method, such as SHapley Additive exPlanations (SHAP), Morris Sensitivity Analysis (MSA), Local Interpretable Model-Agnostic Explanations (LIME), and Partial Dependence Plot (PDP) to explain the expected results and identify the most dominant features. The super learner cybersickness model is then retrained using the identified dominant features. Our proposed method identified eye tracking, player position, and galvanic skin/heart rate response as the most dominant features for the integrated sensor, gameplay, and bio-physiological datasets. We also show that the proposed XAI-guided feature reduction significantly reduces the model training and inference time by 1.91X and 2.15X while maintaining baseline accuracy. For instance, using the integrated sensor dataset, our reduced super learner model outperforms the state-of-the-art works by classifying cybersickness into 4 classes (none, low, medium, and high) with an accuracy of and regressing (FMS 1–10) with a Root Mean Square Error (RMSE) of 0.03. Our proposed method can help researchers analyze, detect, and mitigate cybersickness in real time and deploy the super learner-based cybersickness detection model in standalone VR headsets.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125116702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Powering an AI Chatbot with Expert Sourcing to Support Credible Health Information Access 为AI聊天机器人提供专家资源，以支持可靠的健康信息访问

Proceedings of the 28th International Conference on Intelligent User Interfaces

Pub Date : 2023-01-25 DOI: 10.1145/3581641.3584031

Ziang Xiao, Q. Liao, Michelle X. Zhou, Tyrone Grandison, Yunyao Li

During a public health crisis like the COVID-19 pandemic, a credible and easy-to-access information portal is highly desirable. It helps with disease prevention, public health planning, and misinformation mitigation. However, creating such an information portal is challenging because 1) domain expertise is required to identify and curate credible and intelligible content, 2) the information needs to be updated promptly in response to the fast-changing environment, and 3) the information should be easily accessible by the general public; which is particularly difficult when most people do not have the domain expertise about the crisis. In this paper, we presented an expert-sourcing framework and created Jennifer, an AI chatbot, which serves as a credible and easy-to-access information portal for individuals during the COVID-19 pandemic. Jennifer was created by a team of over 150 scientists and health professionals around the world, deployed in the real world and answered thousands of user questions about COVID-19. We evaluated Jennifer from two key stakeholders’ perspectives, expert volunteers and information seekers. We first interviewed experts who contributed to the collaborative creation of Jennifer to learn about the challenges in the process and opportunities for future improvement. We then conducted an online experiment that examined Jennifer’s effectiveness in supporting information seekers in locating COVID-19 information and gaining their trust. We share the key lessons learned and discuss design implications for building expert-sourced and AI-powered information portals, along with the risks and opportunities of misinformation mitigation and beyond.

在COVID-19大流行等公共卫生危机期间，非常需要一个可信且易于访问的信息门户。它有助于疾病预防、公共卫生规划和减少错误信息。然而，创建这样一个信息门户是具有挑战性的，因为1)需要领域专业知识来识别和管理可信和可理解的内容，2)信息需要及时更新以响应快速变化的环境，以及3)信息应该容易被公众访问;在大多数人对危机缺乏专业知识的情况下，这一点尤其困难。在本文中，我们提出了一个专家采购框架，并创建了人工智能聊天机器人Jennifer，作为COVID-19大流行期间个人可靠且易于访问的信息门户。珍妮弗是由世界各地150多名科学家和卫生专业人员组成的团队创建的，部署在现实世界中，回答了数千个关于COVID-19的用户问题。我们从两个关键的利益相关者，专家志愿者和信息寻求者的角度来评估Jennifer。我们首先采访了参与Jennifer协作创作的专家，以了解过程中的挑战和未来改进的机会。然后，我们进行了一项在线实验，检验了詹妮弗在支持信息寻求者查找COVID-19信息并获得他们信任方面的有效性。我们分享了吸取的主要经验教训，并讨论了构建专家来源和人工智能支持的信息门户的设计含义，以及缓解错误信息等方面的风险和机会。

{"title":"Powering an AI Chatbot with Expert Sourcing to Support Credible Health Information Access","authors":"Ziang Xiao, Q. Liao, Michelle X. Zhou, Tyrone Grandison, Yunyao Li","doi":"10.1145/3581641.3584031","DOIUrl":"https://doi.org/10.1145/3581641.3584031","url":null,"abstract":"During a public health crisis like the COVID-19 pandemic, a credible and easy-to-access information portal is highly desirable. It helps with disease prevention, public health planning, and misinformation mitigation. However, creating such an information portal is challenging because 1) domain expertise is required to identify and curate credible and intelligible content, 2) the information needs to be updated promptly in response to the fast-changing environment, and 3) the information should be easily accessible by the general public; which is particularly difficult when most people do not have the domain expertise about the crisis. In this paper, we presented an expert-sourcing framework and created Jennifer, an AI chatbot, which serves as a credible and easy-to-access information portal for individuals during the COVID-19 pandemic. Jennifer was created by a team of over 150 scientists and health professionals around the world, deployed in the real world and answered thousands of user questions about COVID-19. We evaluated Jennifer from two key stakeholders’ perspectives, expert volunteers and information seekers. We first interviewed experts who contributed to the collaborative creation of Jennifer to learn about the challenges in the process and opportunities for future improvement. We then conducted an online experiment that examined Jennifer’s effectiveness in supporting information seekers in locating COVID-19 information and gaining their trust. We share the key lessons learned and discuss design implications for building expert-sourced and AI-powered information portals, along with the risks and opportunities of misinformation mitigation and beyond.","PeriodicalId":118159,"journal":{"name":"Proceedings of the 28th International Conference on Intelligent User Interfaces","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122432511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10