Explainability pitfalls: Beyond dark patterns in explainable AI

IF 6.7 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Patterns Pub Date : 2024-06-14 DOI:10.1016/j.patter.2024.100971

Upol Ehsan, Mark O. Riedl

引用次数: 0

Abstract

To make explainable artificial intelligence (XAI) systems trustworthy, understanding harmful effects is important. In this paper, we address an important yet unarticulated type of negative effect in XAI. We introduce explainability pitfalls (EPs), unanticipated negative downstream effects from AI explanations manifesting even when there is no intention to manipulate users. EPs are different from dark patterns, which are intentionally deceptive practices. We articulate the concept of EPs by demarcating it from dark patterns and highlighting the challenges arising from uncertainties around pitfalls. We situate and operationalize the concept using a case study that showcases how, despite best intentions, unsuspecting negative effects, such as unwarranted trust in numerical explanations, can emerge. We propose proactive and preventative strategies to address EPs at three interconnected levels: research, design, and organizational. We discuss design and societal implications around reframing AI adoption, recalibrating stakeholder empowerment, and resisting the “move fast and break things” mindset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

可解释性陷阱：超越可解释人工智能的黑暗模式

要使可解释人工智能（XAI）系统值得信赖，了解有害效应非常重要。在本文中，我们将讨论 XAI 中一种重要但尚未阐明的负面效应。我们引入了可解释性陷阱（EPs），这是人工智能解释所产生的意料之外的负面下游效应，即使在无意操纵用户的情况下也会表现出来。EPs不同于黑暗模式，后者是有意的欺骗行为。我们阐明了EPs的概念，将其与黑暗模式区分开来，并强调了陷阱的不确定性所带来的挑战。我们通过一个案例研究来定位和操作这一概念，该案例研究展示了尽管用心良苦，但还是会出现意想不到的负面影响，例如对数字解释的无端信任。我们从研究、设计和组织三个相互关联的层面提出了应对 EPs 的积极预防策略。我们将围绕重新构建人工智能的采用、重新调整利益相关者的授权以及抵制 "快速行动、打破常规 "的思维方式，讨论设计和社会影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊