Explainable, interactive content-based image retrieval

Applied AI Letters · Pub Date: 2021-10-19 · DOI: 10.1002/ail2.41
Bhavan Vasu, Brian Hu, Bo Dong, Roddy Collins, Anthony Hoogs
{"title":"Explainable, interactive content-based image retrieval","authors":"Bhavan Vasu,&nbsp;Brian Hu,&nbsp;Bo Dong,&nbsp;Roddy Collins,&nbsp;Anthony Hoogs","doi":"10.1002/ail2.41","DOIUrl":null,"url":null,"abstract":"<p>Quantifying the value of explanations in a human-in-the-loop (HITL) system is difficult. Previous methods either measure explanation-specific values that do not correspond to user tasks and needs or poll users on how useful they find the explanations to be. In this work, we quantify how much explanations help the user through a utility-based paradigm that measures change in task performance when using explanations vs not. Our chosen task is content-based image retrieval (CBIR), which has well-established baselines and performance metrics independent of explainability. We extend an existing HITL image retrieval system that incorporates user feedback with similarity-based saliency maps (SBSM) that indicate to the user which parts of the retrieved images are most similar to the query image. The system helps the user understand what it is paying attention to through saliency maps, and the user helps the system understand their goal through saliency-guided relevance feedback. Using the MS-COCO dataset, a standard object detection and segmentation dataset, we conducted extensive, crowd-sourced experiments validating that SBSM improves interactive image retrieval. Although the performance increase is modest in the general case, in more difficult cases such as cluttered scenes, using explanations yields an 6.5% increase in accuracy. To the best of our knowledge, this is the first large-scale user study showing that visual saliency map explanations improve performance on a real-world, interactive task. Our utility-based evaluation paradigm is general and potentially applicable to any task for which explainability can be incorporated.</p>","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ail2.41","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied AI letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ail2.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Quantifying the value of explanations in a human-in-the-loop (HITL) system is difficult. Previous methods either measure explanation-specific values that do not correspond to user tasks and needs, or poll users on how useful they find the explanations to be. In this work, we quantify how much explanations help the user through a utility-based paradigm that measures the change in task performance with and without explanations. Our chosen task is content-based image retrieval (CBIR), which has well-established baselines and performance metrics independent of explainability. We extend an existing HITL image retrieval system that incorporates user feedback with similarity-based saliency maps (SBSM), which indicate to the user which parts of the retrieved images are most similar to the query image. The system helps the user understand what it is paying attention to through saliency maps, and the user helps the system understand their goal through saliency-guided relevance feedback. Using MS-COCO, a standard object detection and segmentation dataset, we conducted extensive, crowd-sourced experiments validating that SBSM improves interactive image retrieval. Although the performance increase is modest in the general case, in more difficult cases such as cluttered scenes, using explanations yields a 6.5% increase in accuracy. To the best of our knowledge, this is the first large-scale user study showing that visual saliency map explanations improve performance on a real-world, interactive task. Our utility-based evaluation paradigm is general and potentially applicable to any task for which explainability can be incorporated.
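The abstract names SBSM only at a high level. As a rough illustration of the occlusion-style idea behind similarity-based saliency (score each region of a retrieved image by how much hiding it reduces similarity to the query), here is a minimal sketch. The `embed` placeholder, mask size, and stride are hypothetical stand-ins for illustration, not the paper's implementation.

```python
# Minimal sketch of an occlusion-style similarity-based saliency map.
# Assumption: `embed` maps an HxWxC image array to an L2-normalized
# feature vector; in practice this would be a trained CNN embedding.
import numpy as np

def embed(image: np.ndarray) -> np.ndarray:
    """Placeholder feature extractor; swap in a real embedding model."""
    v = image.mean(axis=(0, 1))               # toy 3-dim "embedding" (channel means)
    return v / (np.linalg.norm(v) + 1e-8)

def similarity_saliency(query: np.ndarray, retrieved: np.ndarray,
                        mask: int = 32, stride: int = 16) -> np.ndarray:
    """saliency[i, j] ~ similarity drop when the patch covering (i, j) is occluded."""
    q = embed(query)
    base = float(embed(retrieved) @ q)        # unoccluded query similarity
    h, w = retrieved.shape[:2]
    saliency = np.zeros((h, w))
    counts = np.zeros((h, w))
    for y in range(0, h - mask + 1, stride):
        for x in range(0, w - mask + 1, stride):
            occluded = retrieved.copy()
            occluded[y:y + mask, x:x + mask] = 0      # hide this patch
            drop = base - float(embed(occluded) @ q)  # similarity lost
            saliency[y:y + mask, x:x + mask] += drop
            counts[y:y + mask, x:x + mask] += 1
    return saliency / np.maximum(counts, 1)   # average overlapping windows
```

High values in the returned map mark regions whose removal most hurts query similarity, which is the kind of overlay a user could inspect and feed back into saliency-guided relevance feedback.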
