HSI: Human Saliency Imitator for Benchmarking Saliency-Based Model Explanations

Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing Pub Date : 2022-10-14 DOI:10.1609/hcomp.v10i1.22002

Yi Yang, Yueyuan Zheng, Didan Deng, Jindi Zhang, Yongxiang Huang, Yumeng Yang, J. Hsiao, Caleb Chen Cao

{"title":"HSI: Human Saliency Imitator for Benchmarking Saliency-Based Model Explanations","authors":"Yi Yang, Yueyuan Zheng, Didan Deng, Jindi Zhang, Yongxiang Huang, Yumeng Yang, J. Hsiao, Caleb Chen Cao","doi":"10.1609/hcomp.v10i1.22002","DOIUrl":null,"url":null,"abstract":"Model explanations are generated by XAI (explainable AI) methods to help people understand and interpret machine learning models. To study XAI methods from the human perspective, we propose a human-based benchmark dataset, i.e., human saliency benchmark (HSB), for evaluating saliency-based XAI methods. Different from existing human saliency annotations where class-related features are manually and subjectively labeled, this benchmark collects more objective human attention on vision information with a precise eye-tracking device and a novel crowdsourcing experiment. Taking the labor cost of human experiment into consideration, we further explore the potential of utilizing a prediction model trained on HSB to mimic saliency annotating by humans. Hence, a dense prediction problem is formulated, and we propose an encoder-decoder architecture which combines multi-modal and multi-scale features to produce the human saliency maps. Accordingly, a pretraining-finetuning method is designed to address the model training problem. Finally, we arrive at a model trained on HSB named human saliency imitator (HSI). We show, through an extensive evaluation, that HSI can successfully predict human saliency on our HSB dataset, and the HSI-generated human saliency dataset on ImageNet showcases the ability of benchmarking XAI methods both qualitatively and quantitatively.","PeriodicalId":87339,"journal":{"name":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/hcomp.v10i1.22002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Model explanations are generated by XAI (explainable AI) methods to help people understand and interpret machine learning models. To study XAI methods from the human perspective, we propose a human-based benchmark dataset, i.e., human saliency benchmark (HSB), for evaluating saliency-based XAI methods. Different from existing human saliency annotations where class-related features are manually and subjectively labeled, this benchmark collects more objective human attention on vision information with a precise eye-tracking device and a novel crowdsourcing experiment. Taking the labor cost of human experiment into consideration, we further explore the potential of utilizing a prediction model trained on HSB to mimic saliency annotating by humans. Hence, a dense prediction problem is formulated, and we propose an encoder-decoder architecture which combines multi-modal and multi-scale features to produce the human saliency maps. Accordingly, a pretraining-finetuning method is designed to address the model training problem. Finally, we arrive at a model trained on HSB named human saliency imitator (HSI). We show, through an extensive evaluation, that HSI can successfully predict human saliency on our HSB dataset, and the HSI-generated human saliency dataset on ImageNet showcases the ability of benchmarking XAI methods both qualitatively and quantitatively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

HSI:人类显著性模仿者的基准显著性为基础的模型解释

模型解释由XAI(可解释的AI)方法生成，以帮助人们理解和解释机器学习模型。为了从人类的角度研究XAI方法，我们提出了一个基于人类的基准数据集，即人类显著性基准(HSB)，用于评估基于显著性的XAI方法。与现有的人类显著性标注不同，该基准通过精确的眼动追踪设备和新颖的众包实验，收集了更客观的人类对视觉信息的关注。考虑到人类实验的人工成本，我们进一步探索了利用HSB训练的预测模型来模拟人类显著性注释的潜力。因此，我们提出了一个密集预测问题，并提出了一个结合多模态和多尺度特征的编码器-解码器架构来生成人类显著性地图。为此，设计了一种预训练-微调方法来解决模型训练问题。最后，我们得到了一个在HSB上训练的模型，名为人类显著性模仿者(HSI)。我们通过广泛的评估表明，HSI可以成功地预测HSB数据集上的人类显著性，而HSI生成的ImageNet上的人类显著性数据集展示了对XAI方法进行定性和定量基准测试的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing

自引率

0.00%

发文量