
Applied AI letters: latest publications

Qualitative Investigation in Explainable Artificial Intelligence: Further Insight from Social Science
Pub Date : 2022-01-17 DOI: 10.1002/ail2.64
Adam J. Johs, Denise E. Agosto, Rosina O. Weber
{"title":"Qualitative Investigation in Explainable Artificial Intelligence: Further Insight from Social Science","authors":"Adam J. Johs, Denise E. Agosto, Rosina O. Weber","doi":"10.1002/ail2.64","DOIUrl":"https://doi.org/10.1002/ail2.64","url":null,"abstract":"","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44715694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Generative model-enhanced human motion prediction
Pub Date : 2022-01-17 DOI: 10.1002/ail2.63
Anthony Bourached, Ryan-Rhys Griffiths, Robert Gray, Ashwani Jha, Parashkev Nachev

The task of predicting human motion is complicated by the natural heterogeneity and compositionality of actions, necessitating robustness to distributional shift up to and including out-of-distribution (OoD) inputs. Here, we formulate a new OoD benchmark based on the Human3.6M and Carnegie Mellon University (CMU) motion capture datasets, and introduce a hybrid framework for hardening discriminative architectures to OoD failure by augmenting them with a generative model. When applied to current state-of-the-art discriminative models, we show that the proposed approach improves OoD robustness without sacrificing in-distribution performance, and can theoretically facilitate model interpretability. We suggest human motion predictors ought to be constructed with OoD challenges in mind, and provide an extensible general framework for hardening diverse discriminative architectures to extreme distributional shift. The code is available at: https://github.com/bouracha/OoDMotion.
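As a concrete illustration of the hybrid idea, the minimal sketch below pairs a discriminative sequence predictor with a generative (VAE-style) branch whose reconstruction-plus-KL term can serve as an OoD signal. The class name, layer sizes, and the choice to reconstruct the hidden state are assumptions for illustration, not the repository's implementation.

```python
# Minimal sketch (not the authors' code): a discriminative next-pose predictor
# augmented with a VAE branch whose ELBO terms act as an out-of-distribution score.
import torch
import torch.nn as nn

class HybridMotionPredictor(nn.Module):
    def __init__(self, pose_dim=66, hidden_dim=256, latent_dim=32):
        super().__init__()
        self.encoder = nn.GRU(pose_dim, hidden_dim, batch_first=True)
        # Discriminative head: predicts the next pose from the sequence summary.
        self.predictor = nn.Linear(hidden_dim, pose_dim)
        # Generative branch: VAE over the hidden state.
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)
        self.decode = nn.Linear(latent_dim, hidden_dim)

    def forward(self, poses):                      # poses: (batch, time, pose_dim)
        _, h = self.encoder(poses)                 # h: (1, batch, hidden_dim)
        h = h.squeeze(0)
        next_pose = self.predictor(h)              # discriminative prediction
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()
        h_rec = self.decode(z)                     # generative reconstruction
        kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(dim=1)
        recon = (h_rec - h.detach()).pow(2).sum(dim=1)
        return next_pose, recon + kl               # second term ~ OoD score
```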

Citations: 12
Deep Learning does not Replace Bayesian Modeling: Comparing research use via citation counting
Pub Date : 2022-01-05 DOI: 10.1002/ail2.62
B. Baldwin
{"title":"Deep Learning does not Replace Bayesian Modeling: Comparing research use via citation counting","authors":"B. Baldwin","doi":"10.1002/ail2.62","DOIUrl":"https://doi.org/10.1002/ail2.62","url":null,"abstract":"","PeriodicalId":72253,"journal":{"name":"Applied AI letters","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47852004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
DARPA's explainable AI (XAI) program: A retrospective
Pub Date : 2021-12-04 DOI: 10.1002/ail2.61
David Gunning, Eric Vorm, Jennifer Yunyan Wang, Matt Turek

A summary of the Defense Advanced Research Projects Agency's (DARPA) explainable artificial intelligence (XAI) program from the program managers' and evaluators' perspective.

Citations: 23
Reframing explanation as an interactive medium: The EQUAS (Explainable QUestion Answering System) project
Pub Date : 2021-11-30 DOI: 10.1002/ail2.60
William Ferguson, Dhruv Batra, Raymond Mooney, Devi Parikh, Antonio Torralba, David Bau, David Diller, Josh Fasching, Jaden Fiotto-Kaufman, Yash Goyal, Jeff Miller, Kerry Moffitt, Alex Montes de Oca, Ramprasaath R. Selvaraju, Ayush Shrivastava, Jialin Wu, Stefan Lee

This letter is a retrospective analysis of our team's research for the Defense Advanced Research Projects Agency Explainable Artificial Intelligence project. Our initial approach was to use salience maps, English sentences, and lists of feature names to explain the behavior of deep-learning-based discriminative systems, with particular focus on visual question answering systems. We found that presenting static explanations along with answers led to limited positive effects. By exploring various combinations of machine and human explanation production and consumption, we evolved a notion of explanation as an interactive process that takes place usually between humans and artificial intelligence systems but sometimes within the software system. We realized that by interacting via explanations people could task and adapt machine learning (ML) agents. We added affordances for editing explanations and modified the ML system to act in accordance with the edits to produce an interpretable interface to the agent. Through this interface, editing an explanation can adapt a system's performance to new, modified purposes. This deep tasking, wherein the agent knows its objective and the explanation for that objective, will be critical to enable higher levels of autonomy.

Citations: 0
Towards structured NLP interpretation via graph explainers
Pub Date : 2021-11-26 DOI: 10.1002/ail2.58
Hao Yuan, Fan Yang, Mengnan Du, Shuiwang Ji, Xia Hu

Natural language processing (NLP) models have been increasingly deployed in real-world applications, and interpretation of textual data has also attracted dramatic attention recently. Most existing methods generate feature importance interpretations, which indicate the contribution of each word to a specific model prediction. Text data typically possess highly structured characteristics, and feature importance explanations cannot fully reveal the rich information contained in text. To bridge this gap, we propose to generate structured interpretations for textual data. Specifically, we pre-process the original text using dependency parsing, which transforms the text from a sequence into a graph. Then graph neural networks (GNNs) are utilized to classify the transformed graphs. In particular, we explore two kinds of structured interpretation for pre-trained GNNs: edge-level interpretation and subgraph-level interpretation. Experimental results over three text datasets demonstrate that the structured interpretation can better reveal the structured knowledge encoded in the text. The experimental analysis further indicates that the proposed interpretations can faithfully reflect the decision-making process of the GNN model.
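A minimal sketch of the pipeline described above, assuming spaCy for dependency parsing and PyTorch Geometric for the graph classifier; the feature choices, sizes, and class names are illustrative rather than the authors' code. Once such a classifier is trained, edge-level or subgraph-level interpretations can be produced by a graph explainer (for example, GNNExplainer-style masking).

```python
# Sketch: dependency-parse a sentence into a graph, then classify it with a GNN.
import torch
import spacy
from torch_geometric.data import Data
from torch_geometric.nn import GCNConv, global_mean_pool

nlp = spacy.load("en_core_web_sm")

def text_to_graph(text, vocab):
    """Turn a sentence into a graph whose edges follow the dependency parse."""
    doc = nlp(text)
    edges = [[tok.head.i, tok.i] for tok in doc if tok.head.i != tok.i]
    edge_index = torch.tensor(edges, dtype=torch.long).t().contiguous()
    node_ids = torch.tensor([vocab.get(tok.lower_, 0) for tok in doc])
    return Data(x=node_ids, edge_index=edge_index)

class TextGNN(torch.nn.Module):
    def __init__(self, vocab_size, hidden=64, num_classes=2):
        super().__init__()
        self.embed = torch.nn.Embedding(vocab_size, hidden)
        self.conv1 = GCNConv(hidden, hidden)
        self.conv2 = GCNConv(hidden, hidden)
        self.out = torch.nn.Linear(hidden, num_classes)

    def forward(self, data, batch=None):
        x = self.embed(data.x)
        x = torch.relu(self.conv1(x, data.edge_index))
        x = torch.relu(self.conv2(x, data.edge_index))
        if batch is None:  # single graph
            batch = torch.zeros(x.size(0), dtype=torch.long)
        return self.out(global_mean_pool(x, batch))  # graph-level prediction
```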

Citations: 2
Explainable activity recognition in videos: Lessons learned
Pub Date : 2021-11-26 DOI: 10.1002/ail2.59
Chiradeep Roy, Mahsan Nourani, Donald R. Honeycutt, Jeremy E. Block, Tahrima Rahman, Eric D. Ragan, Nicholas Ruozzi, Vibhav Gogate

We consider the following activity recognition task: given a video, infer the set of activities being performed in the video and assign each frame to an activity. This task can be solved using modern deep learning architectures based on neural networks or conventional classifiers such as linear models and decision trees. While neural networks exhibit superior predictive performance as compared with decision trees and linear models, they are also uninterpretable and less explainable. We address this accuracy-explainability gap using a novel framework that feeds the output of a deep neural network to an interpretable, tractable probabilistic model called dynamic cutset networks, and performs joint reasoning over the two to answer questions. The neural network helps achieve high accuracy, while the dynamic cutset networks, thanks to their polytime probabilistic reasoning capabilities, make the system more explainable. We demonstrate the efficacy of our approach by using it to build three prototype systems that solve human-machine tasks of varying levels of difficulty, using cooking videos as an accessible domain. We describe high-level technical details and key lessons learned from our human subjects evaluations of these systems.
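The coupling of per-frame neural-network scores with a tractable temporal model can be pictured with the stand-in sketch below, which substitutes a simple first-order (HMM-style) Viterbi decoder for the dynamic cutset networks used in the paper; it is meant only to show the joint-reasoning pattern, not the authors' model.

```python
# Stand-in sketch: joint reasoning over per-frame NN posteriors with a simple
# first-order temporal model (Viterbi decoding). The paper uses dynamic cutset
# networks instead; this only illustrates the "NN scores + tractable model" idea.
import numpy as np

def viterbi_segment(frame_log_probs, log_transition):
    """frame_log_probs: (T, K) log P(activity | frame) from the neural network.
    log_transition:     (K, K) log P(activity_t | activity_{t-1}).
    Returns the most likely activity label for every frame."""
    T, K = frame_log_probs.shape
    score = frame_log_probs[0].copy()
    back = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        cand = score[:, None] + log_transition          # (K_prev, K_curr)
        back[t] = cand.argmax(axis=0)
        score = cand.max(axis=0) + frame_log_probs[t]
    labels = np.zeros(T, dtype=int)
    labels[-1] = score.argmax()
    for t in range(T - 2, -1, -1):                      # backtrack
        labels[t] = back[t + 1, labels[t + 1]]
    return labels
```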

Citations: 4
Toward explainable and advisable model for self-driving cars
Pub Date : 2021-11-23 DOI: 10.1002/ail2.56
Jinkyu Kim, Anna Rohrbach, Zeynep Akata, Suhong Moon, Teruhisa Misu, Yi-Ting Chen, Trevor Darrell, John Canny

Humans learn to drive through both practice and theory, for example, by studying the rules, while most self-driving systems are limited to the former. Being able to incorporate human knowledge of typical causal driving behavior should benefit autonomous systems. We propose a new approach that learns vehicle control with the help of human advice. Specifically, our system learns to summarize its visual observations in natural language, predict an appropriate action response (e.g., “I see a pedestrian crossing, so I stop”), and predict the controls accordingly. Moreover, to enhance the interpretability of our system, we introduce a fine-grained attention mechanism that relies on semantic segmentation and object-centric RoI pooling. We show that our approach of training the autonomous system with human advice, grounded in a rich semantic representation, matches or outperforms prior work in terms of control prediction and explanation generation. Our approach also results in more interpretable visual explanations by visualizing object-centric attention maps. We evaluate our approach on a novel driving dataset with ground-truth human explanations, the Berkeley DeepDrive eXplanation (BDD-X) dataset.
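One plausible realisation of the object-centric attention mechanism is sketched below: detected-object boxes are RoI-pooled from a backbone feature map and a learned score produces attention weights over objects; the module name, dimensions, and pooling choices are assumptions, not the authors' implementation.

```python
# Sketch: object-centric RoI pooling with a learned attention over the pooled
# object features; the attention weights double as an interpretable map.
import torch
import torch.nn as nn
from torchvision.ops import roi_align

class ObjectAttention(nn.Module):
    def __init__(self, channels=256, pooled=7):
        super().__init__()
        feat_dim = channels * pooled * pooled
        self.score = nn.Linear(feat_dim, 1)   # attention weight per object
        self.pooled = pooled

    def forward(self, feature_map, boxes):
        # feature_map: (1, C, H, W) backbone features for one image
        # boxes:       (N, 4) float object boxes in feature-map coordinates
        rois = roi_align(feature_map, [boxes], output_size=self.pooled)
        rois = rois.flatten(1)                         # (N, C*pooled*pooled)
        attn = torch.softmax(self.score(rois), dim=0)  # (N, 1), sums to 1 over objects
        context = (attn * rois).sum(dim=0)             # attention-weighted summary
        return context, attn.squeeze(1)                # context feeds the control head
```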

Citations: 3
Objective criteria for explanations of machine learning models
Pub Date : 2021-11-23 DOI: 10.1002/ail2.57
Chih-Kuan Yeh, Pradeep Ravikumar

Objective criteria to evaluate the performance of machine learning (ML) model explanations are a critical ingredient in bringing greater rigor to the field of explainable artificial intelligence. In this article, we survey three of our proposed criteria that each target different classes of explanations. In the first, targeted at real-valued feature importance explanations, we define a class of “infidelity” measures that capture how well the explanations match the ML models. We show that instances of such infidelity minimizing explanations correspond to many popular recently proposed explanations and, moreover, can be shown to satisfy well-known game-theoretic axiomatic properties. In the second, targeted to feature set explanations, we define a robustness analysis-based criterion and show that deriving explainable feature sets based on the robustness criterion yields more qualitatively impressive explanations. Lastly, for sample explanations, we provide a decomposition-based criterion that allows us to provide very scalable and compelling classes of sample-based explanations.
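For the first class of criteria, a perturbation-based infidelity measure can be estimated by Monte Carlo as in the sketch below: it asks how well the inner product of a random perturbation with the attribution predicts the resulting change in the model's output. The Gaussian perturbation family and estimator details are assumptions for illustration; the paper's exact definition may differ.

```python
# Sketch: Monte-Carlo estimate of an infidelity-style measure for a
# feature-importance explanation of a scalar-output model.
import numpy as np

def infidelity(model_fn, x, attribution, n_samples=1000, sigma=0.1, seed=0):
    """model_fn:     callable mapping a 1-D input array to a scalar score.
    x:            (d,) input being explained.
    attribution:  (d,) feature-importance scores for x."""
    rng = np.random.default_rng(seed)
    fx = model_fn(x)
    errs = []
    for _ in range(n_samples):
        pert = rng.normal(scale=sigma, size=x.shape)   # random perturbation I
        predicted_drop = pert @ attribution            # I^T * attribution
        actual_drop = fx - model_fn(x - pert)          # f(x) - f(x - I)
        errs.append((predicted_drop - actual_drop) ** 2)
    return float(np.mean(errs))                        # lower is more faithful
```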

Citations: 3
Generating visual explanations with natural language
Pub Date : 2021-11-22 DOI: 10.1002/ail2.55
Lisa Anne Hendricks, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Zeynep Akata

We generate natural language explanations for a fine-grained visual recognition task. Our explanations fulfill two criteria. First, explanations are class discriminative, meaning they mention attributes in an image which are important to identify a class. Second, explanations are image relevant, meaning they reflect the actual content of an image. Our system, composed of an explanation sampler and phrase-critic model, generates class discriminative and image relevant explanations. In addition, we demonstrate that our explanations can help humans decide whether to accept or reject an AI decision.
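The sampler-plus-critic composition can be pictured with the minimal sketch below, where caption_sampler and phrase_critic are hypothetical placeholders for the trained explanation generator and for the critic that scores whether the mentioned attributes are class discriminative and actually visible in the image.

```python
# Sketch: sample candidate explanations, then rerank them with a phrase critic.
# caption_sampler and phrase_critic are hypothetical placeholders, not real APIs.
def explain(image, predicted_class, caption_sampler, phrase_critic, n_candidates=10):
    candidates = [caption_sampler(image, predicted_class) for _ in range(n_candidates)]
    # The critic returns a higher score when the mentioned attributes are both
    # discriminative for the predicted class and grounded in the image.
    scored = [(phrase_critic(image, predicted_class, c), c) for c in candidates]
    return max(scored)[1]   # best-scoring explanation
```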

Citations: 5