Cardinality-Minimal Explanations for Monotonic Neural Networks

International Joint Conference on Artificial Intelligence Pub Date : 2022-05-19 DOI:10.24963/ijcai.2023/409

Ouns El Harzli, B. C. Grau, Ian Horrocks

引用次数: 0

Abstract

In recent years, there has been increasing interest in explanation methods for neural model predictions that offer precise formal guarantees. These include abductive (respectively, contrastive) methods, which aim to compute minimal subsets of input features that are sufficient for a given prediction to hold (respectively, to change a given prediction). The corresponding decision problems are, however, known to be intractable. In this paper, we investigate whether tractability can be regained by focusing on neural models implementing a monotonic function. Although the relevant decision problems remain intractable, we can show that they become solvable in polynomial time by means of greedy algorithms if we additionally assume that the activation functions are continuous everywhere and differentiable almost everywhere. Our experiments suggest favourable performance of our algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

单调神经网络的基数最小解释

近年来，人们对提供精确形式保证的神经模型预测的解释方法越来越感兴趣。这些方法包括溯因(分别是对比)方法，其目的是计算最小的输入特征子集，这些子集足以维持给定的预测(分别是改变给定的预测)。然而，相应的决策问题被认为是难以处理的。在本文中，我们通过关注实现单调函数的神经模型来研究是否可以恢复可溯性。虽然相关的决策问题仍然难以解决，但我们可以证明，如果我们另外假设激活函数处处连续且几乎处处可微，则可以用贪心算法在多项式时间内解决这些问题。我们的实验表明我们的算法具有良好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Joint Conference on Artificial Intelligence

自引率

0.00%

发文量

期刊最新文献

Towards Formal Verification of Neuro-symbolic Multi-agent Systems RuleMatch: Matching Abstract Rules for Semi-supervised Learning of Human Standard Intelligence Tests Computing (1+epsilon)-Approximate Degeneracy in Sublinear Time AI and Decision Support for Sustainable Socio-Ecosystems Contrastive Learning and Reward Smoothing for Deep Portfolio Management