Journal of Artificial Intelligence Research最新文献

A Portfolio Approach to Massively Parallel Bayesian Optimization. 大规模并行贝叶斯优化的组合方法。

IF 4 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2025-01-01 Epub Date: 2025-01-14 DOI: 10.1613/jair.1.16868

Mickaël Binois, Nicholson Collier, Jonathan Ozik

One way to reduce the time of conducting optimization studies is to evaluate designs in parallel rather than just one-at-a-time. For expensive-to-evaluate black-boxes, batch versions of Bayesian optimization have been proposed. They work by building a surrogate model of the black-box to simultaneously select multiple designs via an infill criterion. Still, despite the increased availability of computing resources that enable large-scale parallelism, the strategies that work for selecting a few tens of parallel designs for evaluations become limiting due to the complexity of selecting more designs. It is even more crucial when the black-box is noisy, necessitating more evaluations as well as repeating experiments. Here we propose a scalable strategy that can keep up with massive batching natively, focused on the exploration/exploitation trade-off and a portfolio allocation. We compare the approach with related methods on noisy functions, for mono and multi-objective optimization tasks. These experiments show orders of magnitude speed improvements over existing methods with similar or better performance.

减少进行优化研究时间的一种方法是并行评估设计，而不是一次评估一个。对于难以评估的黑盒，已经提出了批处理版本的贝叶斯优化。他们通过建立黑盒的代理模型来同时通过填充标准选择多种设计。尽管越来越多的计算资源支持大规模并行，但由于选择更多设计的复杂性，用于选择几十个并行设计进行评估的策略变得有限。当黑匣子有噪声时，这就更加重要了，需要更多的评估和重复的实验。在这里，我们提出了一种可扩展的策略，可以在本地跟上大规模批处理，专注于勘探/开发权衡和投资组合分配。对于单目标和多目标优化任务，我们将该方法与有关的噪声函数方法进行了比较。这些实验表明，与具有类似或更好性能的现有方法相比，速度提高了几个数量级。

{"title":"A Portfolio Approach to Massively Parallel Bayesian Optimization.","authors":"Mickaël Binois, Nicholson Collier, Jonathan Ozik","doi":"10.1613/jair.1.16868","DOIUrl":"10.1613/jair.1.16868","url":null,"abstract":"<p><p>One way to reduce the time of conducting optimization studies is to evaluate designs in parallel rather than just one-at-a-time. For expensive-to-evaluate black-boxes, batch versions of Bayesian optimization have been proposed. They work by building a surrogate model of the black-box to simultaneously select multiple designs via an infill criterion. Still, despite the increased availability of computing resources that enable large-scale parallelism, the strategies that work for selecting a few tens of parallel designs for evaluations become limiting due to the complexity of selecting more designs. It is even more crucial when the black-box is noisy, necessitating more evaluations as well as repeating experiments. Here we propose a scalable strategy that can keep up with massive batching natively, focused on the exploration/exploitation trade-off and a portfolio allocation. We compare the approach with related methods on noisy functions, for mono and multi-objective optimization tasks. These experiments show orders of magnitude speed improvements over existing methods with similar or better performance.</p>","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"82 ","pages":"137-167"},"PeriodicalIF":4.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12459664/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145152041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Collective Belief Revision 集体信念修正

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-12-29 DOI: 10.1613/jair.1.15745

T. Aravanis

In this article, we study the dynamics of collective beliefs. As a first step, we formulate David Westlund’s Principle of Collective Change (PCC) —a criterion that characterizes the evolution of collective knowledge— in the realm of belief revision. Thereafter, we establish a number of unsatisfiability results pointing out that the widely-accepted revision operators of Alchourrón, Gärdenfors and Makinson, combined with fundamental types of merging operations —including the ones proposed by Konieczny and Pino Pérez as well as Baral et al.— collide with the PCC. These impossibility results essentially extend in the context of belief revision the negative results established by Westlund for the operations of contraction and expansion. At the opposite of the impossibility results, we also establish a number of satisfiability results, proving that, under certain (rather strict) requirements, the PCC is indeed respected for specific merging operators. Overall, it is argued that the PCC is a rather unsuitable property for characterizing the process of collective change. Last but not least, mainly in response to the unsatisfactory situation related to the PCC, we explore some alternative criteria of collective change, and evaluate their compliance with belief revision and belief merging.

在本文中，我们将研究集体信念的动态变化。首先，我们提出了戴维-韦斯特伦德（David Westlund）的集体变化原则（PCC）--一个描述集体知识演变的标准--在信念修正领域。此后，我们建立了一系列不可满解性结果，指出阿尔丘龙、盖登福斯和马金森的修订算子与基本类型的合并操作（包括科尼茨尼和皮诺-佩雷斯以及巴拉尔等人提出的合并操作）结合在一起，会与 PCC 发生冲突。这些不可能性结果实质上是在信念修正的背景下扩展了韦斯特伦德（Westlund）为收缩和膨胀操作建立的负面结果。与不可能性结果相反，我们还建立了一些可满足性结果，证明在某些（相当严格的）要求下，特定合并算子确实遵守了 PCC。总之，我们认为 PCC 是描述集体变化过程的一个相当不合适的属性。最后但并非最不重要的一点，主要是为了应对与 PCC 相关的不尽如人意的情况，我们探讨了集体变化的一些替代标准，并评估了它们与信念修正和信念合并的一致性。

{"title":"Collective Belief Revision","authors":"T. Aravanis","doi":"10.1613/jair.1.15745","DOIUrl":"https://doi.org/10.1613/jair.1.15745","url":null,"abstract":"In this article, we study the dynamics of collective beliefs. As a first step, we formulate David Westlund’s Principle of Collective Change (PCC) —a criterion that characterizes the evolution of collective knowledge— in the realm of belief revision. Thereafter, we establish a number of unsatisfiability results pointing out that the widely-accepted revision operators of Alchourrón, Gärdenfors and Makinson, combined with fundamental types of merging operations —including the ones proposed by Konieczny and Pino Pérez as well as Baral et al.— collide with the PCC. These impossibility results essentially extend in the context of belief revision the negative results established by Westlund for the operations of contraction and expansion. At the opposite of the impossibility results, we also establish a number of satisfiability results, proving that, under certain (rather strict) requirements, the PCC is indeed respected for specific merging operators. Overall, it is argued that the PCC is a rather unsuitable property for characterizing the process of collective change. Last but not least, mainly in response to the unsatisfactory situation related to the PCC, we explore some alternative criteria of collective change, and evaluate their compliance with belief revision and belief merging.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"65 s300","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139146750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Competitive Equilibria with a Constant Number of Chores 家务数量不变的竞争性均衡

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-12-28 DOI: 10.1613/jair.1.15260

J. Garg, Patricia C. McGlaughlin, Martin Hoefer, M. Schmalhofer

We study markets with mixed manna, where m divisible goods and chores shall be divided among n agents to obtain a competitive equilibrium. Equilibrium allocations are known to satisfy many fairness and efficiency conditions. While a lot of recent work in fair division is restricted to linear utilities and chores, we focus on a substantial generalization to separable piecewise-linear and concave (SPLC) utilities and mixed manna. We first derive polynomial-time algorithms for markets with a constant number of items or a constant number of agents. Our main result is a polynomial-time algorithm for instances with a constant number of chores (as well as any number of goods and agents) under the condition that chores dominate the utility of the agents. Interestingly, this stands in contrast to the case when the goods dominate the agents utility in equilibrium, where the problem is known to be PPAD-hard even without chores.

我们研究的是混合甘露市场，其中 m 种可分割商品和家务应在 n 个代理人之间进行分配，以获得竞争性均衡。众所周知，均衡分配满足许多公平和效率条件。最近很多关于公平分配的研究都局限于线性效用和家务，而我们则专注于对可分离的片状线性凹（SPLC）效用和混合甘露的实质性推广。我们首先推导了具有恒定项目数或恒定代理数的市场的多项式时间算法。我们的主要结果是，在家务活支配代理人效用的条件下，对于家务活数量不变的实例（以及任何数量的物品和代理人）的多项式时间算法。有趣的是，这与商品在均衡状态下支配代理人效用的情况形成了鲜明对比，在这种情况下，即使没有家务活，问题也是已知的 PPAD 难。

引用次数: 0

Improving Resource Allocations by Sharing in Pairs 通过成对共享改善资源分配

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-12-20 DOI: 10.1613/jair.1.15001

Robert Bredereck, A. Kaczmarczyk, Junjie Luo, Rolf Niedermeier, Florian Sachse

Given an initial resource allocation, where some agents may envy others or where a different distribution of resources might lead to a higher social welfare, our goal is to improve the allocation without reassigning resources. We consider a sharing concept allowing resources being shared with social network neighbors of the resource owners. More precisely, our model allows agents to form pairs which then may share a limited number of resources. Sharing a resource can come at some costs or loss in utility. To this end, we introduce a formal model that allows a central authority to compute an optimal sharing between neighbors based on an initial allocation. Advocating this point of view, we focus on the most basic scenario where each agent can participate in a bounded number of sharings. We present algorithms for optimizing utilitarian and egalitarian social welfare of allocations and for reducing the number of envious agents. In particular, we examine the computational complexity with respect to several natural parameters. Furthermore, we study cases with restricted social network structures and, among others, devise polynomial-time algorithms in path- and tree-like (hierarchical) social networks.

在初始资源分配中，一些代理可能会嫉妒其他代理，或者不同的资源分配可能会带来更高的社会福利，我们的目标是在不重新分配资源的情况下改善资源分配。我们考虑了一种共享概念，即允许与资源所有者的社会网络邻居共享资源。更确切地说，我们的模型允许代理人结成对子，然后共享有限数量的资源。共享资源可能会产生一定的成本或效用损失。为此，我们引入了一个正式模型，允许中央机构根据初始分配计算出邻居之间的最佳共享方式。基于这一观点，我们将重点放在最基本的情况上，即每个代理可以参与一定数量的共享。我们提出了优化分配的功利性和平等性社会福利以及减少妒忌代理数量的算法。我们特别考察了与几个自然参数相关的计算复杂性。此外，我们还研究了社会网络结构受限的情况，并在路径和树状（分层）社会网络中设计了多项式时间算法。

{"title":"Improving Resource Allocations by Sharing in Pairs","authors":"Robert Bredereck, A. Kaczmarczyk, Junjie Luo, Rolf Niedermeier, Florian Sachse","doi":"10.1613/jair.1.15001","DOIUrl":"https://doi.org/10.1613/jair.1.15001","url":null,"abstract":"Given an initial resource allocation, where some agents may envy others or where a different distribution of resources might lead to a higher social welfare, our goal is to improve the allocation without reassigning resources. We consider a sharing concept allowing resources being shared with social network neighbors of the resource owners. More precisely, our model allows agents to form pairs which then may share a limited number of resources. Sharing a resource can come at some costs or loss in utility. To this end, we introduce a formal model that allows a central authority to compute an optimal sharing between neighbors based on an initial allocation. Advocating this point of view, we focus on the most basic scenario where each agent can participate in a bounded number of sharings. We present algorithms for optimizing utilitarian and egalitarian social welfare of allocations and for reducing the number of envious agents. In particular, we examine the computational complexity with respect to several natural parameters. Furthermore, we study cases with restricted social network structures and, among others, devise polynomial-time algorithms in path- and tree-like (hierarchical) social networks.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"255 ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139170294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks 在简单、复杂和多对象注释任务中聚合注释的通用模型

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-12-11 DOI: 10.1613/jair.1.14388

Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease

Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A common strategy to improve label quality is to ask multiple annotators to label the same item and then aggregate their labels. To date, many aggregation models have been proposed for simple categorical or numerical annotation tasks, but far less work has considered more complex annotation tasks, such as those involving open-ended, multivariate, or structured responses. Similarly, while a variety of bespoke models have been proposed for specific tasks, our work is the first we are aware of to introduce aggregation methods that generalize across many, diverse complex tasks, including sequence labeling, translation, syntactic parsing, ranking, bounding boxes, and keypoints. This generality is achieved by applying readily available task-specific distance functions, then devising a task-agnostic method to model these distances between labels, rather than the labels themselves.This article presents a unified treatment of our prior work on complex annotation modeling and extends that work with investigation of three new research questions. First, how do complex annotation task and dataset properties impact aggregation accuracy? Second, how should a task owner navigate the many modeling choices in order to maximize aggregation accuracy? Finally, what tests and diagnoses can verify that aggregation models are specified correctly for the given data? To understand how various factors impact accuracy and to inform model selection, we conduct large-scale simulation studies and broad experiments on real, complex datasets. Regarding testing, we introduce the concept of unit tests for aggregation models and present a suite of such tests to ensure that a given model is not mis-specified and exhibits expected behavior.Beyond investigating these research questions above, we discuss the foundational concept and nature of annotation complexity, present a new aggregation model as a conceptual bridge between traditional models and our own, and contribute a new general semisupervised learning method for complex label aggregation that outperforms prior work.

人工标注对监督学习至关重要，但标注者经常会对正确的标签产生分歧，尤其是当标注任务的复杂性增加时。提高标签质量的常见策略是让多个标注者对同一项目进行标注，然后汇总他们的标签。迄今为止，许多聚合模型都是针对简单的分类或数字标注任务提出的，但考虑到更复杂的标注任务（如涉及开放式、多变量或结构化响应的任务）的工作则少得多。同样，虽然针对特定任务已经提出了多种定制模型，但我们的工作是我们所知的首个引入聚合方法的工作，该方法可通用于多种复杂任务，包括序列标注、翻译、句法分析、排序、边界框和关键点。本文对我们之前在复杂注释建模方面的工作进行了统一处理，并通过研究三个新的研究问题对这些工作进行了扩展。首先，复杂注释任务和数据集属性如何影响聚合准确性？其次，任务负责人应如何在众多建模选择中游刃有余，以最大限度地提高聚合准确性？最后，有哪些测试和诊断方法可以验证聚合模型是根据给定数据正确指定的？为了了解各种因素对准确性的影响并为模型选择提供信息，我们在真实、复杂的数据集上进行了大规模的模拟研究和广泛的实验。在测试方面，我们引入了聚合模型单元测试的概念，并提出了一套此类测试，以确保给定的模型没有被错误地指定，并表现出预期的行为。除了研究上述这些研究问题，我们还讨论了注释复杂性的基础概念和性质，提出了一种新的聚合模型，作为传统模型和我们自己的模型之间的概念桥梁，并为复杂标签聚合贡献了一种新的通用半监督学习方法，其性能优于之前的工作。

{"title":"A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks","authors":"Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease","doi":"10.1613/jair.1.14388","DOIUrl":"https://doi.org/10.1613/jair.1.14388","url":null,"abstract":"Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A common strategy to improve label quality is to ask multiple annotators to label the same item and then aggregate their labels. To date, many aggregation models have been proposed for simple categorical or numerical annotation tasks, but far less work has considered more complex annotation tasks, such as those involving open-ended, multivariate, or structured responses. Similarly, while a variety of bespoke models have been proposed for specific tasks, our work is the first we are aware of to introduce aggregation methods that generalize across many, diverse complex tasks, including sequence labeling, translation, syntactic parsing, ranking, bounding boxes, and keypoints. This generality is achieved by applying readily available task-specific distance functions, then devising a task-agnostic method to model these distances between labels, rather than the labels themselves.\u0000This article presents a unified treatment of our prior work on complex annotation modeling and extends that work with investigation of three new research questions. First, how do complex annotation task and dataset properties impact aggregation accuracy? Second, how should a task owner navigate the many modeling choices in order to maximize aggregation accuracy? Finally, what tests and diagnoses can verify that aggregation models are specified correctly for the given data? To understand how various factors impact accuracy and to inform model selection, we conduct large-scale simulation studies and broad experiments on real, complex datasets. Regarding testing, we introduce the concept of unit tests for aggregation models and present a suite of such tests to ensure that a given model is not mis-specified and exhibits expected behavior.\u0000Beyond investigating these research questions above, we discuss the foundational concept and nature of annotation complexity, present a new aggregation model as a conceptual bridge between traditional models and our own, and contribute a new general semisupervised learning method for complex label aggregation that outperforms prior work.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"194 1","pages":""},"PeriodicalIF":5.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138981361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Asymptotics of K-Fold Cross Validation K-Fold交叉验证的渐近性

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-11-14 DOI: 10.1613/jair.1.13974

Jessie Li

This paper investigates the asymptotic distribution of the K-fold cross validation error in an i.i.d. setting. As the number of observations n goes to infinity while keeping the number of folds K fixed, the K-fold cross validation error is √ n-consistent for the expected out-of-sample error and has an asymptotically normal distribution. A consistent estimate of the asymptotic variance is derived and used to construct asymptotically valid confidence intervals for the expected out-of-sample error. A hypothesis test is developed for comparing two estimators’ expected out-of-sample errors and a subsampling procedure is used to obtain critical values. Monte Carlo simulations demonstrate the asymptotic validity of our confidence intervals for the expected out-of-sample error and investigate the size and power properties of our test. In our empirical application, we use our estimator selection test to compare the out-of-sample predictive performance of OLS, Neural Networks, and Random Forests for predicting the sale price of a domain name in a GoDaddy expiry auction.

本文研究了K-fold交叉验证误差在i - id条件下的渐近分布。当观察值n趋于无穷，同时保持K的折叠次数固定时，K-fold交叉验证误差与期望的样本外误差是√n一致的，并且具有渐近正态分布。导出了渐近方差的一致估计，并用于构造期望样本外误差的渐近有效置信区间。假设检验用于比较两个估计器的期望样本外误差，并使用子抽样程序来获得临界值。蒙特卡罗模拟证明了期望样本外误差的置信区间的渐近有效性，并研究了我们测试的大小和功率特性。在我们的实证应用中，我们使用我们的估计器选择测试来比较OLS、神经网络和随机森林的样本外预测性能，以预测GoDaddy到期拍卖中域名的销售价格。

引用次数: 0

Diagnosing AI Explanation Methods with Folk Concepts of Behavior 用民间行为概念诊断人工智能解释方法

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-11-14 DOI: 10.1613/jair.1.14053

Alon Jacovi, Jasmijn Bastings, Sebastian Gehrmann, Yoav Goldberg, Katja Filippova

We investigate a formalism for the conditions of a successful explanation of AI. We consider “success” to depend not only on what information the explanation contains, but also on what information the human explainee understands from it. Theory of mind literature discusses the folk concepts that humans use to understand and generalize behavior. We posit that folk concepts of behavior provide us with a “language” that humans understand behavior with. We use these folk concepts as a framework of social attribution by the human explainee—the information constructs that humans are likely to comprehend from explanations—by introducing a blueprint for an explanatory narrative (Figure 1) that explains AI behavior with these constructs. We then demonstrate that many XAI methods today can be mapped to folk concepts of behavior in a qualitative evaluation. This allows us to uncover their failure modes that prevent current methods from explaining successfully—i.e., the information constructs that are missing for any given XAI method, and whose inclusion can decrease the likelihood of misunderstanding AI behavior.

我们研究了一个成功解释人工智能的条件的形式主义。我们认为“成功”不仅取决于解释包含什么信息，还取决于被解释者从中理解了什么信息。心理理论文献讨论了人类用来理解和概括行为的民间概念。我们假设，民间的行为概念为我们提供了一种人类理解行为的“语言”。我们使用这些民间概念作为人类解释者的社会归因框架-人类可能从解释中理解的信息结构-通过引入解释性叙述的蓝图(图1)，用这些结构解释人工智能行为。然后，我们证明了今天的许多XAI方法可以映射到定性评估中的民间行为概念。这使我们能够发现它们的失效模式，这些模式阻碍了当前方法的成功解释。即任何给定的XAI方法所缺少的信息结构，包含这些信息结构可以减少误解AI行为的可能性。

{"title":"Diagnosing AI Explanation Methods with Folk Concepts of Behavior","authors":"Alon Jacovi, Jasmijn Bastings, Sebastian Gehrmann, Yoav Goldberg, Katja Filippova","doi":"10.1613/jair.1.14053","DOIUrl":"https://doi.org/10.1613/jair.1.14053","url":null,"abstract":"We investigate a formalism for the conditions of a successful explanation of AI. We consider “success” to depend not only on what information the explanation contains, but also on what information the human explainee understands from it. Theory of mind literature discusses the folk concepts that humans use to understand and generalize behavior. We posit that folk concepts of behavior provide us with a “language” that humans understand behavior with. We use these folk concepts as a framework of social attribution by the human explainee—the information constructs that humans are likely to comprehend from explanations—by introducing a blueprint for an explanatory narrative (Figure 1) that explains AI behavior with these constructs. We then demonstrate that many XAI methods today can be mapped to folk concepts of behavior in a qualitative evaluation. This allows us to uncover their failure modes that prevent current methods from explaining successfully—i.e., the information constructs that are missing for any given XAI method, and whose inclusion can decrease the likelihood of misunderstanding AI behavior.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"48 17","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134992188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

How to Tell Easy from Hard: Complexities of Conjunctive Query Entailment in Extensions of ALC 如何分辨难易:ALC扩展中连词查询蕴涵的复杂性

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-11-11 DOI: 10.1613/jair.1.14482

Bartosz Bednarczyk, Sebastian Rudolph

It is commonly known that the conjunctive query entailment problem for certain extensions of (the well-known ontology language) ALC is computationally harder than their knowledge base satisfiability problem while for others the complexities coincide, both under the standard and the finite-model semantics. We expose a uniform principle behind this divide by identifying a wide class of (finitely) locally-forward description logics, for which we prove that (finite) query entailment problem can be solved by a reduction to exponentially many calls of the (finite) knowledge base satisfiability problem. Consequently, our algorithm yields tight ExpTime upper bounds for locally-forward logics with ExpTime-complete knowledge base satisfiability problem, including logics between ALC and µALCHbregQ (and more), as well as ALCSCC with global cardinality constraints, for which the complexity of querying remained open. Moreover, to make our technique applicable in future research, we provide easy-to-check sufficient conditions for a logic to be locally-forward based several versions of the on model-theoretic notion of unravellings. Together with existing results, this provides a nearly complete classification of the “benign” vs. “malign” primitive modelling features extending ALC, missing out only the Self operator. We then show a rather counter-intuitive result, namely that the conjunctive entailment problem for ALCSelf is exponentially harder than for ALC. This places the seemingly innocuous Self operator among the “malign” modelling features, like inverses, transitivity or nominals.

众所周知，在标准语义和有限模型语义下，ALC的某些扩展的合取查询蕴涵问题比其知识库可满足性问题在计算上要困难，而其他扩展的复杂性是一致的。我们通过识别一类广泛的(有限)局部前向描述逻辑，揭示了这种划分背后的统一原则，为此我们证明了(有限)查询蕴涵问题可以通过将(有限)知识库可满足性问题的调用简化为指数级多来解决。因此，对于具有ExpTime-complete知识库可满足性问题的局部前向逻辑，包括ALC和µALCHbregQ之间的逻辑(以及更多)，以及具有全局基数约束的ALCSCC，我们的算法产生了严格的ExpTime上界，查询的复杂性仍然是开放的。此外，为了使我们的技术适用于未来的研究，我们提供了易于检查的充分条件，使逻辑能够基于非模型理论的解开概念的几个版本进行局部前向。与现有的结果一起，这提供了扩展ALC的“良性”与“恶性”原始建模特征的几乎完整的分类，只遗漏了Self算子。然后，我们展示了一个相当反直觉的结果，即ALCSelf的合取蕴涵问题比ALC的难得多。这将看似无害的Self操作符置于“恶意”的建模特征中，如逆、及物性或标称。

{"title":"How to Tell Easy from Hard: Complexities of Conjunctive Query Entailment in Extensions of ALC","authors":"Bartosz Bednarczyk, Sebastian Rudolph","doi":"10.1613/jair.1.14482","DOIUrl":"https://doi.org/10.1613/jair.1.14482","url":null,"abstract":"It is commonly known that the conjunctive query entailment problem for certain extensions of (the well-known ontology language) ALC is computationally harder than their knowledge base satisfiability problem while for others the complexities coincide, both under the standard and the finite-model semantics. We expose a uniform principle behind this divide by identifying a wide class of (finitely) locally-forward description logics, for which we prove that (finite) query entailment problem can be solved by a reduction to exponentially many calls of the (finite) knowledge base satisfiability problem. Consequently, our algorithm yields tight ExpTime upper bounds for locally-forward logics with ExpTime-complete knowledge base satisfiability problem, including logics between ALC and µALCHbregQ (and more), as well as ALCSCC with global cardinality constraints, for which the complexity of querying remained open. Moreover, to make our technique applicable in future research, we provide easy-to-check sufficient conditions for a logic to be locally-forward based several versions of the on model-theoretic notion of unravellings. Together with existing results, this provides a nearly complete classification of the “benign” vs. “malign” primitive modelling features extending ALC, missing out only the Self operator. We then show a rather counter-intuitive result, namely that the conjunctive entailment problem for ALCSelf is exponentially harder than for ALC. This places the seemingly innocuous Self operator among the “malign” modelling features, like inverses, transitivity or nominals.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"10 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135086677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Comprehensive Survey on Deep Graph Representation Learning Methods 深度图表示学习方法综述

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-10-25 DOI: 10.1613/jair.1.14768

Ijeoma Amuche Chikwendu, Xiaoling Zhang, Isaac Osei Agyemang, Isaac Adjei-Mensah, Ukwuoma Chiagoziem Chima, Chukwuebuka Joseph Ejiyi

There has been a lot of activity in graph representation learning in recent years. Graph representation learning aims to produce graph representation vectors to represent the structure and characteristics of huge graphs precisely. This is crucial since the effectiveness of the graph representation vectors will influence how well they perform in subsequent tasks like anomaly detection, connection prediction, and node classification. Recently, there has been an increase in the use of other deep-learning breakthroughs for data-based graph problems. Graph-based learning environments have a taxonomy of approaches, and this study reviews all their learning settings. The learning problem is theoretically and empirically explored. This study briefly introduces and summarizes the Graph Neural Architecture Search (G-NAS), outlines several Graph Neural Networks’ drawbacks, and suggests some strategies to mitigate these challenges. Lastly, the study discusses several potential future study avenues yet to be explored.

近年来，在图表示学习方面有很多活动。图表示学习的目的是生成图表示向量，以精确地表示庞大图的结构和特征。这是至关重要的，因为图表示向量的有效性将影响它们在异常检测、连接预测和节点分类等后续任务中的表现。最近，在基于数据的图形问题上使用其他深度学习突破的情况有所增加。基于图的学习环境有一种方法分类，本研究回顾了所有的学习设置。从理论和实证两方面探讨了学习问题。本研究简要介绍和总结了图神经架构搜索(G-NAS)，概述了图神经网络的几个缺点，并提出了一些缓解这些挑战的策略。最后，本研究讨论了几个潜在的未来研究途径尚未探索。

引用次数: 0

Select and Augment: Enhanced Dense Retrieval Knowledge Graph Augmentation 选择和增强:增强的密集检索知识图增强

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-10-25 DOI: 10.1613/jair.1.14365

Micheal Abaho, Yousef H. Alfaifi

Injecting textual information into knowledge graph (KG) entity representations has been a worthwhile expedition in terms of improving performance in KG oriented tasks within the NLP community. External knowledge often adopted to enhance KG embeddings ranges from semantically rich lexical dependency parsed features to a set of relevant key words to entire text descriptions supplied from an external corpus such as wikipedia and many more. Despite the gains this innovation (Text-enhanced KG embeddings) has made, the proposal in this work suggests that it can be improved even further. Instead of using a single text description (which would not sufficiently represent an entity because of the inherent lexical ambiguity of text), we propose a multi-task framework that jointly selects a set of text descriptions relevant to KG entities as well as align or augment KG embeddings with text descriptions. Different from prior work that plugs formal entity descriptions declared in knowledge bases, this framework leverages a retriever model to selectively identify richer or highly relevant text descriptions to use in augmenting entities. Furthermore, the framework treats the number of descriptions to use in augmentation process as a parameter, which allows the flexibility of enumerating across several numbers before identifying an appropriate number. Experiment results for Link Prediction demonstrate a 5.5% and 3.5% percentage increase in the Mean Reciprocal Rank (MRR) and Hits@10 scores respectively, in comparison to text-enhanced knowledge graph augmentation methods using traditional CNNs.

在NLP社区中，将文本信息注入知识图(KG)实体表示是提高面向KG任务性能的一项有价值的探索。通常用于增强KG嵌入的外部知识范围从语义丰富的词汇依赖解析特征到一组相关关键字，再到外部语料库(如wikipedia等)提供的完整文本描述。尽管这种创新(文本增强的KG嵌入)已经取得了进展，但这项工作中的建议表明它可以进一步改进。代替使用单一的文本描述(由于文本固有的词汇歧义，不能充分代表实体)，我们提出了一个多任务框架，共同选择一组与KG实体相关的文本描述，并将KG嵌入与文本描述对齐或增强。与之前插入知识库中声明的正式实体描述的工作不同，该框架利用检索器模型有选择地识别更丰富或高度相关的文本描述，以用于扩展实体。此外，框架将在增强过程中使用的描述数量作为参数，这允许在确定适当的数字之前枚举多个数字的灵活性。实验结果表明，与使用传统cnn的文本增强知识图增强方法相比，链接预测的平均倒数秩(MRR)和Hits@10分数分别提高了5.5%和3.5%。

{"title":"Select and Augment: Enhanced Dense Retrieval Knowledge Graph Augmentation","authors":"Micheal Abaho, Yousef H. Alfaifi","doi":"10.1613/jair.1.14365","DOIUrl":"https://doi.org/10.1613/jair.1.14365","url":null,"abstract":"Injecting textual information into knowledge graph (KG) entity representations has been a worthwhile expedition in terms of improving performance in KG oriented tasks within the NLP community. External knowledge often adopted to enhance KG embeddings ranges from semantically rich lexical dependency parsed features to a set of relevant key words to entire text descriptions supplied from an external corpus such as wikipedia and many more. Despite the gains this innovation (Text-enhanced KG embeddings) has made, the proposal in this work suggests that it can be improved even further. Instead of using a single text description (which would not sufficiently represent an entity because of the inherent lexical ambiguity of text), we propose a multi-task framework that jointly selects a set of text descriptions relevant to KG entities as well as align or augment KG embeddings with text descriptions. Different from prior work that plugs formal entity descriptions declared in knowledge bases, this framework leverages a retriever model to selectively identify richer or highly relevant text descriptions to use in augmenting entities. Furthermore, the framework treats the number of descriptions to use in augmentation process as a parameter, which allows the flexibility of enumerating across several numbers before identifying an appropriate number. Experiment results for Link Prediction demonstrate a 5.5% and 3.5% percentage increase in the Mean Reciprocal Rank (MRR) and Hits@10 scores respectively, in comparison to text-enhanced knowledge graph augmentation methods using traditional CNNs.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135218524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0