Journal of Artificial Intelligence Research最新文献_第2页

Embedding Ontologies in the Description Logic ALC by Axis-Aligned Cones 轴向锥在描述逻辑ALC中嵌入本体

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-10-23 DOI: 10.1613/jair.1.13939

Özgür Lütfü Özcep, Mena Leemhuis, Diedrich Wolter

This paper is concerned with knowledge graph embedding with background knowledge, taking the formal perspective of logics. In knowledge graph embedding, knowledge— expressed as a set of triples of the form (a R b) (“a is R-related to b”)—is embedded into a real-valued vector space. The embedding helps exploiting geometrical regularities of the space in order to tackle typical inductive tasks of machine learning such as link prediction. Recent embedding approaches also consider incorporating background knowledge, in which the intended meanings of the symbols a, R, b are further constrained via axioms of a theory. Of particular interest are theories expressed in a formal language with a neat semantics and a good balance between expressivity and feasibility. In that case, the knowledge graph together with the background can be considered to be an ontology. This paper develops a cone-based theory for embedding in order to advance the expressivity of the ontology: it works (at least) with ontologies expressed in the description logic ALC, which comprises restricted existential and universal quantifiers, as well as concept negation and concept disjunction. In order to align the classical Tarskian Style semantics for ALC with the sub-symbolic representation of triples, we use the notion of a geometric model of an ALC ontology and show, as one of our main results, that an ALC ontology is satisfiable in the classical sense iff it is satisfiable by a geometric model based on cones. The geometric model, if treated as a partial model, can even be chosen to be faithful, i.e., to reflect all and only the knowledge captured by the ontology. We introduce the class of axis-aligned cones and show that modulo simple geometric operations any distributive logic (such as ALC) interpreted over cones employs this class of cones. Cones are also attractive from a machine learning perspective on knowledge graph embeddings since they give rise to applying conic optimization techniques.

本文从逻辑的形式化角度研究知识图与背景知识的嵌入问题。在知识图嵌入中，知识被表示为(a R b)(“a是R与b相关的”)形式的一组三元组，并嵌入到实值向量空间中。嵌入有助于利用空间的几何规律，以解决机器学习的典型归纳任务，如链接预测。最近的嵌入方法还考虑纳入背景知识，其中符号a, R, b的预期含义通过理论的公理进一步受到约束。特别感兴趣的是用形式语言表达的理论，它具有整洁的语义，并且在表达性和可行性之间取得了良好的平衡。在这种情况下，知识图和背景可以被认为是一个本体。为了提高本体的表达性，本文发展了一种基于圆锥体的嵌入理论:它至少适用于描述逻辑ALC中表达的本体，其中包括有限存在量词和全称量词，以及概念否定和概念析取。为了将ALC的经典Tarskian风格语义与三元组的子符号表示结合起来，我们使用了ALC本体的几何模型的概念，并作为我们的主要结果之一，证明了ALC本体在经典意义上是可满足的，如果它是基于锥体的几何模型可满足的。如果将几何模型视为部分模型，甚至可以选择忠实的模型，即反映本体捕获的所有知识。我们引入了轴向锥类，并证明了在锥上解释的任何分配逻辑(如ALC)的模简单几何运算都使用了这类锥。从知识图嵌入的机器学习角度来看，锥体也很有吸引力，因为它们可以应用锥体优化技术。

{"title":"Embedding Ontologies in the Description Logic ALC by Axis-Aligned Cones","authors":"Özgür Lütfü Özcep, Mena Leemhuis, Diedrich Wolter","doi":"10.1613/jair.1.13939","DOIUrl":"https://doi.org/10.1613/jair.1.13939","url":null,"abstract":"This paper is concerned with knowledge graph embedding with background knowledge, taking the formal perspective of logics. In knowledge graph embedding, knowledge— expressed as a set of triples of the form (a R b) (“a is R-related to b”)—is embedded into a real-valued vector space. The embedding helps exploiting geometrical regularities of the space in order to tackle typical inductive tasks of machine learning such as link prediction. Recent embedding approaches also consider incorporating background knowledge, in which the intended meanings of the symbols a, R, b are further constrained via axioms of a theory. Of particular interest are theories expressed in a formal language with a neat semantics and a good balance between expressivity and feasibility. In that case, the knowledge graph together with the background can be considered to be an ontology. This paper develops a cone-based theory for embedding in order to advance the expressivity of the ontology: it works (at least) with ontologies expressed in the description logic ALC, which comprises restricted existential and universal quantifiers, as well as concept negation and concept disjunction. In order to align the classical Tarskian Style semantics for ALC with the sub-symbolic representation of triples, we use the notion of a geometric model of an ALC ontology and show, as one of our main results, that an ALC ontology is satisfiable in the classical sense iff it is satisfiable by a geometric model based on cones. The geometric model, if treated as a partial model, can even be chosen to be faithful, i.e., to reflect all and only the knowledge captured by the ontology. We introduce the class of axis-aligned cones and show that modulo simple geometric operations any distributive logic (such as ALC) interpreted over cones employs this class of cones. Cones are also attractive from a machine learning perspective on knowledge graph embeddings since they give rise to applying conic optimization techniques.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"59 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135413780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Amortized Variational Inference: A Systematic Review 平摊变分推理:系统回顾

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-10-15 DOI: 10.1613/jair.1.14258

Ankush Ganguly, Sanjana Jain, Ukrit Watchareeruetai

The core principle of Variational Inference (VI) is to convert the statistical inference problem of computing complex posterior probability densities into a tractable optimization problem. This property enables VI to be faster than several sampling-based techniques. However, the traditional VI algorithm is not scalable to large data sets and is unable to readily infer out-of-bounds data points without re-running the optimization process. Recent developments in the field, like stochastic-, black box-, and amortized-VI, have helped address these issues. Generative modeling tasks nowadays widely make use of amortized VI for its efficiency and scalability, as it utilizes a parameterized function to learn the approximate posterior density parameters. In this paper, we review the mathematical foundations of various VI techniques to form the basis for understanding amortized VI. Additionally, we provide an overview of the recent trends that address several issues of amortized VI, such as the amortization gap, generalization issues, inconsistent representation learning, and posterior collapse. Finally, we analyze alternate divergence measures that improve VI optimization.

变分推理(VI)的核心原理是将计算复杂后验概率密度的统计推理问题转化为可处理的优化问题。这个属性使VI比一些基于采样的技术更快。然而，传统的VI算法无法扩展到大型数据集，并且在不重新运行优化过程的情况下无法轻松推断出越界数据点。该领域的最新发展，如随机-、黑盒-和平摊- vi，有助于解决这些问题。由于平摊VI算法利用参数化函数来学习近似后验密度参数，因此由于其效率和可扩展性，生成建模任务广泛使用平摊VI算法。在本文中，我们回顾了各种VI技术的数学基础，以形成理解平摊VI的基础。此外，我们概述了解决平摊VI的几个问题的最新趋势，如平摊差距、泛化问题、不一致表示学习和后向崩溃。最后，我们分析了改进VI优化的备选发散措施。

{"title":"Amortized Variational Inference: A Systematic Review","authors":"Ankush Ganguly, Sanjana Jain, Ukrit Watchareeruetai","doi":"10.1613/jair.1.14258","DOIUrl":"https://doi.org/10.1613/jair.1.14258","url":null,"abstract":"The core principle of Variational Inference (VI) is to convert the statistical inference problem of computing complex posterior probability densities into a tractable optimization problem. This property enables VI to be faster than several sampling-based techniques. However, the traditional VI algorithm is not scalable to large data sets and is unable to readily infer out-of-bounds data points without re-running the optimization process. Recent developments in the field, like stochastic-, black box-, and amortized-VI, have helped address these issues. Generative modeling tasks nowadays widely make use of amortized VI for its efficiency and scalability, as it utilizes a parameterized function to learn the approximate posterior density parameters. In this paper, we review the mathematical foundations of various VI techniques to form the basis for understanding amortized VI. Additionally, we provide an overview of the recent trends that address several issues of amortized VI, such as the amortization gap, generalization issues, inconsistent representation learning, and posterior collapse. Finally, we analyze alternate divergence measures that improve VI optimization.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135758883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Clustering what Matters: Optimal Approximation for Clustering with Outliers 重要的聚类:与离群值聚类的最优逼近

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-09-15 DOI: 10.1613/jair.1.14883

Akanksha Agrawal, Tanmay Inamdar, Saket Saurabh, Jie Xue

Clustering with outliers is one of the most fundamental problems in Computer Science. Given a set X of n points and two numbers k, m, the clustering with outliers aims to exclude m points from X and partition the remaining points into k clusters that minimizes a certain cost function. In this paper, we give a general approach for solving clustering with outliers, which results in a fixed-parameter tractable (FPT) algorithm in k and m—i.e., an algorithm with running time of the form f(k, m) · nO(1) for some function f—that almost matches the approximation ratio for its outlier-free counterpart. As a corollary, we obtain FPT approximation algorithms with optimal approximation ratios for k-Median and k-Means with outliers in general and Euclidean metrics. We also exhibit more applications of our approach to other variants of the problem that impose additional constraints on the clustering, such as fairness or matroid constraints.

异常值聚类是计算机科学中最基本的问题之一。给定一个n个点的集合X和两个数字k, m，带离群点聚类的目的是从X中排除m个点，并将剩下的点划分到k个最小化某个代价函数的聚类中。本文给出了一种求解具有离群值的聚类问题的一般方法，从而得到了k和m -即的固定参数可处理(FPT)算法。对于某些函数f，它的运行时间形式为f(k, m)·nO(1)，几乎与它的无离群值对应的近似比匹配。作为推论，我们得到了一般和欧几里得度量中具有离群值的k-Median和k-Means的最优近似比的FPT近似算法。我们还展示了我们的方法在问题的其他变体上的更多应用，这些变体对聚类施加了额外的约束，例如公平性或矩阵约束。

引用次数: 0

Exploiting Functional Constraints in Automatic Dominance Breaking for Constraint Optimization 利用自动优势打破中的功能约束进行约束优化

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-09-13 DOI: 10.1613/jair.1.14714

Jimmy H.M. Lee, Allen Z. Zhong

Dominance breaking is a powerful technique in improving the solving efficiency of Constraint Optimization Problems (COPs) by removing provably suboptimal solutions with additional constraints. While dominance breaking is effective in a range of practical problems, it is usually problem specific and requires human insights into problem structures to come up with correct dominance breaking constraints. Recently, a framework is proposed to generate nogood constraints automatically for dominance breaking, which formulates nogood generation as solving auxiliary Constraint Satisfaction Problems (CSPs). However, the framework uses a pattern matching approach to synthesize the auxiliary generation CSPs from the specific forms of objectives and constraints in target COPs, and is only applicable to a limited class of COPs. This paper proposes a novel rewriting system to derive constraints for the auxiliary generation CSPs automatically from COPs with nested function calls, significantly generalizing the original framework. In particular, the rewriting system exploits functional constraints flattened from nested functions in a high-level modeling language. To generate more effective dominance breaking nogoods and derive more relaxed constraints in generation CSPs, we further characterize how to extend the system with rewriting rules exploiting function properties, such as monotonicity, commutativity, and associativity, for specific functional constraints. Experimentation shows significant runtime speedup using the dominance breaking nogoods generated by our proposed method. Studying patterns of generated nogoods also demonstrates that our proposal can reveal dominance relations in the literature and discover new dominance relations on problems with ineffective or no known dominance breaking constraints.

优势打破是一种有效的技术，它通过去除带有附加约束的可证明次优解来提高约束优化问题的求解效率。虽然打破支配地位在一系列实际问题中是有效的，但它通常是特定问题，需要人类对问题结构的洞察力来提出正确的打破支配地位的约束。最近，提出了一种自动生成非优约束的框架，将非优约束生成表述为求解辅助约束满足问题(csp)。然而，该框架使用模式匹配方法从目标cop中的特定目标和约束形式合成辅助生成csp，并且仅适用于有限类别的cop。本文提出了一种新的重写系统，用于从嵌套函数调用的cop中自动生成辅助生成csp的约束，显著地推广了原框架。特别是，重写系统利用了高级建模语言中嵌套函数的平面化功能约束。为了生成更有效的优势打破无商品并在生成csp中推导出更宽松的约束，我们进一步描述了如何使用重写规则来扩展系统，利用函数属性，如单调性，交换性和结合性，用于特定的功能约束。实验表明，使用我们提出的方法生成的优势打破无商品显著加快运行时间。对无商品生成模式的研究也表明，我们的建议可以揭示文献中的优势关系，并在无效或未知优势打破约束的问题上发现新的优势关系。

{"title":"Exploiting Functional Constraints in Automatic Dominance Breaking for Constraint Optimization","authors":"Jimmy H.M. Lee, Allen Z. Zhong","doi":"10.1613/jair.1.14714","DOIUrl":"https://doi.org/10.1613/jair.1.14714","url":null,"abstract":"Dominance breaking is a powerful technique in improving the solving efficiency of Constraint Optimization Problems (COPs) by removing provably suboptimal solutions with additional constraints. While dominance breaking is effective in a range of practical problems, it is usually problem specific and requires human insights into problem structures to come up with correct dominance breaking constraints. Recently, a framework is proposed to generate nogood constraints automatically for dominance breaking, which formulates nogood generation as solving auxiliary Constraint Satisfaction Problems (CSPs). However, the framework uses a pattern matching approach to synthesize the auxiliary generation CSPs from the specific forms of objectives and constraints in target COPs, and is only applicable to a limited class of COPs. This paper proposes a novel rewriting system to derive constraints for the auxiliary generation CSPs automatically from COPs with nested function calls, significantly generalizing the original framework. In particular, the rewriting system exploits functional constraints flattened from nested functions in a high-level modeling language. To generate more effective dominance breaking nogoods and derive more relaxed constraints in generation CSPs, we further characterize how to extend the system with rewriting rules exploiting function properties, such as monotonicity, commutativity, and associativity, for specific functional constraints. Experimentation shows significant runtime speedup using the dominance breaking nogoods generated by our proposed method. Studying patterns of generated nogoods also demonstrates that our proposal can reveal dominance relations in the literature and discover new dominance relations on problems with ineffective or no known dominance breaking constraints.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135781109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Benchmark Study on Knowledge Graphs Enrichment and Pruning Methods in the Presence of Noisy Relationships 存在噪声关系的知识图充实与剪枝方法的基准研究

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-09-13 DOI: 10.1613/jair.1.14494

Stefano Faralli, Andrea Lenzi, Paola Velardi

In the past few years, knowledge graphs (KGs), as a form of structured human intelligence, have attracted considerable research attention from academia and industry. In this very active field of study, a widely explored problem is that of link prediction, the task of predicting whether two nodes should be connected, based on node attributes and local or global graph connectivity properties. The state of the art in this area is represented by techniques based on graph embeddings. However, KGs, especially those acquired using automated or partly automated techniques, are often riddled with noise, e.g., wrong relationships, which makes the problem of link deletion as important as that of link prediction. In this paper, we address three main research questions. The first is about the true effectiveness of different knowledge graph embedding models under the presence of an increasing number of wrong links. The second is to asses if methods that can predict unknown relationships effectively, work equally well in recognizing incorrect relations. The third is to verify if there are systems robust enough to maintain primacy in all experimental conditions. To answer these research questions, we performed a systematic benchmark study in which the experimental setting includes ten state-of-the-art models, three common KG datasets with different structural properties and three downstream tasks: the widely explored tasks of link prediction and triple classification, and the less popular task of link deletion. Comparative studies often yield contradictory results, where the same systems score better or worse depending on the experimental context. In our work, in order to facilitate the discovery of clear performance patterns and their interpretation, we select and/or aggregate performance data to highlight each specific comparison dimension: dataset complexity, type of task, category of models, and robustness against noise.

在过去的几年里，知识图谱作为一种结构化的人类智能形式，引起了学术界和工业界的广泛关注。在这个非常活跃的研究领域中，一个被广泛探索的问题是链路预测，即基于节点属性和局部或全局图连接属性来预测两个节点是否应该连接。该领域的最新技术是基于图嵌入的技术。然而，KGs，特别是那些使用自动化或部分自动化技术获得的KGs，往往充满了噪音，例如，错误的关系，这使得链接删除问题与链接预测问题一样重要。在本文中，我们解决了三个主要的研究问题。第一个问题是在错误链接不断增加的情况下，不同知识图嵌入模型的真实有效性。第二步是评估那些能够有效预测未知关系的方法在识别错误关系时是否同样有效。第三是验证是否存在足够健壮的系统，可以在所有实验条件下保持首要地位。为了回答这些研究问题，我们进行了系统的基准研究，实验设置包括10个最先进的模型，3个具有不同结构属性的常见KG数据集和3个下游任务:广泛探索的链接预测和三重分类任务，以及不太受欢迎的链接删除任务。比较研究往往会产生相互矛盾的结果，同一系统的得分是高是低取决于实验背景。在我们的工作中，为了便于发现清晰的性能模式及其解释，我们选择和/或汇总性能数据以突出每个特定的比较维度:数据集复杂性、任务类型、模型类别和抗噪声鲁棒性。

{"title":"A Benchmark Study on Knowledge Graphs Enrichment and Pruning Methods in the Presence of Noisy Relationships","authors":"Stefano Faralli, Andrea Lenzi, Paola Velardi","doi":"10.1613/jair.1.14494","DOIUrl":"https://doi.org/10.1613/jair.1.14494","url":null,"abstract":"In the past few years, knowledge graphs (KGs), as a form of structured human intelligence, have attracted considerable research attention from academia and industry. In this very active field of study, a widely explored problem is that of link prediction, the task of predicting whether two nodes should be connected, based on node attributes and local or global graph connectivity properties. The state of the art in this area is represented by techniques based on graph embeddings. However, KGs, especially those acquired using automated or partly automated techniques, are often riddled with noise, e.g., wrong relationships, which makes the problem of link deletion as important as that of link prediction. In this paper, we address three main research questions. The first is about the true effectiveness of different knowledge graph embedding models under the presence of an increasing number of wrong links. The second is to asses if methods that can predict unknown relationships effectively, work equally well in recognizing incorrect relations. The third is to verify if there are systems robust enough to maintain primacy in all experimental conditions. To answer these research questions, we performed a systematic benchmark study in which the experimental setting includes ten state-of-the-art models, three common KG datasets with different structural properties and three downstream tasks: the widely explored tasks of link prediction and triple classification, and the less popular task of link deletion. Comparative studies often yield contradictory results, where the same systems score better or worse depending on the experimental context. In our work, in order to facilitate the discovery of clear performance patterns and their interpretation, we select and/or aggregate performance data to highlight each specific comparison dimension: dataset complexity, type of task, category of models, and robustness against noise.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135781105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sequence-Oriented Diagnosis of Discrete-Event Systems 离散事件系统的序列诊断

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-09-13 DOI: 10.1613/jair.1.14630

Gianfranco Lamperti, Stefano Trerotola, Marina Zanella, Xiangfu Zhao

Model-based diagnosis has always been conceived as set-oriented, meaning that a candidate is a set of faults, or faulty components, that explains a collection of observations. This perspective applies equally to both static and dynamical systems. Diagnosis of discrete-event systems (DESs) is no exception: a candidate is traditionally a set of faults, or faulty events, occurring in a trajectory of the DES that conforms with a given sequence of observations. As such, a candidate does not embed any temporal relationship among faults, nor does it account for multiple occurrences of the same fault. To improve diagnostic explanation and support decision making, a sequence-oriented perspective to diagnosis of DESs is presented, where a candidate is a sequence of faults occurring in a trajectory of the DES, called a fault sequence. Since a fault sequence is possibly unbounded, as the same fault may occur an unlimited number of times in the trajectory, the set of (output) candidates may be unbounded also, which contrasts with set-oriented diagnosis, where the set of candidates is bounded by the powerset of the domain of faults. Still, a possibly unbounded set of fault sequences is shown to be a regular language, which can be defined by a regular expression over the domain of faults, a property that makes sequence-oriented diagnosis feasible in practice. The task of monitoring-based diagnosis is considered, where a new candidate set is generated at the occurrence of each observation. The approach is based on three different techniques: .1/ blind diagnosis, with no compiled knowledge, .2/ greedy diagnosis, with total knowledge compilation, and .3/ lazy diagnosis, with partial knowledge compilation. By knowledge we mean a data structure slightly similar to a classical DES diagnoser, which can be generated (compiled) either entirely offline (greedy diagnosis) or incrementally online (lazy diagnosis). Experimental evidence suggests that, among these techniques, only lazy diagnosis may be viable in non-trivial application domains.

基于模型的诊断一直被认为是面向集合的，这意味着候选是一组故障或故障组件，它解释了一组观察结果。这个观点同样适用于静态和动态系统。离散事件系统(DESs)的诊断也不例外:候选系统通常是一组故障或故障事件，发生在符合给定观测序列的DESs轨迹中。因此，候选模型不嵌入故障之间的任何时间关系，也不考虑同一故障的多次出现。为了提高诊断解释和支持决策，提出了一种面向序列的诊断视角，其中候选是在DES轨迹中发生的故障序列，称为故障序列。由于故障序列可能是无界的，因为相同的故障可能在轨迹中出现无限次，因此候选集合也可能是无界的，这与面向集的诊断相反，面向集的诊断是由故障域的幂集限定的。然而，一个可能无界的故障序列集被证明是一种规则语言，它可以用故障域上的正则表达式定义，这一特性使得面向序列的诊断在实践中是可行的。考虑了基于监测的诊断任务，在每次观测发生时生成一个新的候选集。该方法基于三种不同的技术:1/盲诊断，不编译知识;2/贪婪诊断，全知识编译;3/懒惰诊断，部分知识编译。我们所说的知识是指与经典DES诊断程序稍微相似的数据结构，它既可以完全脱机(贪婪诊断)生成(编译)，也可以增量在线(惰性诊断)生成(编译)。实验证据表明，在这些技术中，只有惰性诊断可能在重要的应用领域是可行的。

{"title":"Sequence-Oriented Diagnosis of Discrete-Event Systems","authors":"Gianfranco Lamperti, Stefano Trerotola, Marina Zanella, Xiangfu Zhao","doi":"10.1613/jair.1.14630","DOIUrl":"https://doi.org/10.1613/jair.1.14630","url":null,"abstract":"Model-based diagnosis has always been conceived as set-oriented, meaning that a candidate is a set of faults, or faulty components, that explains a collection of observations. This perspective applies equally to both static and dynamical systems. Diagnosis of discrete-event systems (DESs) is no exception: a candidate is traditionally a set of faults, or faulty events, occurring in a trajectory of the DES that conforms with a given sequence of observations. As such, a candidate does not embed any temporal relationship among faults, nor does it account for multiple occurrences of the same fault. To improve diagnostic explanation and support decision making, a sequence-oriented perspective to diagnosis of DESs is presented, where a candidate is a sequence of faults occurring in a trajectory of the DES, called a fault sequence. Since a fault sequence is possibly unbounded, as the same fault may occur an unlimited number of times in the trajectory, the set of (output) candidates may be unbounded also, which contrasts with set-oriented diagnosis, where the set of candidates is bounded by the powerset of the domain of faults. Still, a possibly unbounded set of fault sequences is shown to be a regular language, which can be defined by a regular expression over the domain of faults, a property that makes sequence-oriented diagnosis feasible in practice. The task of monitoring-based diagnosis is considered, where a new candidate set is generated at the occurrence of each observation. The approach is based on three different techniques: .1/ blind diagnosis, with no compiled knowledge, .2/ greedy diagnosis, with total knowledge compilation, and .3/ lazy diagnosis, with partial knowledge compilation. By knowledge we mean a data structure slightly similar to a classical DES diagnoser, which can be generated (compiled) either entirely offline (greedy diagnosis) or incrementally online (lazy diagnosis). Experimental evidence suggests that, among these techniques, only lazy diagnosis may be viable in non-trivial application domains.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135781108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Exploiting Functional Constraints in Automatic Dominance Breaking for Constraint Optimization 利用自动优势打破中的功能约束进行约束优化

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-09-13 DOI: 10.4230/LIPIcs.CP.2022.31

Jimmy Ho-man Lee, Allen Z. Zhong

Dominance breaking is a powerful technique in improving the solving efficiency of Constraint Optimization Problems (COPs) by removing provably suboptimal solutions with additional constraints. While dominance breaking is effective in a range of practical problems, it is usually problem specific and requires human insights into problem structures to come up with correct dominance breaking constraints. Recently, a framework is proposed to generate nogood constraints automatically for dominance breaking, which formulates nogood generation as solving auxiliary Constraint Satisfaction Problems (CSPs). However, the framework uses a pattern matching approach to synthesize the auxiliary generation CSPs from the specific forms of objectives and constraints in target COPs, and is only applicable to a limited class of COPs.This paper proposes a novel rewriting system to derive constraints for the auxiliary generation CSPs automatically from COPs with nested function calls, significantly generalizing the original framework. In particular, the rewriting system exploits functional constraints flattened from nested functions in a high-level modeling language. To generate more effective dominance breaking nogoods and derive more relaxed constraints in generation CSPs, we further characterize how to extend the system with rewriting rules exploiting function properties, such as monotonicity, commutativity, and associativity, for specific functional constraints. Experimentation shows significant runtime speedup using the dominance breaking nogoods generated by our proposed method. Studying patterns of generated nogoods also demonstrates that our proposal can reveal dominance relations in the literature and discover new dominance relations on problems with ineffective or no known dominance breaking constraints.

优势打破是一种有效的技术，它通过去除带有附加约束的可证明次优解来提高约束优化问题的求解效率。虽然打破支配地位在一系列实际问题中是有效的，但它通常是特定问题，需要人类对问题结构的洞察力来提出正确的打破支配地位的约束。最近，提出了一种自动生成非优约束的框架，将非优约束生成表述为求解辅助约束满足问题(csp)。然而，该框架使用模式匹配方法从目标cop中的特定目标和约束形式合成辅助生成csp，并且仅适用于有限类别的cop。本文提出了一种新的重写系统，用于从嵌套函数调用的cop中自动生成辅助生成csp的约束，显著地推广了原框架。特别是，重写系统利用了高级建模语言中嵌套函数的平面化功能约束。为了生成更有效的优势打破无商品并在生成csp中推导出更宽松的约束，我们进一步描述了如何使用重写规则来扩展系统，利用函数属性，如单调性，交换性和结合性，用于特定的功能约束。实验表明，使用我们提出的方法生成的优势打破无商品显著加快运行时间。对无商品生成模式的研究也表明，我们的建议可以揭示文献中的优势关系，并在无效或未知优势打破约束的问题上发现新的优势关系。

{"title":"Exploiting Functional Constraints in Automatic Dominance Breaking for Constraint Optimization","authors":"Jimmy Ho-man Lee, Allen Z. Zhong","doi":"10.4230/LIPIcs.CP.2022.31","DOIUrl":"https://doi.org/10.4230/LIPIcs.CP.2022.31","url":null,"abstract":"Dominance breaking is a powerful technique in improving the solving efficiency of Constraint Optimization Problems (COPs) by removing provably suboptimal solutions with additional constraints. While dominance breaking is effective in a range of practical problems, it is usually problem specific and requires human insights into problem structures to come up with correct dominance breaking constraints. Recently, a framework is proposed to generate nogood constraints automatically for dominance breaking, which formulates nogood generation as solving auxiliary Constraint Satisfaction Problems (CSPs). However, the framework uses a pattern matching approach to synthesize the auxiliary generation CSPs from the specific forms of objectives and constraints in target COPs, and is only applicable to a limited class of COPs.\u0000This paper proposes a novel rewriting system to derive constraints for the auxiliary generation CSPs automatically from COPs with nested function calls, significantly generalizing the original framework. In particular, the rewriting system exploits functional constraints flattened from nested functions in a high-level modeling language. To generate more effective dominance breaking nogoods and derive more relaxed constraints in generation CSPs, we further characterize how to extend the system with rewriting rules exploiting function properties, such as monotonicity, commutativity, and associativity, for specific functional constraints. Experimentation shows significant runtime speedup using the dominance breaking nogoods generated by our proposed method. Studying patterns of generated nogoods also demonstrates that our proposal can reveal dominance relations in the literature and discover new dominance relations on problems with ineffective or no known dominance breaking constraints.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"17 1","pages":"1-35"},"PeriodicalIF":5.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75696952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Prediction of Social Dynamic Agents and Long-Tailed Learning Challenges: A Survey 社会动态代理预测与长尾学习挑战研究

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-08-29 DOI: 10.1613/jair.1.14749

Divya Thuremella, L. Kunze

Autonomous robots that can perform common tasks like driving, surveillance, and chores have the biggest potential for impact due to frequency of usage, and the biggest potential for risk due to direct interaction with humans. These tasks take place in openended environments where humans socially interact and pursue their goals in complex and diverse ways. To operate in such environments, such systems must predict this behaviour, especially when the behavior is unexpected and potentially dangerous. Therefore, we summarize trends in various types of tasks, modeling methods, datasets, and social interaction modules aimed at predicting the future location of dynamic, socially interactive agents. Furthermore, we describe long-tailed learning techniques from classification and regression problems that can be applied to prediction problems. To our knowledge this is the first work that reviews social interaction modeling within prediction, and long-tailed learning techniques within regression and prediction.

可以执行驾驶、监视和家务等常见任务的自主机器人由于使用频率而具有最大的影响潜力，并且由于与人类直接互动而具有最大的风险潜力。这些任务发生在开放的环境中，在这些环境中，人类以复杂而多样的方式进行社会互动和追求目标。为了在这样的环境中运行，这样的系统必须预测这种行为，特别是当这种行为是意外的和潜在的危险时。因此，我们总结了各种类型的任务、建模方法、数据集和社会交互模块的趋势，旨在预测动态、社会交互代理的未来位置。此外，我们从分类和回归问题中描述了可以应用于预测问题的长尾学习技术。据我们所知，这是第一次回顾预测中的社会互动建模，以及回归和预测中的长尾学习技术。

引用次数: 0

Automatically Finding the Right Probabilities in Bayesian Networks 在贝叶斯网络中自动寻找正确概率

IF 5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-08-27 DOI: 10.1613/jair.1.14044

Bahar Salmani, J. Katoen

This paper presents alternative techniques for inference on classical Bayesian networks in which all probabilities are fixed, and for synthesis problems when conditional probability tables (CPTs) in such networks contain symbolic parameters rather than concrete probabilities. The key idea is to exploit probabilistic model checking as well as its recent extension to parameter synthesis techniques for parametric Markov chains. To enable this, the Bayesian networks are transformed into Markov chains and their objectives are mapped onto probabilistic temporal logic formulas. For exact inference, we compare probabilistic model checking to weighted model counting on various Bayesian network benchmarks. We contrast symbolic model checking using multi-terminal binary (aka: algebraic) decision diagrams to symbolic inference using proba- bilistic sentential decision diagrams, symbolic data structures that are tailored to Bayesian networks. For the parametric setting, we describe how our techniques can be used for various synthesis problems such as computing sensitivity functions (and values), simple and difference parameter tuning and ratio parameter tuning. Our parameter synthesis techniques are applicable to arbitrarily many, possibly dependent, parameters that may occur in multiple CPTs. This lifts restrictions, e.g., on the number of parametrized CPTs, or on parameter dependencies between several CPTs, that exist in the literature. Experiments on several benchmarks show that our parameter synthesis techniques can treat parameter synthesis for Bayesian networks (with hundreds of unknown parameters) that are out of reach for existing techniques.

本文提出了对经典贝叶斯网络进行推理的替代技术，其中所有概率都是固定的，以及当这种网络中的条件概率表(cpt)包含符号参数而不是具体概率时的综合问题。关键思想是利用概率模型检验及其最近扩展到参数马尔可夫链的参数综合技术。为了实现这一点，贝叶斯网络被转换成马尔可夫链，它们的目标被映射到概率时间逻辑公式。为了精确推断，我们在各种贝叶斯网络基准上比较了概率模型检查和加权模型计数。我们对比了使用多终端二进制(又名:代数)决策图的符号模型检查和使用概率-双向句子决策图的符号推理，这是为贝叶斯网络量身定制的符号数据结构。对于参数设置，我们描述了如何将我们的技术用于各种综合问题，例如计算灵敏度函数(和值)，简单和差分参数调谐以及比率参数调谐。我们的参数合成技术适用于可能出现在多个cpt中的任意多个可能相关的参数。这解除了文献中存在的限制，例如，对参数化cpt的数量，或对几个cpt之间的参数依赖性。在几个基准测试上的实验表明，我们的参数合成技术可以处理现有技术无法达到的贝叶斯网络(具有数百个未知参数)的参数合成。

{"title":"Automatically Finding the Right Probabilities in Bayesian Networks","authors":"Bahar Salmani, J. Katoen","doi":"10.1613/jair.1.14044","DOIUrl":"https://doi.org/10.1613/jair.1.14044","url":null,"abstract":"This paper presents alternative techniques for inference on classical Bayesian networks in which all probabilities are fixed, and for synthesis problems when conditional probability tables (CPTs) in such networks contain symbolic parameters rather than concrete probabilities. The key idea is to exploit probabilistic model checking as well as its recent extension to parameter synthesis techniques for parametric Markov chains. To enable this, the Bayesian networks are transformed into Markov chains and their objectives are mapped onto probabilistic temporal logic formulas. \u0000For exact inference, we compare probabilistic model checking to weighted model counting on various Bayesian network benchmarks. We contrast symbolic model checking using multi-terminal binary (aka: algebraic) decision diagrams to symbolic inference using proba- bilistic sentential decision diagrams, symbolic data structures that are tailored to Bayesian networks. \u0000For the parametric setting, we describe how our techniques can be used for various synthesis problems such as computing sensitivity functions (and values), simple and difference parameter tuning and ratio parameter tuning. Our parameter synthesis techniques are applicable to arbitrarily many, possibly dependent, parameters that may occur in multiple CPTs. This lifts restrictions, e.g., on the number of parametrized CPTs, or on parameter dependencies between several CPTs, that exist in the literature. Experiments on several benchmarks show that our parameter synthesis techniques can treat parameter synthesis for Bayesian networks (with hundreds of unknown parameters) that are out of reach for existing techniques.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"77 1","pages":"1637-1696"},"PeriodicalIF":5.0,"publicationDate":"2023-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67392862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Classes of Hard Formulas for QBF Resolution 求解QBF的硬公式类

3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research

Pub Date : 2023-08-14 DOI: 10.1613/jair.1.14710

Agnes Schleitzer, Olaf Beyersdorff

To date, we know only a few handcrafted quantified Boolean formulas (QBFs) that are hard for central QBF resolution systems such as Q-Res and QU-Res, and only one specific QBF family to separate Q-Res and QU-Res. Here we provide a general method to construct hard formulas for Q-Res and QU-Res. The construction uses simple propositional formulas (e.g. minimally unsatisfiable formulas) in combination with easy QBF gadgets (Σb2 formulas without constant winning strategies). This leads to a host of new hard formulas, including new classes of hard random QBFs. We further present generic constructions for formulas separating Q-Res and QU-Res, and for separating Q-Res and LD-Q-Res.

到目前为止，我们只知道一些手工制作的量化布尔公式(QBF)，这些公式很难用于Q-Res和Q-Res等中央QBF分解系统，并且只有一个特定的QBF家族来分离Q-Res和Q-Res。本文给出了构造Q-Res和Q-Res硬公式的一般方法。结构使用简单的命题公式(例如，最小不满意公式)与简单的QBF小工具(Σb2公式没有恒定的获胜策略)相结合。这导致了一系列新的硬公式，包括硬随机qbf的新类别。我们进一步给出了Q-Res和Q-Res以及Q-Res和LD-Q-Res分离公式的一般结构。

引用次数: 0