首页 > 最新文献

Artificial Intelligence最新文献

英文 中文
Temporal segmentation in multi agent path finding with applications to explainability 多代理路径查找中的时间分割及其在可解释性中的应用
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-07 DOI: 10.1016/j.artint.2024.104087
Shaull Almagor , Justin Kottinger , Morteza Lahijanian

Multi-Agent Path Finding (MAPF) is the problem of planning paths for agents to reach their targets from their start locations, such that the agents do not collide while executing the plan. In many settings, the plan (or a digest thereof) is conveyed to a supervising entity, e.g., for confirmation before execution, for a report, etc. In such cases, we wish to convey that the plan is collision-free with minimal amount of information. To this end, we propose an explanation scheme for MAPF. The scheme decomposes a plan into segments such that within each segment, the agents' paths are disjoint. We can then convey the plan whilst convincing that it is collision-free, using a small number of frames (dubbed an explanation). We can also measure the simplicity of a plan by the number of segments required for the decomposition. We study the complexity of algorithmic problems that arise by the explanation scheme and the tradeoff between the length (makespan) of a plan and its minimal decomposition. We also introduce two centralized (i.e. runs on a single CPU with full knowledge of the multi-agent system) algorithms for planning with explanations. One is based on a coupled search algorithm similar to A, and the other is a decoupled method based on Conflict-Based Search (CBS). We refer to the latter as Explanation-Guided CBS (XG-CBS), which uses a low-level search for individual agents and maintains a high-level conflict tree to guide the low-level search to avoid collisions as well as increasing the number of segments. We propose four approaches to the low-level search of XG-CBS by modifying A for explanations and analyze their effects on the completeness of XG-CBS. Finally, we highlight important aspects of the proposed explanation scheme in various MAPF problems and empirically evaluate the performance of the proposed planning algorithms in a series of benchmark problems.

多代理路径查找(MAPF)是为代理规划从其起始位置到达目标的路径,从而使代理在执行计划时不会发生碰撞的问题。在许多情况下,计划(或其摘要)会被传达给一个监督实体,例如,在执行前进行确认、提交报告等。在这种情况下,我们希望以最少的信息量传达计划是无碰撞的。为此,我们提出了一种 MAPF 解释方案。该方案将计划分解成若干段,在每一段中,代理的路径都是不相交的。这样,我们就能用少量的帧来传达计划,同时让人相信它是无碰撞的(称为解释)。我们还可以通过分解所需的分段数量来衡量计划的简单程度。我们研究了解释方案带来的算法问题的复杂性,以及计划长度(makespan)和最小分解之间的权衡。我们还介绍了两种集中式(即在一个中央处理器上运行,且完全了解多代理系统)算法,用于进行带解释的规划。一种是基于类似于 A⁎ 的耦合搜索算法,另一种是基于冲突搜索(CBS)的解耦方法。我们将后者称为 "解释引导的 CBS(XG-CBS)",它对单个代理使用低层搜索,并维护高层冲突树来引导低层搜索,以避免碰撞并增加片段数量。我们提出了四种通过修改 A⁎ 来解释 XG-CBS 低层搜索的方法,并分析了它们对 XG-CBS 完整性的影响。最后,我们强调了在各种 MAPF 问题中建议的解释方案的重要方面,并在一系列基准问题中对建议的规划算法的性能进行了实证评估。
{"title":"Temporal segmentation in multi agent path finding with applications to explainability","authors":"Shaull Almagor ,&nbsp;Justin Kottinger ,&nbsp;Morteza Lahijanian","doi":"10.1016/j.artint.2024.104087","DOIUrl":"10.1016/j.artint.2024.104087","url":null,"abstract":"<div><p>Multi-Agent Path Finding (MAPF) is the problem of planning paths for agents to reach their targets from their start locations, such that the agents do not collide while executing the plan. In many settings, the plan (or a digest thereof) is conveyed to a supervising entity, e.g., for confirmation before execution, for a report, etc. In such cases, we wish to convey that the plan is collision-free with minimal amount of information. To this end, we propose an <em>explanation scheme</em> for MAPF. The scheme decomposes a plan into segments such that within each segment, the agents' paths are disjoint. We can then convey the plan whilst convincing that it is collision-free, using a small number of frames (dubbed an <em>explanation</em>). We can also measure the simplicity of a plan by the number of segments required for the decomposition. We study the complexity of algorithmic problems that arise by the explanation scheme and the tradeoff between the length (makespan) of a plan and its minimal decomposition. We also introduce two centralized (i.e. runs on a single CPU with full knowledge of the multi-agent system) algorithms for planning with explanations. One is based on a coupled search algorithm similar to A<sup>⁎</sup>, and the other is a decoupled method based on Conflict-Based Search (CBS). We refer to the latter as <em>Explanation-Guided CBS</em> (XG-CBS), which uses a low-level search for individual agents and maintains a high-level conflict tree to guide the low-level search to avoid collisions as well as increasing the number of segments. We propose four approaches to the low-level search of XG-CBS by modifying A<sup>⁎</sup> for explanations and analyze their effects on the completeness of XG-CBS. Finally, we highlight important aspects of the proposed explanation scheme in various MAPF problems and empirically evaluate the performance of the proposed planning algorithms in a series of benchmark problems.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139832974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An extended view on lifting Gaussian Bayesian networks 关于提升高斯贝叶斯网络的扩展观点
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-06 DOI: 10.1016/j.artint.2024.104082
Mattis Hartwig , Ralf Möller , Tanya Braun

Lifting probabilistic graphical models and developing lifted inference algorithms aim to use higher level groups of random variables instead of individual instances. In the past, many inference algorithms for discrete probabilistic graphical models have been lifted. Lifting continuous probabilistic graphical models has played a minor role. Since many real-world applications involve continuous random variables, this article turns its focus to lifting approaches for Gaussian Bayesian networks. Specifically, we present algorithms for constructing a lifted joint distribution for scenarios of sequences of overlapping and non-overlapping logical variables. We present operations that work in a fully lifted way including addition, multiplication, and inversion. We present how the operations can be used for lifted query answering algorithms and extend the existing query answering algorithm by a new way of evidence handling. The new way of evidence handling groups evidence that has the same effect on its neighboring variables in cases of partial overlap between the logical-variable sequences. In the theoretical complexity analysis and the experimental evaluation, we show under which conditions the existing lifted approach and the new lifted approach including evidence grouping lead to the most time savings compared to the grounded approach.

提升概率图形模型和开发提升推理算法的目的是使用更高层次的随机变量组,而不是单个实例。过去,许多离散概率图形模型的推理算法都是通过提升来实现的。连续概率图形模型的推理算法只发挥了次要作用。由于现实世界的许多应用涉及连续随机变量,本文将重点转向高斯贝叶斯网络的提升方法。具体来说,我们提出了为重叠和非重叠逻辑变量序列场景构建提升联合分布的算法。我们介绍了以完全提升方式进行的操作,包括加法、乘法和反转。我们介绍了如何将这些运算用于提升查询回答算法,并通过一种新的证据处理方法扩展了现有的查询回答算法。在逻辑变量序列之间部分重叠的情况下,新的证据处理方法将对其相邻变量具有相同影响的证据进行分组。在理论复杂性分析和实验评估中,我们展示了在哪些条件下,现有的提升方法和包含证据分组的新提升方法比基础方法节省了最多的时间。
{"title":"An extended view on lifting Gaussian Bayesian networks","authors":"Mattis Hartwig ,&nbsp;Ralf Möller ,&nbsp;Tanya Braun","doi":"10.1016/j.artint.2024.104082","DOIUrl":"https://doi.org/10.1016/j.artint.2024.104082","url":null,"abstract":"<div><p>Lifting probabilistic graphical models and developing lifted inference algorithms aim to use higher level groups of random variables instead of individual instances. In the past, many inference algorithms for discrete probabilistic graphical models have been lifted. Lifting continuous probabilistic graphical models has played a minor role. Since many real-world applications involve continuous random variables, this article turns its focus to lifting approaches for Gaussian Bayesian networks. Specifically, we present algorithms for constructing a lifted joint distribution for scenarios of sequences of overlapping and non-overlapping logical variables. We present operations that work in a fully lifted way including addition, multiplication, and inversion. We present how the operations can be used for lifted query answering algorithms and extend the existing query answering algorithm by a new way of evidence handling. The new way of evidence handling groups evidence that has the same effect on its neighboring variables in cases of partial overlap between the logical-variable sequences. In the theoretical complexity analysis and the experimental evaluation, we show under which conditions the existing lifted approach and the new lifted approach including evidence grouping lead to the most time savings compared to the grounded approach.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139907571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pre-training and diagnosing knowledge base completion models 预训练和诊断知识库完成模型
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-02 DOI: 10.1016/j.artint.2024.104081
Vid Kocijan , Myeongjun Jang , Thomas Lukasiewicz

In this work, we introduce and analyze an approach to knowledge transfer from one collection of facts to another without the need for entity or relation matching. The method works for both canonicalized knowledge bases and uncanonicalized or open knowledge bases, i.e., knowledge bases where more than one copy of a real-world entity or relation may exist. The main contribution is a method that can make use of large-scale pre-training on facts, which were collected from unstructured text, to improve predictions on structured data from a specific domain. The introduced method is most impactful on small datasets such as ReVerb20k, where a 6% absolute increase of mean reciprocal rank and 65% relative decrease of mean rank over the previously best method was achieved, despite not relying on large pre-trained models like Bert. To understand the obtained pre-trained models better, we then introduce a novel dataset for the analysis of pre-trained models for Open Knowledge Base Completion, called Doge (Diagnostics of Open knowledge Graph Embeddings). It consists of 6 subsets and is designed to measure multiple properties of a pre-trained model: robustness against synonyms, ability to perform deductive reasoning, presence of gender stereotypes, consistency with reverse relations, and coverage of different areas of general knowledge. Using the introduced dataset, we show that the existing OKBC models lack consistency in presence of synonyms and inverse relations and are unable to perform deductive reasoning. Moreover, their predictions often align with gender stereotypes, which persist even when presented with counterevidence. We additionally investigate the role of pre-trained word embeddings and demonstrate that avoiding biased word embeddings is not a sufficient measure to prevent biased behavior of OKBC models.

在这项工作中,我们介绍并分析了一种无需实体或关系匹配即可将知识从一个事实集合转移到另一个事实集合的方法。该方法既适用于规范化知识库,也适用于非规范化或开放式知识库,即现实世界中可能存在不止一个实体或关系副本的知识库。该方法的主要贡献在于,它可以利用从非结构化文本中收集的事实进行大规模预训练,从而改进对特定领域结构化数据的预测。引入的方法对 ReVerb20k 等小型数据集的影响最大,与之前的最佳方法相比,尽管不依赖像 Bert 这样的大型预训练模型,但平均倒数等级的绝对值提高了 6%,平均等级的相对值降低了 65%。为了更好地理解所获得的预训练模型,我们随后引入了一个用于分析开放知识库完成的预训练模型的新数据集,名为 Doge(开放知识图嵌入诊断)。该数据集由 6 个子集组成,旨在测量预训练模型的多个属性:对同义词的鲁棒性、执行演绎推理的能力、性别刻板印象的存在、与反向关系的一致性以及对不同常识领域的覆盖。利用引入的数据集,我们发现现有的 OKBC 模型在同义词和反向关系方面缺乏一致性,并且无法进行演绎推理。此外,它们的预测往往与性别刻板印象相一致,即使有反证,这种刻板印象也会持续存在。我们还研究了预训练词嵌入的作用,并证明避免有偏见的词嵌入并不足以防止 OKBC 模型的偏见行为。
{"title":"Pre-training and diagnosing knowledge base completion models","authors":"Vid Kocijan ,&nbsp;Myeongjun Jang ,&nbsp;Thomas Lukasiewicz","doi":"10.1016/j.artint.2024.104081","DOIUrl":"10.1016/j.artint.2024.104081","url":null,"abstract":"<div><p>In this work, we introduce and analyze an approach to knowledge transfer from one collection of facts to another without the need for entity or relation matching. The method works for both <em>canonicalized</em> knowledge bases and <em>uncanonicalized</em> or <em>open knowledge bases</em>, i.e., knowledge bases where more than one copy of a real-world entity or relation may exist. The main contribution is a method that can make use of large-scale pre-training on facts, which were collected from unstructured text, to improve predictions on structured data from a specific domain. The introduced method is most impactful on small datasets such as <span>ReVerb20k</span>, where a 6% absolute increase of mean reciprocal rank and 65% relative decrease of mean rank over the previously best method was achieved, despite not relying on large pre-trained models like <span>Bert</span>. To understand the obtained pre-trained models better, we then introduce a novel dataset for the analysis of pre-trained models for Open Knowledge Base Completion, called <span>Doge</span> (Diagnostics of Open knowledge Graph Embeddings). It consists of 6 subsets and is designed to measure multiple properties of a pre-trained model: robustness against synonyms, ability to perform deductive reasoning, presence of gender stereotypes, consistency with reverse relations, and coverage of different areas of general knowledge. Using the introduced dataset, we show that the existing OKBC models lack consistency in presence of synonyms and inverse relations and are unable to perform deductive reasoning. Moreover, their predictions often align with gender stereotypes, which persist even when presented with counterevidence. We additionally investigate the role of pre-trained word embeddings and demonstrate that avoiding biased word embeddings is not a sufficient measure to prevent biased behavior of OKBC models.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139688241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revision operators with compact representations 具有紧凑表示的修正算子
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-02 DOI: 10.1016/j.artint.2024.104080
Pavlos Peppas , Mary-Anne Williams , Grigoris Antoniou

Despite the great theoretical advancements in the area of Belief Revision, there has been limited success in terms of implementations. One of the hurdles in implementing revision operators is that their specification (let alone their computation), requires substantial resources. On the other hand, implementing a specific revision operator, like Dalal's operator, would be of limited use. In this paper we generalise Dalal's construction, defining a whole family of concrete revision operators, called Parametrised Difference revision operators or PD operators for short. This family is wide enough to cover a wide range of different applications, and at the same time it is easy to represent. In addition to its semantic definition, we characterise the family of PD operators axiomatically (including a characterisation specifically for Dalal's operator), we prove its' compliance with Parikh's relevance-sensitive postulate (P), we study its computational complexity, and discuss its benefits for belief revision implementations.

尽管 "信念修正 "领域在理论上取得了巨大进步,但在实现方面取得的成功却很有限。实现修正算子的障碍之一是其规范(更不用说计算)需要大量资源。另一方面,实现一个特定的修正算子(如 Dalal 的算子)的作用也很有限。在本文中,我们概括了 Dalal 的构造,定义了一整套具体的修正算子,称为参数化差分修正算子或简称 PD 算子。这个系列的范围很广,足以涵盖各种不同的应用,同时也很容易表示。除了语义定义之外,我们还从公理上描述了 PD 运算符族的特征(包括专门针对 Dalal 运算符的特征描述),证明它符合 Parikh 的相关性敏感公设 (P),研究了它的计算复杂性,并讨论了它对信念修正实现的益处。
{"title":"Revision operators with compact representations","authors":"Pavlos Peppas ,&nbsp;Mary-Anne Williams ,&nbsp;Grigoris Antoniou","doi":"10.1016/j.artint.2024.104080","DOIUrl":"10.1016/j.artint.2024.104080","url":null,"abstract":"<div><p>Despite the great theoretical advancements in the area of Belief Revision, there has been limited success in terms of implementations. One of the hurdles in implementing revision operators is that their specification (let alone their computation), requires substantial resources. On the other hand, implementing a specific revision operator, like Dalal's operator, would be of limited use. In this paper we generalise Dalal's construction, defining a whole family of concrete revision operators, called <em>Parametrised Difference revision operators</em> or <em>PD operators</em> for short. This family is wide enough to cover a wide range of different applications, and at the same time it is easy to represent. In addition to its semantic definition, we characterise the family of PD operators axiomatically (including a characterisation specifically for Dalal's operator), we prove its' compliance with Parikh's relevance-sensitive postulate (P), we study its computational complexity, and discuss its benefits for belief revision implementations.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139661130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A stochastic process approach for multi-agent path finding with non-asymptotic performance guarantees 具有非渐近性能保证的多代理路径搜索随机过程方法
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-01 DOI: 10.1016/j.artint.2024.104084
Xiaoyu He , Xueyan Tang , Wentong Cai , Jingning Li

Multi-agent path finding (MAPF) is a classical NP-hard problem that considers planning collision-free paths for multiple agents simultaneously. A MAPF problem is typically solved via addressing a sequence of single-agent path finding subproblems in which well-studied algorithms such as A are applicable. Existing methods based on this idea, however, rely on an exhaustive search and therefore only have asymptotic performance guarantees. In this article, we provide a modeling paradigm that converts a MAPF problem into a stochastic process and adopts a confidence bound based rule for finding the optimal state transition strategy. A randomized algorithm is proposed to solve this stochastic process, which combines ideas from conflict based search and Monte Carlo tree search. We show that the proposed method is almost surely optimal while enjoying non-asymptotic performance guarantees. In particular, the proposed method can, after solving N single-agent subproblems, produce a feasible solution with suboptimality bounded by O(1/N). The theoretical results are verified by several numerical experiments based on grid maps.

多代理路径查找(MAPF)是一个经典的 NP 难问题,它考虑同时为多个代理规划无碰撞路径。MAPF 问题通常是通过处理一系列单个代理路径查找子问题来解决的,在这些子问题中,A⁎ 等经过充分研究的算法是适用的。然而,基于这一思想的现有方法依赖于穷举搜索,因此只能保证渐进性能。在本文中,我们提供了一种建模范式,将 MAPF 问题转换为随机过程,并采用基于置信度约束的规则来寻找最优状态转换策略。我们提出了一种随机算法来解决这一随机过程,该算法结合了基于冲突的搜索和蒙特卡罗树搜索的思想。我们证明,所提出的方法几乎肯定是最优的,同时享有非渐近性能保证。特别是,在解决 N 个单代理子问题后,所提出的方法可以产生一个可行解,其次优化性的边界为 O(1/N)。基于网格图的几个数值实验验证了理论结果。
{"title":"A stochastic process approach for multi-agent path finding with non-asymptotic performance guarantees","authors":"Xiaoyu He ,&nbsp;Xueyan Tang ,&nbsp;Wentong Cai ,&nbsp;Jingning Li","doi":"10.1016/j.artint.2024.104084","DOIUrl":"10.1016/j.artint.2024.104084","url":null,"abstract":"<div><p>Multi-agent path finding (MAPF) is a classical NP-hard problem that considers planning collision-free paths for multiple agents simultaneously. A MAPF problem is typically solved via addressing a sequence of single-agent path finding subproblems in which well-studied algorithms such as <span><math><msup><mrow><mtext>A</mtext></mrow><mrow><mo>⁎</mo></mrow></msup></math></span> are applicable. Existing methods based on this idea, however, rely on an exhaustive search and therefore only have asymptotic performance guarantees. In this article, we provide a modeling paradigm that converts a MAPF problem into a stochastic process and adopts a confidence bound based rule for finding the optimal state transition strategy. A randomized algorithm is proposed to solve this stochastic process, which combines ideas from conflict based search and Monte Carlo tree search. We show that the proposed method is almost surely optimal while enjoying non-asymptotic performance guarantees. In particular, the proposed method can, after solving <em>N</em> single-agent subproblems, produce a feasible solution with suboptimality bounded by <span><math><mi>O</mi><mo>(</mo><mn>1</mn><mo>/</mo><msqrt><mrow><mi>N</mi></mrow></msqrt><mo>)</mo></math></span>. The theoretical results are verified by several numerical experiments based on grid maps.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal inductive path neural network for temporal knowledge graph reasoning 用于时态知识图谱推理的时态归纳路径神经网络
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-01 DOI: 10.1016/j.artint.2024.104085
Hao Dong , Pengyang Wang , Meng Xiao , Zhiyuan Ning , Pengfei Wang , Yuanchun Zhou

Temporal Knowledge Graph (TKG) is an extension of traditional Knowledge Graph (KG) that incorporates the dimension of time. Reasoning on TKGs is a crucial task that aims to predict future facts based on historical occurrences. The key challenge lies in uncovering structural dependencies within historical subgraphs and temporal patterns. Most existing approaches model TKGs relying on entity modeling, as nodes in the graph play a crucial role in knowledge representation. However, the real-world scenario often involves an extensive number of entities, with new entities emerging over time. This makes it challenging for entity-dependent methods to cope with extensive volumes of entities, and effectively handling newly emerging entities also becomes a significant challenge. Therefore, we propose Temporal Inductive Path Neural Network (TiPNN), which models historical information in an entity-independent perspective. Specifically, TiPNN adopts a unified graph, namely history temporal graph, to comprehensively capture and encapsulate information from history. Subsequently, we utilize the defined query-aware temporal paths on a history temporal graph to model historical path information related to queries for reasoning. Extensive experiments illustrate that the proposed model not only attains significant performance enhancements but also handles inductive settings, while additionally facilitating the provision of reasoning evidence through history temporal graphs.

时态知识图谱(TKG)是传统知识图谱(KG)的扩展,其中包含了时间维度。在 TKG 上进行推理是一项重要任务,旨在根据历史事件预测未来事实。关键的挑战在于发现历史子图中的结构依赖性和时间模式。由于图中的节点在知识表示中起着至关重要的作用,因此大多数现有方法都依赖实体建模对 TKG 进行建模。然而,现实世界的场景往往涉及大量实体,随着时间的推移还会出现新的实体。这使得依赖实体的方法在处理大量实体时面临挑战,而有效处理新出现的实体也成为一项重大挑战。因此,我们提出了时态归纳路径神经网络(Temporal Inductive Path Neural Network,TiPNN),它从与实体无关的角度对历史信息进行建模。具体来说,TiPNN 采用统一的图,即历史时序图,来全面捕捉和封装历史信息。随后,我们利用历史时态图上定义的查询感知时态路径来模拟与查询相关的历史路径信息,从而进行推理。广泛的实验表明,所提出的模型不仅能显著提高性能,还能处理归纳设置,同时还能通过历史时序图提供推理证据。
{"title":"Temporal inductive path neural network for temporal knowledge graph reasoning","authors":"Hao Dong ,&nbsp;Pengyang Wang ,&nbsp;Meng Xiao ,&nbsp;Zhiyuan Ning ,&nbsp;Pengfei Wang ,&nbsp;Yuanchun Zhou","doi":"10.1016/j.artint.2024.104085","DOIUrl":"10.1016/j.artint.2024.104085","url":null,"abstract":"<div><p>Temporal Knowledge Graph (TKG) is an extension of traditional Knowledge Graph (KG) that incorporates the dimension of time. Reasoning on TKGs is a crucial task that aims to predict future facts based on historical occurrences. The key challenge lies in uncovering structural dependencies within historical subgraphs and temporal patterns. Most existing approaches model TKGs relying on entity modeling, as nodes in the graph play a crucial role in knowledge representation. However, the real-world scenario often involves an extensive number of entities, with new entities emerging over time. This makes it challenging for entity-dependent methods to cope with extensive volumes of entities, and effectively handling newly emerging entities also becomes a significant challenge. Therefore, we propose <strong>T</strong>emporal <strong>I</strong>nductive <strong>P</strong>ath <strong>N</strong>eural <strong>N</strong>etwork (TiPNN), which models historical information in an entity-independent perspective. Specifically, TiPNN adopts a unified graph, namely history temporal graph, to comprehensively capture and encapsulate information from history. Subsequently, we utilize the defined query-aware temporal paths on a history temporal graph to model historical path information related to queries for reasoning. Extensive experiments illustrate that the proposed model not only attains significant performance enhancements but also handles inductive settings, while additionally facilitating the provision of reasoning evidence through history temporal graphs.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient optimal Kolmogorov approximation of random variables 随机变量的高效最优柯尔莫哥洛夫逼近法
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-01 DOI: 10.1016/j.artint.2024.104086
Liat Cohen , Tal Grinshpoun , Gera Weiss

Discrete random variables are essential ingredients in various artificial intelligence problems. These include the estimation of the probability of missing the deadline in a series-parallel schedule and the assignment of suppliers to tasks in a project in a manner that maximizes the probability of meeting the overall project deadline. The solving of such problems involves repetitive operations, such as summation, over random variables. However, these computations are NP-hard. Therefore, we explore techniques and methods for approximating random variables with a given support size and minimal Kolmogorov distance. We examine both the general problem of approximating a random variable and a one-sided version in which over-approximation is allowed but not under-approximation. We propose several algorithms and evaluate their performance through computational complexity analysis and empirical evaluation. All the presented algorithms are optimal in the sense that given an input random variable and a requested support size, they return a new approximated random variable with the requested support size and minimal Kolmogorov distance from the input random variable. Our approximation algorithms offer useful estimations of probabilities in situations where exact computations are not feasible due to NP-hardness complexity.

离散随机变量是各种人工智能问题的基本要素。这些问题包括在一系列并行计划中估算错过截止日期的概率,以及以最大化遵守整个项目截止日期的概率的方式为项目中的任务分配供应商。解决这些问题涉及随机变量的重复运算,如求和。然而,这些计算是 NP 难的。因此,我们探索了以给定的支持大小和最小的科尔莫哥洛夫距离逼近随机变量的技术和方法。我们既研究了近似随机变量的一般问题,也研究了允许过近似但不允许欠近似的单边问题。我们提出了几种算法,并通过计算复杂度分析和经验评估来评价它们的性能。从给定输入随机变量和要求的支持大小的意义上讲,所有提出的算法都是最优的,它们返回的新近似随机变量具有要求的支持大小和与输入随机变量的最小柯尔莫哥洛夫距离。我们的近似算法可以在由于 NP 难度复杂性而无法进行精确计算的情况下,提供有用的概率估计。
{"title":"Efficient optimal Kolmogorov approximation of random variables","authors":"Liat Cohen ,&nbsp;Tal Grinshpoun ,&nbsp;Gera Weiss","doi":"10.1016/j.artint.2024.104086","DOIUrl":"10.1016/j.artint.2024.104086","url":null,"abstract":"<div><p><span>Discrete random variables<span> are essential ingredients in various artificial intelligence problems. These include the estimation of the probability of missing the deadline in a series-parallel schedule and the assignment of suppliers to tasks in a project in a manner that maximizes the probability of meeting the overall project deadline. The solving of such problems involves repetitive operations, such as summation, over random variables. However, these computations are NP-hard. Therefore, we explore techniques and methods for approximating random variables with a given support size and minimal Kolmogorov distance. We examine both the general problem of approximating a random variable and a one-sided version in which over-approximation is allowed but not under-approximation. We propose several algorithms and evaluate their performance through </span></span>computational complexity<span> analysis and empirical evaluation. All the presented algorithms are optimal in the sense that given an input random variable and a requested support size, they return a new approximated random variable with the requested support size and minimal Kolmogorov distance from the input random variable. Our approximation algorithms offer useful estimations of probabilities in situations where exact computations are not feasible due to NP-hardness complexity.</span></p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139656047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized Fused-Learner Architectures for Bayesian Reinforcement Learning 贝叶斯强化学习的分散融合学习器架构
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-01 DOI: 10.1016/j.artint.2024.104094
Augustin A. Saucan, Subhro Das, M. Win
{"title":"Decentralized Fused-Learner Architectures for Bayesian Reinforcement Learning","authors":"Augustin A. Saucan, Subhro Das, M. Win","doi":"10.1016/j.artint.2024.104094","DOIUrl":"https://doi.org/10.1016/j.artint.2024.104094","url":null,"abstract":"","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139829878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal Segmentation in Multi Agent Path Finding with Applications to Explainability 多代理路径查找中的时间分割及其在可解释性中的应用
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-02-01 DOI: 10.1016/j.artint.2024.104087
Shaull Almagor, Justin Kottinger, Morteza Lahijanian
{"title":"Temporal Segmentation in Multi Agent Path Finding with Applications to Explainability","authors":"Shaull Almagor, Justin Kottinger, Morteza Lahijanian","doi":"10.1016/j.artint.2024.104087","DOIUrl":"https://doi.org/10.1016/j.artint.2024.104087","url":null,"abstract":"","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139892517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transferable dynamics models for efficient object-oriented reinforcement learning 高效面向对象强化学习的可转移动力学模型
IF 14.4 2区 计算机科学 Q1 Arts and Humanities Pub Date : 2024-01-26 DOI: 10.1016/j.artint.2024.104079
Ofir Marom, Benjamin Rosman

The Reinforcement Learning (RL) framework offers a general paradigm for constructing autonomous agents that can make effective decisions when solving tasks. An important area of study within the field of RL is transfer learning, where an agent utilizes knowledge gained from solving previous tasks to solve a new task more efficiently. While the notion of transfer learning is conceptually appealing, in practice, not all RL representations are amenable to transfer learning. Moreover, much of the research on transfer learning in RL is purely empirical. Previous research has shown that object-oriented representations are suitable for the purposes of transfer learning with theoretical efficiency guarantees. Such representations leverage the notion of object classes to learn lifted rules that apply to grounded object instantiations. In this paper, we extend previous research on object-oriented representations and introduce two formalisms: the first is based on deictic predicates, and is used to learn a transferable transition dynamics model; the second is based on propositions, and is used to learn a transferable reward dynamics model. In addition, we extend previously introduced efficient learning algorithms for object-oriented representations to our proposed formalisms. Our frameworks are then combined into a single efficient algorithm that learns transferable transition and reward dynamics models across a domain of related tasks. We illustrate our proposed algorithm empirically on an extended version of the Taxi domain, as well as the more difficult Sokoban domain, showing the benefits of our approach with regards to efficient learning and transfer.

强化学习(RL)框架为构建能在解决任务时做出有效决策的自主代理提供了一种通用范式。强化学习领域的一个重要研究领域是迁移学习,即代理利用从解决以前任务中获得的知识,更高效地解决新任务。虽然迁移学习的概念很吸引人,但在实践中,并非所有的 RL 表征都适合迁移学习。此外,有关 RL 中迁移学习的研究大多纯属经验之谈。以往的研究表明,面向对象的表示法适合迁移学习的目的,并有理论上的效率保证。这类表征利用对象类的概念来学习适用于基础对象实例的提升规则。在本文中,我们扩展了之前关于面向对象表征的研究,并引入了两种形式主义:第一种形式主义基于谓词,用于学习可迁移的过渡动力学模型;第二种形式主义基于命题,用于学习可迁移的奖励动力学模型。此外,我们还将之前介绍的面向对象表征的高效学习算法扩展到了我们提出的形式主义中。然后,我们将这些框架组合成一个单一的高效算法,在相关任务领域学习可迁移的过渡和奖励动态模型。我们在扩展版的出租车领域以及难度更大的推箱子领域对我们提出的算法进行了实证说明,显示了我们的方法在高效学习和迁移方面的优势。
{"title":"Transferable dynamics models for efficient object-oriented reinforcement learning","authors":"Ofir Marom,&nbsp;Benjamin Rosman","doi":"10.1016/j.artint.2024.104079","DOIUrl":"10.1016/j.artint.2024.104079","url":null,"abstract":"<div><p>The Reinforcement Learning (RL) framework offers a general paradigm for constructing autonomous agents that can make effective decisions when solving tasks. An important area of study within the field of RL is transfer learning, where an agent utilizes knowledge gained from solving previous tasks to solve a new task more efficiently. While the notion of transfer learning is conceptually appealing, in practice, not all RL representations are amenable to transfer learning. Moreover, much of the research on transfer learning in RL is purely empirical. Previous research has shown that object-oriented representations are suitable for the purposes of transfer learning with theoretical efficiency guarantees. Such representations leverage the notion of object classes to learn lifted rules that apply to grounded object instantiations. In this paper, we extend previous research on object-oriented representations and introduce two formalisms: the first is based on deictic predicates, and is used to learn a transferable transition dynamics model; the second is based on propositions, and is used to learn a transferable reward dynamics model. In addition, we extend previously introduced efficient learning algorithms for object-oriented representations to our proposed formalisms. Our frameworks are then combined into a single efficient algorithm that learns transferable transition and reward dynamics models across a domain of related tasks. We illustrate our proposed algorithm empirically on an extended version of the Taxi domain, as well as the more difficult Sokoban domain, showing the benefits of our approach with regards to efficient learning and transfer.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":14.4,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0004370224000158/pdfft?md5=f71414a73bce8f86455910c6a084672b&pid=1-s2.0-S0004370224000158-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139568042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1