首页 > 最新文献

Artificial Intelligence最新文献

英文 中文
Lifted action models learning from partial traces 从部分轨迹学习的提升行动模型
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-15 DOI: 10.1016/j.artint.2024.104256
Leonardo Lamanna , Luciano Serafini , Alessandro Saetti , Alfonso Emilio Gerevini , Paolo Traverso
For applying symbolic planning, there is the necessity of providing the specification of a symbolic action model, which is usually manually specified by a domain expert. However, such an encoding may be faulty due to either human errors or lack of domain knowledge. Therefore, learning the symbolic action model in an automated way has been widely adopted as an alternative to its manual specification. In this paper, we focus on the problem of learning action models offline, from an input set of partially observable plan traces. In particular, we propose an approach to: (i) augment the observability of a given plan trace by applying predefined logical rules; (ii) learn the preconditions and effects of each action in a plan trace from partial observations before and after the action execution. We formally prove that our approach learns action models with fundamental theoretical properties, not provided by other methods. We experimentally show that our approach outperforms a state-of-the-art method on a large set of existing benchmark domains. Furthermore, we compare the effectiveness of the learned action models for solving planning problems and show that the action models learned by our approach are much more effective w.r.t. a state-of-the-art method.1
要应用符号规划,就必须提供符号行动模型的规范,而这种规范通常是由领域专家手动指定的。然而,由于人为错误或缺乏领域知识,这种编码可能会出现问题。因此,以自动方式学习符号动作模型已被广泛采用,以替代人工规范。在本文中,我们将重点关注从部分可观察计划跟踪的输入集离线学习动作模型的问题。特别是,我们提出了一种方法来(i) 通过应用预定义的逻辑规则来增强给定计划跟踪的可观察性;(ii) 从行动执行前后的部分观察结果中学习计划跟踪中每个行动的前提条件和效果。我们从形式上证明,我们的方法所学习的行动模型具有其他方法无法提供的基本理论属性。我们通过实验证明,在大量现有基准领域中,我们的方法优于最先进的方法。此外,我们还比较了学习到的行动模型在解决规划问题时的有效性,结果表明我们的方法学习到的行动模型比最先进的方法更有效1。
{"title":"Lifted action models learning from partial traces","authors":"Leonardo Lamanna ,&nbsp;Luciano Serafini ,&nbsp;Alessandro Saetti ,&nbsp;Alfonso Emilio Gerevini ,&nbsp;Paolo Traverso","doi":"10.1016/j.artint.2024.104256","DOIUrl":"10.1016/j.artint.2024.104256","url":null,"abstract":"<div><div>For applying symbolic planning, there is the necessity of providing the specification of a symbolic action model, which is usually manually specified by a domain expert. However, such an encoding may be faulty due to either human errors or lack of domain knowledge. Therefore, learning the symbolic action model in an automated way has been widely adopted as an alternative to its manual specification. In this paper, we focus on the problem of learning action models offline, from an input set of partially observable plan traces. In particular, we propose an approach to: <em>(i)</em> augment the observability of a given plan trace by applying predefined logical rules; <em>(ii)</em> learn the preconditions and effects of each action in a plan trace from partial observations before and after the action execution. We formally prove that our approach learns action models with fundamental theoretical properties, not provided by other methods. We experimentally show that our approach outperforms a state-of-the-art method on a large set of existing benchmark domains. Furthermore, we compare the effectiveness of the learned action models for solving planning problems and show that the action models learned by our approach are much more effective w.r.t. a state-of-the-art method.<span><span><sup>1</sup></span></span></div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"339 ","pages":"Article 104256"},"PeriodicalIF":5.1,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human-AI coevolution 人类与人工智能的共同进化
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-13 DOI: 10.1016/j.artint.2024.104244
Dino Pedreschi , Luca Pappalardo , Emanuele Ferragina , Ricardo Baeza-Yates , Albert-László Barabási , Frank Dignum , Virginia Dignum , Tina Eliassi-Rad , Fosca Giannotti , János Kertész , Alistair Knott , Yannis Ioannidis , Paul Lukowicz , Andrea Passarella , Alex Sandy Pentland , John Shawe-Taylor , Alessandro Vespignani
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence and complexity science literature. Recommender systems and assistants play a prominent role in human-AI coevolution, as they permeate many facets of daily life and influence human choices through online platforms. The interaction between users and AI results in a potentially endless feedback loop, wherein users' choices generate data to train AI models, which, in turn, shape subsequent user preferences. This human-AI feedback loop has peculiar characteristics compared to traditional human-machine interaction and gives rise to complex and often “unintended” systemic outcomes. This paper introduces human-AI coevolution as the cornerstone for a new field of study at the intersection between AI and complexity science focused on the theoretical, empirical, and mathematical investigation of the human-AI feedback loop. In doing so, we: (i) outline the pros and cons of existing methodologies and highlight shortcomings and potential ways for capturing feedback loop mechanisms; (ii) propose a reflection at the intersection between complexity science, AI and society; (iii) provide real-world examples for different human-AI ecosystems; and (iv) illustrate challenges to the creation of such a field of study, conceptualising them at increasing levels of abstraction, i.e., scientific, legal and socio-political.
人类-人工智能共同进化被定义为人类与人工智能算法不断相互影响的过程,它日益成为我们社会的特征,但在人工智能和复杂性科学文献中却未得到充分研究。推荐系统和助手在人类与人工智能的共同进化中扮演着重要角色,因为它们渗透到日常生活的许多方面,并通过在线平台影响人类的选择。用户与人工智能之间的互动可能会产生无穷无尽的反馈回路,用户的选择会产生数据来训练人工智能模型,而人工智能模型反过来又会塑造用户的后续偏好。与传统的人机交互相比,这种人类与人工智能的反馈循环具有独特的特点,会产生复杂且往往 "非预期 "的系统性结果。本文介绍了人类与人工智能的共同进化,并以此为基石,在人工智能与复杂性科学的交汇点上开辟了一个新的研究领域,专注于对人类与人工智能反馈回路的理论、实证和数学研究。为此,我们(i)概述现有方法论的利弊,并强调捕捉反馈回路机制的不足之处和潜在方法;(ii)提出对复杂性科学、人工智能和社会之间交叉点的反思;(iii)提供不同人类-人工智能生态系统的现实世界实例;以及(iv)说明创建这样一个研究领域所面临的挑战,并从科学、法律和社会政治等抽象层面对这些挑战进行概念化。
{"title":"Human-AI coevolution","authors":"Dino Pedreschi ,&nbsp;Luca Pappalardo ,&nbsp;Emanuele Ferragina ,&nbsp;Ricardo Baeza-Yates ,&nbsp;Albert-László Barabási ,&nbsp;Frank Dignum ,&nbsp;Virginia Dignum ,&nbsp;Tina Eliassi-Rad ,&nbsp;Fosca Giannotti ,&nbsp;János Kertész ,&nbsp;Alistair Knott ,&nbsp;Yannis Ioannidis ,&nbsp;Paul Lukowicz ,&nbsp;Andrea Passarella ,&nbsp;Alex Sandy Pentland ,&nbsp;John Shawe-Taylor ,&nbsp;Alessandro Vespignani","doi":"10.1016/j.artint.2024.104244","DOIUrl":"10.1016/j.artint.2024.104244","url":null,"abstract":"<div><div>Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence and complexity science literature. Recommender systems and assistants play a prominent role in human-AI coevolution, as they permeate many facets of daily life and influence human choices through online platforms. The interaction between users and AI results in a potentially endless feedback loop, wherein users' choices generate data to train AI models, which, in turn, shape subsequent user preferences. This human-AI feedback loop has peculiar characteristics compared to traditional human-machine interaction and gives rise to complex and often “unintended” systemic outcomes. This paper introduces human-AI coevolution as the cornerstone for a new field of study at the intersection between AI and complexity science focused on the theoretical, empirical, and mathematical investigation of the human-AI feedback loop. In doing so, we: <em>(i)</em> outline the pros and cons of existing methodologies and highlight shortcomings and potential ways for capturing feedback loop mechanisms; <em>(ii)</em> propose a reflection at the intersection between complexity science, AI and society; <em>(iii)</em> provide real-world examples for different human-AI ecosystems; and <em>(iv)</em> illustrate challenges to the creation of such a field of study, conceptualising them at increasing levels of abstraction, i.e., scientific, legal and socio-political.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"339 ","pages":"Article 104244"},"PeriodicalIF":5.1,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Separate but equal: Equality in belief propagation for single-cycle graphs 分离但平等:单循环图中信念传播的平等性
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-08 DOI: 10.1016/j.artint.2024.104243
Erel Cohen, Ben Rachmut, Omer Lev, Roie Zivan
Belief propagation is a widely used, incomplete optimization algorithm whose main theoretical properties hold only under the assumption that beliefs are not equal. Nevertheless, there is substantial evidence to suggest that equality between beliefs does occur. A published method to overcome belief equality, which is based on the use of unary function-nodes, is commonly assumed to resolve the problem.
In this study, we focus on min-sum, the version of belief propagation that is used to solve constraint optimization problems. We prove that for the case of a single-cycle graph, belief equality can only be avoided when the algorithm converges to the optimal solution. Under any other circumstances, the unary function method will not prevent equality, indicating that some of the existing results presented in the literature are in need of reassessment. We differentiate between belief equality, which refers to equal beliefs in a single message, and assignment equality, which prevents the coherent assignment of values to the variables, and we provide conditions for both.
信念传播是一种广泛使用的不完全优化算法,其主要理论特性只有在信念不相等的假设下才成立。然而,有大量证据表明,信念相等的情况确实存在。一种已公布的克服信念相等的方法是基于单值函数节点的使用,通常被认为可以解决这个问题。
{"title":"Separate but equal: Equality in belief propagation for single-cycle graphs","authors":"Erel Cohen,&nbsp;Ben Rachmut,&nbsp;Omer Lev,&nbsp;Roie Zivan","doi":"10.1016/j.artint.2024.104243","DOIUrl":"10.1016/j.artint.2024.104243","url":null,"abstract":"<div><div>Belief propagation is a widely used, incomplete optimization algorithm whose main theoretical properties hold only under the assumption that beliefs are not equal. Nevertheless, there is substantial evidence to suggest that equality between beliefs does occur. A published method to overcome belief equality, which is based on the use of unary function-nodes, is commonly assumed to resolve the problem.</div><div>In this study, we focus on min-sum, the version of belief propagation that is used to solve constraint optimization problems. We prove that for the case of a single-cycle graph, belief equality can only be avoided when the algorithm converges to the optimal solution. Under any other circumstances, the unary function method will <em>not</em> prevent equality, indicating that some of the existing results presented in the literature are in need of reassessment. We differentiate between belief equality, which refers to equal beliefs in a single message, and assignment equality, which prevents the coherent assignment of values to the variables, and we provide conditions for both.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104243"},"PeriodicalIF":5.1,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative models for grid-based and image-based pathfinding 基于网格和图像的寻路生成模型
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-08 DOI: 10.1016/j.artint.2024.104238
Daniil Kirilenko , Anton Andreychuk , Aleksandr I. Panov , Konstantin Yakovlev
Pathfinding is a challenging problem which generally asks to find a sequence of valid moves for an agent provided with a representation of the environment, i.e. a map, in which it operates. In this work, we consider pathfinding on binary grids and on image representations of the digital elevation models. In the former case, the transition costs are known, while in latter scenario, they are not. A widespread method to solve the first problem is to utilize a search algorithm that systematically explores the search space to obtain a solution. Ideally, the search should also be complemented with an informative heuristic to focus on the goal and prune the unpromising regions of the search space, thus decreasing the number of search iterations. Unfortunately, the widespread heuristic functions for grid-based pathfinding, such as Manhattan distance or Chebyshev distance, do not take the obstacles into account and in obstacle-rich environments demonstrate inefficient performance. As for pathfinding with image inputs, the heuristic search cannot be applied straightforwardly as the transition costs, i.e. the costs of moving from one pixel to the other, are not known. To tackle both challenges, we suggest utilizing modern deep neural networks to infer the instance-dependent heuristic functions at the pre-processing step and further use them for pathfinding with standard heuristic search algorithms. The principal heuristic function that we suggest learning is the path probability, which indicates how likely the grid cell (pixel) is lying on the shortest path (for binary grids with known transition costs, we also suggest another variant of the heuristic function that can speed up the search). Learning is performed in a supervised fashion (while we have also explored the possibilities of end-to-end learning that includes a planner in the learning pipeline). At the test time, path probability is used as the secondary heuristic for the Focal Search, a specific heuristic search algorithm that provides the theoretical guarantees on the cost bound of the resultant solution. Empirically, we show that the suggested approach significantly outperforms state-of-the-art competitors in a variety of different tasks (including out-of-the distribution instances).
寻路是一个极具挑战性的问题,通常要求为一个代理找到一连串有效的移动,而代理所处的环境就是地图。在这项工作中,我们考虑在二进制网格和数字高程模型的图像表示上进行寻路。在前一种情况下,过渡成本是已知的,而在后一种情况下,过渡成本是未知的。解决第一个问题的普遍方法是利用一种搜索算法,系统地探索搜索空间以获得解决方案。理想情况下,搜索还应辅之以信息启发式,以聚焦目标并修剪搜索空间中不具前景的区域,从而减少搜索迭代次数。遗憾的是,用于网格寻路的常用启发式函数,如曼哈顿距离或切比雪夫距离,并没有将障碍物考虑在内,在障碍物密集的环境中表现出低效的性能。至于利用图像输入寻路,由于过渡成本(即从一个像素移动到另一个像素的成本)未知,启发式搜索无法直接应用。为了解决这两个难题,我们建议在预处理步骤中利用现代深度神经网络来推断与实例相关的启发式函数,并进一步将其用于标准启发式搜索算法的寻路。我们建议学习的主要启发式函数是路径概率,它表示网格单元(像素)位于最短路径上的可能性有多大(对于已知过渡成本的二进制网格,我们还建议使用另一种可加快搜索速度的启发式函数变体)。学习以有监督的方式进行(同时我们也探索了端到端学习的可能性,即在学习管道中加入规划器)。在测试时,路径概率被用作焦点搜索(Focal Search)的辅助启发式,这是一种特定的启发式搜索算法,可为最终解决方案的成本边界提供理论保证。经验表明,在各种不同的任务(包括分布外实例)中,所建议的方法明显优于最先进的竞争对手。
{"title":"Generative models for grid-based and image-based pathfinding","authors":"Daniil Kirilenko ,&nbsp;Anton Andreychuk ,&nbsp;Aleksandr I. Panov ,&nbsp;Konstantin Yakovlev","doi":"10.1016/j.artint.2024.104238","DOIUrl":"10.1016/j.artint.2024.104238","url":null,"abstract":"<div><div>Pathfinding is a challenging problem which generally asks to find a sequence of valid moves for an agent provided with a representation of the environment, i.e. a map, in which it operates. In this work, we consider pathfinding on binary grids and on image representations of the digital elevation models. In the former case, the transition costs are known, while in latter scenario, they are not. A widespread method to solve the first problem is to utilize a search algorithm that systematically explores the search space to obtain a solution. Ideally, the search should also be complemented with an informative heuristic to focus on the goal and prune the unpromising regions of the search space, thus decreasing the number of search iterations. Unfortunately, the widespread heuristic functions for grid-based pathfinding, such as Manhattan distance or Chebyshev distance, do not take the obstacles into account and in obstacle-rich environments demonstrate inefficient performance. As for pathfinding with image inputs, the heuristic search cannot be applied straightforwardly as the transition costs, i.e. the costs of moving from one pixel to the other, are not known. To tackle both challenges, we suggest utilizing modern deep neural networks to infer the instance-dependent heuristic functions at the pre-processing step and further use them for pathfinding with standard heuristic search algorithms. The principal heuristic function that we suggest learning is the path probability, which indicates how likely the grid cell (pixel) is lying on the shortest path (for binary grids with known transition costs, we also suggest another variant of the heuristic function that can speed up the search). Learning is performed in a supervised fashion (while we have also explored the possibilities of end-to-end learning that includes a planner in the learning pipeline). At the test time, path probability is used as the secondary heuristic for the Focal Search, a specific heuristic search algorithm that provides the theoretical guarantees on the cost bound of the resultant solution. Empirically, we show that the suggested approach significantly outperforms state-of-the-art competitors in a variety of different tasks (including out-of-the distribution instances).</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104238"},"PeriodicalIF":5.1,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online learning in sequential Bayesian persuasion: Handling unknown priors 序列贝叶斯说服中的在线学习:处理未知先验
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-06 DOI: 10.1016/j.artint.2024.104245
Martino Bernasconi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti, Francesco Trovò
We study a repeated information design problem faced by an informed sender who tries to influence the behavior of a self-interested receiver, through the provision of payoff-relevant information. We consider settings where the receiver repeatedly faces a sequential decision making (SDM) problem. At each round, the sender observes the realizations of random events in the SDM problem, which are only partially observable by the receiver. This begets the challenge of how to incrementally disclose such information to the receiver to persuade them to follow (desirable) action recommendations. We study the case in which the sender does not know random events probabilities, and, thus, they have to gradually learn them while persuading the receiver. We start by providing a non-trivial polytopal approximation of the set of the sender's persuasive information-revelation structures. This is crucial to design efficient learning algorithms. Next, we prove a negative result which also applies to the non-sequential case: no learning algorithm can be persuasive in high probability. Thus, we relax the persuasiveness requirement, studying algorithms that guarantee that the receiver's regret in following recommendations grows sub-linearly. In the full-feedback setting—where the sender observes the realizations of all the possible random events—, we provide an algorithm with O˜(T) regret for both the sender and the receiver. Instead, in the bandit-feedback setting—where the sender only observes the realizations of random events actually occurring in the SDM problem—, we design an algorithm that, given an α[1/2,1] as input, guarantees O˜(Tα) and O˜(Tmax{α,1α2}) regrets, for the sender and the receiver respectively. This result is complemented by a lower bound showing that such a regret trade-off is tight for α[1/2,2/3].
我们研究的是一个重复信息设计问题,该问题由一个知情的发送者面临,他试图通过提供与报酬相关的信息来影响一个自利的接收者的行为。我们考虑的是接收方重复面临连续决策(SDM)问题的情况。在每一轮中,发送方都会观察 SDM 问题中随机事件的实现情况,而接收方只能部分地观察到这些情况。这就带来了一个挑战:如何逐步向接收方披露这些信息,以说服他们遵循(理想的)行动建议。我们研究的是发送方不知道随机事件概率的情况,因此发送方必须在说服接收方的同时逐步了解这些概率。我们首先提供了发送方有说服力的信息披露结构集合的非难多顶近似值。这对于设计高效的学习算法至关重要。接下来,我们证明了一个同样适用于非序列情况的否定结果:任何学习算法都不可能高概率地具有说服力。因此,我们放宽了对说服力的要求,研究那些能保证接收者在遵循推荐时的遗憾呈亚线性增长的算法。在全反馈设置中--即发送者观察所有可能的随机事件的实现情况--我们提供了一种对发送者和接收者都有 O˜(T)遗憾的算法。相反,在匪徒反馈设置中,即发送方只观察 SDM 问题中实际发生的随机事件的实现情况,我们设计了一种算法,在输入α∈[1/2,1]的情况下,保证发送方和接收方分别有 O˜(Tα)和 O˜(Tmax{α,1-α2})遗憾。这一结果得到了一个下限的补充,表明这种遗憾权衡在 α∈[1/2,2/3] 时是紧密的。
{"title":"Online learning in sequential Bayesian persuasion: Handling unknown priors","authors":"Martino Bernasconi,&nbsp;Matteo Castiglioni,&nbsp;Alberto Marchesi,&nbsp;Nicola Gatti,&nbsp;Francesco Trovò","doi":"10.1016/j.artint.2024.104245","DOIUrl":"10.1016/j.artint.2024.104245","url":null,"abstract":"<div><div>We study a repeated <em>information design</em> problem faced by an informed <em>sender</em> who tries to influence the behavior of a self-interested <em>receiver</em>, through the provision of payoff-relevant information. We consider settings where the receiver repeatedly faces a <em>sequential decision making</em> (SDM) problem. At each round, the sender observes the realizations of random events in the SDM problem, which are only partially observable by the receiver. This begets the challenge of how to incrementally disclose such information to the receiver to <em>persuade</em> them to follow (desirable) action recommendations. We study the case in which the sender does <em>not</em> know random events probabilities, and, thus, they have to gradually learn them while persuading the receiver. We start by providing a non-trivial polytopal approximation of the set of the sender's persuasive information-revelation structures. This is crucial to design efficient learning algorithms. Next, we prove a negative result which also applies to the non-sequential case: <em>no learning algorithm can be persuasive in high probability</em>. Thus, we relax the persuasiveness requirement, studying algorithms that guarantee that the receiver's <em>regret</em> in following recommendations <em>grows sub-linearly</em>. In the <em>full-feedback</em> setting—where the sender observes the realizations of <em>all</em> the possible random events—, we provide an algorithm with <span><math><mover><mrow><mi>O</mi></mrow><mrow><mo>˜</mo></mrow></mover><mo>(</mo><msqrt><mrow><mi>T</mi></mrow></msqrt><mo>)</mo></math></span> regret for both the sender and the receiver. Instead, in the <em>bandit-feedback</em> setting—where the sender only observes the realizations of random events actually occurring in the SDM problem—, we design an algorithm that, given an <span><math><mi>α</mi><mo>∈</mo><mo>[</mo><mn>1</mn><mo>/</mo><mn>2</mn><mo>,</mo><mn>1</mn><mo>]</mo></math></span> as input, guarantees <span><math><mover><mrow><mi>O</mi></mrow><mrow><mo>˜</mo></mrow></mover><mo>(</mo><msup><mrow><mi>T</mi></mrow><mrow><mi>α</mi></mrow></msup><mo>)</mo></math></span> and <span><math><mover><mrow><mi>O</mi></mrow><mrow><mo>˜</mo></mrow></mover><mo>(</mo><msup><mrow><mi>T</mi></mrow><mrow><mi>max</mi><mo>⁡</mo><mo>{</mo><mi>α</mi><mo>,</mo><mn>1</mn><mo>−</mo><mfrac><mrow><mi>α</mi></mrow><mrow><mn>2</mn></mrow></mfrac><mo>}</mo></mrow></msup><mo>)</mo></math></span> regrets, for the sender and the receiver respectively. This result is complemented by a lower bound showing that such a regret trade-off is tight for <span><math><mi>α</mi><mo>∈</mo><mo>[</mo><mn>1</mn><mo>/</mo><mn>2</mn><mo>,</mo><mn>2</mn><mo>/</mo><mn>3</mn><mo>]</mo></math></span>.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104245"},"PeriodicalIF":5.1,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142655204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TeachText: CrossModal text-video retrieval through generalized distillation TeachText:通过概括提炼实现跨模态文本-视频检索
IF 5.1 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-10-30 DOI: 10.1016/j.artint.2024.104235
Ioana Croitoru , Simion-Vlad Bogolin , Marius Leordeanu , Hailin Jin , Andrew Zisserman , Yang Liu , Samuel Albanie
In recent years, considerable progress on the task of text-video retrieval has been achieved by leveraging large-scale pretraining on visual and audio datasets to construct powerful video encoders. By contrast, despite the natural symmetry, the design of effective algorithms for exploiting large-scale language pretraining remains under-explored. In this work, we investigate the design of such algorithms and propose a novel generalized distillation method, TeachText, which leverages complementary cues from multiple text encoders to provide an enhanced supervisory signal to the retrieval model. TeachText yields significant gains on a number of video retrieval benchmarks without incurring additional computational overhead during inference and was used to produce the winning entry in the Condensed Movie Challenge at ICCV 2021. We show how TeachText can be extended to include multiple video modalities, reducing computational cost at inference without compromising performance. Finally, we demonstrate the application of our method to the task of removing noisy descriptions from the training partitions of retrieval datasets to improve performance. Code and data can be found at https://www.robots.ox.ac.uk/~vgg/research/teachtext/.
近年来,通过对视觉和音频数据集进行大规模预训练来构建功能强大的视频编码器,文本-视频检索任务取得了长足的进步。相比之下,尽管存在天然的对称性,但利用大规模语言预训练设计有效算法的工作仍未得到充分探索。在这项工作中,我们对此类算法的设计进行了研究,并提出了一种新颖的广义蒸馏方法 TeachText,该方法利用来自多个文本编码器的互补线索,为检索模型提供增强的监督信号。TeachText 在一些视频检索基准测试中取得了显著的收益,而不会在推理过程中产生额外的计算开销,并在 2021 年 ICCV 的 "浓缩电影挑战赛 "中获得了优胜。我们展示了 TeachText 如何扩展到多种视频模式,从而在不影响性能的情况下降低推理时的计算成本。最后,我们演示了如何将我们的方法应用于从检索数据集的训练分区中去除噪声描述以提高性能的任务。代码和数据见 https://www.robots.ox.ac.uk/~vgg/research/teachtext/。
{"title":"TeachText: CrossModal text-video retrieval through generalized distillation","authors":"Ioana Croitoru ,&nbsp;Simion-Vlad Bogolin ,&nbsp;Marius Leordeanu ,&nbsp;Hailin Jin ,&nbsp;Andrew Zisserman ,&nbsp;Yang Liu ,&nbsp;Samuel Albanie","doi":"10.1016/j.artint.2024.104235","DOIUrl":"10.1016/j.artint.2024.104235","url":null,"abstract":"<div><div>In recent years, considerable progress on the task of text-video retrieval has been achieved by leveraging large-scale pretraining on visual and audio datasets to construct powerful video encoders. By contrast, despite the natural symmetry, the design of effective algorithms for exploiting large-scale language pretraining remains under-explored. In this work, we investigate the design of such algorithms and propose a novel generalized distillation method, <span>TeachText</span>, which leverages complementary cues from multiple text encoders to provide an enhanced supervisory signal to the retrieval model. <span>TeachText</span> yields significant gains on a number of video retrieval benchmarks without incurring additional computational overhead during inference and was used to produce the winning entry in the Condensed Movie Challenge at ICCV 2021. We show how <span>TeachText</span> can be extended to include multiple video modalities, reducing computational cost at inference without compromising performance. Finally, we demonstrate the application of our method to the task of removing noisy descriptions from the training partitions of retrieval datasets to improve performance. Code and data can be found at <span><span>https://www.robots.ox.ac.uk/~vgg/research/teachtext/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"338 ","pages":"Article 104235"},"PeriodicalIF":5.1,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142655203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Datalog rewritability and data complexity of ALCHOIQ with closed predicates 带封闭谓词的 ALCHOIQ 的数据模型可重写性和数据复杂性
IF 14.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-02-23 DOI: 10.1016/j.artint.2024.104099
Sanja Lukumbuzya , Magdalena Ortiz , Mantas Šimkus

We study the relative expressiveness of ontology-mediated queries (OMQs) formulated in the expressive Description Logic ALCHOIQ extended with closed predicates. In particular, we present a polynomial time translation from OMQs into Datalog with negation under the stable model semantics, the formalism that underlies Answer Set Programming. This is a novel and non-trivial result: the considered OMQs are not only non-monotonic, but also feature a tricky combination of nominals, inverse roles, and counting. We start with atomic queries and then lift our approach to a large class of first-order queries where quantification is “guarded” by closed predicates. Our translation is based on a characterization of the query answering problem via integer programming, and a specially crafted program in Datalog with negation that finds solutions to dynamically generated systems of integer inequalities. As an important by-product of our translation we get that the query answering problem is co-NP-complete in data complexity for the considered class of OMQs. Thus, answering these OMQs in the presence of closed predicates is not harder than answering them in the standard setting. This is not obvious as closed predicates are known to increase data complexity for some existing ontology languages.

我们研究了本体中介查询(OMQs)的相对表达能力,这些查询是用封闭谓词扩展的表达式描述逻辑 ALCHOIQ 提出的。特别是,我们提出了在稳定模型语义(支撑答案集编程的形式主义)下将 OMQ 转换为带否定的 Datalog 的多项式时间。这是一个新颖而非难的结果:所考虑的 OMQs 不仅是非单调的,而且还具有提名、反向角色和计数的棘手组合。我们从原子查询开始,然后将我们的方法推广到一大类一阶查询,在这些查询中,量化被封闭谓词 "保护 "着。我们的转换是基于通过整数编程对查询回答问题的描述,以及在 Datalog 中专门设计的带有否定的程序,该程序可以找到动态生成的整数不等式系统的解决方案。作为翻译的一个重要副产品,我们发现对于所考虑的这一类 OMQs,查询回答问题在数据复杂度上是共 NP 完备的。因此,在存在封闭谓词的情况下回答这些 OMQs 并不比在标准设置下回答它们更难。这一点并不明显,因为已知封闭谓词会增加某些现有本体语言的数据复杂度。
{"title":"Datalog rewritability and data complexity of ALCHOIQ with closed predicates","authors":"Sanja Lukumbuzya ,&nbsp;Magdalena Ortiz ,&nbsp;Mantas Šimkus","doi":"10.1016/j.artint.2024.104099","DOIUrl":"10.1016/j.artint.2024.104099","url":null,"abstract":"<div><p>We study the relative expressiveness of ontology-mediated queries (OMQs) formulated in the expressive Description Logic <span><math><mi>ALCHOIQ</mi></math></span> extended with closed predicates. In particular, we present a polynomial time translation from OMQs into Datalog with negation under the stable model semantics, the formalism that underlies Answer Set Programming. This is a novel and non-trivial result: the considered OMQs are not only non-monotonic, but also feature a tricky combination of nominals, inverse roles, and counting. We start with atomic queries and then lift our approach to a large class of first-order queries where quantification is “guarded” by closed predicates. Our translation is based on a characterization of the query answering problem via integer programming, and a specially crafted program in Datalog with negation that finds solutions to dynamically generated systems of integer inequalities. As an important by-product of our translation we get that the query answering problem is co-NP-complete in data complexity for the considered class of OMQs. Thus, answering these OMQs in the presence of closed predicates is not harder than answering them in the standard setting. This is not obvious as closed predicates are known to increase data complexity for some existing ontology languages.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"330 ","pages":"Article 104099"},"PeriodicalIF":14.4,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0004370224000353/pdfft?md5=592b8d7cf49fb907ac6ec3932a493a97&pid=1-s2.0-S0004370224000353-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139943365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized fused-learner architectures for Bayesian reinforcement learning 贝叶斯强化学习的分散融合学习器架构
IF 14.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-02-13 DOI: 10.1016/j.artint.2024.104094
Augustin A. Saucan , Subhro Das , Moe Z. Win

Decentralized training is a robust solution for learning over an extensive network of distributed agents. Many existing solutions involve the averaging of locally inferred parameters which constrain the architecture to independent agents with identical learning algorithms. Here, we propose decentralized fused-learner architectures for Bayesian reinforcement learning, named fused Bayesian-learner architectures (FBLAs), that are capable of learning an optimal policy by fusing potentially heterogeneous Bayesian policy gradient learners, i.e., agents that employ different learning architectures to estimate the gradient of a control policy. The novelty of FBLAs relies on fusing the full posterior distributions of the local policy gradients. The inclusion of higher-order information, i.e., probabilistic uncertainty, is employed to robustly fuse the locally-trained parameters. FBLAs find the barycenter of all local posterior densities by minimizing the total Kullback–Leibler divergence from the barycenter distribution to the local posterior densities. The proposed FBLAs are demonstrated on a sensor-selection problem for Bernoulli tracking, where multiple sensors observe a dynamic target and only a subset of sensors is allowed to be active at any time.

分散式训练是在广泛的分布式代理网络中进行学习的稳健解决方案。现有的许多解决方案都涉及局部推断参数的平均化,这就将架构限制为具有相同学习算法的独立代理。在这里,我们提出了用于贝叶斯强化学习的分散式融合学习器架构,并将其命名为融合贝叶斯学习器架构(FBLAs),它能够通过融合潜在的异构贝叶斯策略梯度学习器(即采用不同学习架构来估计控制策略梯度的代理)来学习最优策略。贝叶斯策略梯度学习器的新颖之处在于融合了局部策略梯度的完整后验分布。将高阶信息(即概率不确定性)纳入其中,可稳健地融合局部训练参数。FBLA 通过最小化从原点分布到局部后验密度的总库尔贝-莱布勒发散,找到所有局部后验密度的原点。我们在伯努利跟踪的传感器选择问题上演示了所提出的 FBLA,在该问题中,多个传感器观察一个动态目标,而在任何时候都只允许一个传感器子集处于活动状态。
{"title":"Decentralized fused-learner architectures for Bayesian reinforcement learning","authors":"Augustin A. Saucan ,&nbsp;Subhro Das ,&nbsp;Moe Z. Win","doi":"10.1016/j.artint.2024.104094","DOIUrl":"10.1016/j.artint.2024.104094","url":null,"abstract":"<div><p>Decentralized training is a robust solution for learning over an extensive network of distributed agents. Many existing solutions involve the averaging of locally inferred parameters which constrain the architecture to independent agents with identical learning algorithms. Here, we propose decentralized fused-learner architectures for Bayesian reinforcement learning, named fused Bayesian-learner architectures (FBLAs), that are capable of learning an optimal policy by fusing potentially heterogeneous Bayesian policy gradient learners, i.e., agents that employ different learning architectures to estimate the gradient of a control policy. The novelty of FBLAs relies on fusing the full posterior distributions of the local policy gradients. The inclusion of higher-order information, i.e., probabilistic uncertainty, is employed to robustly fuse the locally-trained parameters. FBLAs find the barycenter of all local posterior densities by minimizing the total Kullback–Leibler divergence from the barycenter distribution to the local posterior densities. The proposed FBLAs are demonstrated on a sensor-selection problem for Bernoulli tracking, where multiple sensors observe a dynamic target and only a subset of sensors is allowed to be active at any time.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"331 ","pages":"Article 104094"},"PeriodicalIF":14.4,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139889913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Primarily about primaries 主要涉及初选
IF 14.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-02-07 DOI: 10.1016/j.artint.2024.104095
Allan Borodin , Omer Lev , Nisarg Shah , Tyrone Strangway

Much of the social choice literature examines direct voting systems, in which voters submit their ranked preferences over candidates and a voting rule picks a winner. Real-world elections and decision-making processes are often more complex and involve multiple stages. For instance, one popular voting system filters candidates through primaries: first, voters affiliated with each political party vote over candidates of their own party and the voting rule picks a set of candidates, one from each party, who then compete in a general election.

We present a model to analyze such multi-stage elections, and conduct what is, to the best of our knowledge, the first quantitative comparison of the direct and primary voting systems in terms of the quality of the elected candidate, using the metric of distortion, which attempts to quantify how far from the optimal winner is the actual winner of an election. Our main theoretical result is that voting rules (which are independent of party affiliations, of course) are guaranteed to perform in the primary system within a constant factor of the direct, single stage setting. Surprisingly, the converse does not hold: we show settings in which there exist voting rules that perform significantly better under the primary system than under the direct system. Using simulations, we see that plurality benefits significantly from using a primary system over a direct one, while Condorcet-consistent rules do not.

大部分社会选择文献研究的是直接投票系统,即选民提交他们对候选人的排序偏好,然后由投票规则选出获胜者。现实世界中的选举和决策过程往往更为复杂,涉及多个阶段。例如,一种流行的投票系统通过初选筛选候选人:首先,隶属于各政党的选民对本党派的候选人进行投票,投票规则选出一组候选人,每个党派选出一名,然后由这些候选人在大选中竞争。我们提出了一个模型来分析这种多阶段选举,并就我们所知,首次就当选候选人的质量对直接投票系统和初选投票系统进行了定量比较,使用的指标是失真度,它试图量化选举的实际获胜者离最优获胜者有多远。我们的主要理论结果是,投票规则(当然与党派无关)在初选系统中的表现保证在直接、单一阶段设置的恒定系数之内。令人惊讶的是,相反的情况并不成立:我们展示了在初选制下投票规则的表现明显优于直接制的情况。通过模拟,我们发现使用初选制比使用直接制更能使多数票获益,而康德赛特一致性规则则不然。
{"title":"Primarily about primaries","authors":"Allan Borodin ,&nbsp;Omer Lev ,&nbsp;Nisarg Shah ,&nbsp;Tyrone Strangway","doi":"10.1016/j.artint.2024.104095","DOIUrl":"https://doi.org/10.1016/j.artint.2024.104095","url":null,"abstract":"<div><p>Much of the social choice literature examines <em>direct</em> voting systems, in which voters submit their ranked preferences over candidates and a voting rule picks a winner. Real-world elections and decision-making processes are often more complex and involve multiple stages. For instance, one popular voting system filters candidates through <em>primaries</em>: first, voters affiliated with each political party vote over candidates of their own party and the voting rule picks a set of candidates, one from each party, who then compete in a general election.</p><p>We present a model to analyze such multi-stage elections, and conduct what is, to the best of our knowledge, the first quantitative comparison of the direct and primary voting systems in terms of the quality of the elected candidate, using the metric of <em>distortion</em>, which attempts to quantify how far from the optimal winner is the actual winner of an election. Our main theoretical result is that voting rules (which are independent of party affiliations, of course) are guaranteed to perform in the primary system within a constant factor of the direct, single stage setting. Surprisingly, the converse does not hold: we show settings in which there exist voting rules that perform significantly better under the primary system than under the direct system. Using simulations, we see that plurality benefits significantly from using a primary system over a direct one, while Condorcet-consistent rules do not.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"329 ","pages":"Article 104095"},"PeriodicalIF":14.4,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139726522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal segmentation in multi agent path finding with applications to explainability 多代理路径查找中的时间分割及其在可解释性中的应用
IF 14.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-02-07 DOI: 10.1016/j.artint.2024.104087
Shaull Almagor , Justin Kottinger , Morteza Lahijanian

Multi-Agent Path Finding (MAPF) is the problem of planning paths for agents to reach their targets from their start locations, such that the agents do not collide while executing the plan. In many settings, the plan (or a digest thereof) is conveyed to a supervising entity, e.g., for confirmation before execution, for a report, etc. In such cases, we wish to convey that the plan is collision-free with minimal amount of information. To this end, we propose an explanation scheme for MAPF. The scheme decomposes a plan into segments such that within each segment, the agents' paths are disjoint. We can then convey the plan whilst convincing that it is collision-free, using a small number of frames (dubbed an explanation). We can also measure the simplicity of a plan by the number of segments required for the decomposition. We study the complexity of algorithmic problems that arise by the explanation scheme and the tradeoff between the length (makespan) of a plan and its minimal decomposition. We also introduce two centralized (i.e. runs on a single CPU with full knowledge of the multi-agent system) algorithms for planning with explanations. One is based on a coupled search algorithm similar to A, and the other is a decoupled method based on Conflict-Based Search (CBS). We refer to the latter as Explanation-Guided CBS (XG-CBS), which uses a low-level search for individual agents and maintains a high-level conflict tree to guide the low-level search to avoid collisions as well as increasing the number of segments. We propose four approaches to the low-level search of XG-CBS by modifying A for explanations and analyze their effects on the completeness of XG-CBS. Finally, we highlight important aspects of the proposed explanation scheme in various MAPF problems and empirically evaluate the performance of the proposed planning algorithms in a series of benchmark problems.

多代理路径查找(MAPF)是为代理规划从其起始位置到达目标的路径,从而使代理在执行计划时不会发生碰撞的问题。在许多情况下,计划(或其摘要)会被传达给一个监督实体,例如,在执行前进行确认、提交报告等。在这种情况下,我们希望以最少的信息量传达计划是无碰撞的。为此,我们提出了一种 MAPF 解释方案。该方案将计划分解成若干段,在每一段中,代理的路径都是不相交的。这样,我们就能用少量的帧来传达计划,同时让人相信它是无碰撞的(称为解释)。我们还可以通过分解所需的分段数量来衡量计划的简单程度。我们研究了解释方案带来的算法问题的复杂性,以及计划长度(makespan)和最小分解之间的权衡。我们还介绍了两种集中式(即在一个中央处理器上运行,且完全了解多代理系统)算法,用于进行带解释的规划。一种是基于类似于 A⁎ 的耦合搜索算法,另一种是基于冲突搜索(CBS)的解耦方法。我们将后者称为 "解释引导的 CBS(XG-CBS)",它对单个代理使用低层搜索,并维护高层冲突树来引导低层搜索,以避免碰撞并增加片段数量。我们提出了四种通过修改 A⁎ 来解释 XG-CBS 低层搜索的方法,并分析了它们对 XG-CBS 完整性的影响。最后,我们强调了在各种 MAPF 问题中建议的解释方案的重要方面,并在一系列基准问题中对建议的规划算法的性能进行了实证评估。
{"title":"Temporal segmentation in multi agent path finding with applications to explainability","authors":"Shaull Almagor ,&nbsp;Justin Kottinger ,&nbsp;Morteza Lahijanian","doi":"10.1016/j.artint.2024.104087","DOIUrl":"10.1016/j.artint.2024.104087","url":null,"abstract":"<div><p>Multi-Agent Path Finding (MAPF) is the problem of planning paths for agents to reach their targets from their start locations, such that the agents do not collide while executing the plan. In many settings, the plan (or a digest thereof) is conveyed to a supervising entity, e.g., for confirmation before execution, for a report, etc. In such cases, we wish to convey that the plan is collision-free with minimal amount of information. To this end, we propose an <em>explanation scheme</em> for MAPF. The scheme decomposes a plan into segments such that within each segment, the agents' paths are disjoint. We can then convey the plan whilst convincing that it is collision-free, using a small number of frames (dubbed an <em>explanation</em>). We can also measure the simplicity of a plan by the number of segments required for the decomposition. We study the complexity of algorithmic problems that arise by the explanation scheme and the tradeoff between the length (makespan) of a plan and its minimal decomposition. We also introduce two centralized (i.e. runs on a single CPU with full knowledge of the multi-agent system) algorithms for planning with explanations. One is based on a coupled search algorithm similar to A<sup>⁎</sup>, and the other is a decoupled method based on Conflict-Based Search (CBS). We refer to the latter as <em>Explanation-Guided CBS</em> (XG-CBS), which uses a low-level search for individual agents and maintains a high-level conflict tree to guide the low-level search to avoid collisions as well as increasing the number of segments. We propose four approaches to the low-level search of XG-CBS by modifying A<sup>⁎</sup> for explanations and analyze their effects on the completeness of XG-CBS. Finally, we highlight important aspects of the proposed explanation scheme in various MAPF problems and empirically evaluate the performance of the proposed planning algorithms in a series of benchmark problems.</p></div>","PeriodicalId":8434,"journal":{"name":"Artificial Intelligence","volume":"330 ","pages":"Article 104087"},"PeriodicalIF":14.4,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139832974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1