首页 > 最新文献

Journal of Artificial Intelligence Research最新文献

英文 中文
Qualitative Reasoning about 2D Cardinal Directions using Answer Set Programming 利用答案集规划的二维基本方向定性推理
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-08-09 DOI: 10.1613/jair.1.14345
Yusuf Izmirlioglu, E. Erdem
We introduce a formal framework (called NCDC-ASP) for representing and reasoning about cardinal directions between extended spatial objects on a plane, using Answer Set Programming (ASP). NCDC-ASP preserves the meaning of cardinal directional relations as in Cardinal Directional Calculus (CDC), and provides solutions to all consistency checking problems in CDC under various conditions (i.e., for a complete/incomplete set of basic/disjunctive CDC constraints over connected/disconnected spatial objects). In particular, NCDC-ASP models a discretized version of the consistency checking problem in ASP, over a finite grid (rather than a plane), where we provide new lower bounds on the grid size to guarantee that it correctly characterizes solutions for the consistency checking in CDC. In addition, NCDC-ASP has the following two novelties important for applications. NCDC-ASP introduces default CDC constraints to represent and reason about background or commonsense knowledge that involves default qualitative directional relations (e.g., "the ice cream truck is by default to the north of the playground" or "the keyboard is normally placed in front of the monitor"). NCDC-ASP introduces inferred CDC constraints to allow inference of missing CDC relations and to provide them as explanations. We illustrate the uses and usefulness of NCDC-ASP with interesting scenarios from the real-world. We design and develop a variety of benchmark instances, and comprehensively evaluate NCDC-ASP from the perspectives of computational efficiency.
我们引入了一个正式的框架(称为NCDC-ASP)来表示和推理平面上扩展空间对象之间的基本方向,使用答案集编程(ASP)。NCDC-ASP保留了基数方向微积分(CDC)中基数方向关系的含义,并提供了各种条件下CDC中所有一致性检查问题的解决方案(即在连通/不连通空间对象上的基本/析取CDC约束的完整/不完整集合)。特别是,NCDC-ASP在有限网格(而不是平面)上对ASP中一致性检查问题的离散化版本进行了建模,其中我们提供了网格大小的新下界,以保证它正确表征了CDC中一致性检查的解。此外,NCDC-ASP还具有以下两个重要的应用新颖性。NCDC-ASP引入了默认的CDC约束来表示和推理涉及默认定性方向关系的背景或常识(例如,“冰淇淋卡车默认位于操场的北部”或“键盘通常位于显示器的前面”)。NCDC-ASP引入了推断的CDC约束,以允许对缺失的CDC关系进行推断并提供它们作为解释。我们用来自现实世界的有趣场景来说明NCDC-ASP的使用和有用性。我们设计并开发了多种基准实例,并从计算效率的角度对NCDC-ASP进行了综合评估。
{"title":"Qualitative Reasoning about 2D Cardinal Directions using Answer Set Programming","authors":"Yusuf Izmirlioglu, E. Erdem","doi":"10.1613/jair.1.14345","DOIUrl":"https://doi.org/10.1613/jair.1.14345","url":null,"abstract":"We introduce a formal framework (called NCDC-ASP) for representing and reasoning about cardinal directions between extended spatial objects on a plane, using Answer Set Programming (ASP). NCDC-ASP preserves the meaning of cardinal directional relations as in Cardinal Directional Calculus (CDC), and provides solutions to all consistency checking problems in CDC under various conditions (i.e., for a complete/incomplete set of basic/disjunctive CDC constraints over connected/disconnected spatial objects). In particular, NCDC-ASP models a discretized version of the consistency checking problem in ASP, over a finite grid (rather than a plane), where we provide new lower bounds on the grid size to guarantee that it correctly characterizes solutions for the consistency checking in CDC. In addition, NCDC-ASP has the following two novelties important for applications. NCDC-ASP introduces default CDC constraints to represent and reason about background or commonsense knowledge that involves default qualitative directional relations (e.g., \"the ice cream truck is by default to the north of the playground\" or \"the keyboard is normally placed in front of the monitor\"). NCDC-ASP introduces inferred CDC constraints to allow inference of missing CDC relations and to provide them as explanations. We illustrate the uses and usefulness of NCDC-ASP with interesting scenarios from the real-world. We design and develop a variety of benchmark instances, and comprehensively evaluate NCDC-ASP from the perspectives of computational efficiency.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76150143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Controllability of Temporal Plans in Uncertain and Partially Observable Environments 不确定和部分可观测环境下时间计划的动态可控性
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-08-08 DOI: 10.1613/jair.1.13065
Arthur Bit-Monnot, P. Morris
The formalism of Simple Temporal Networks (STNs) provides methods for evaluating the feasibility of temporal plans. The basic formalism deals with the consistency of quantitative temporal requirements on scheduled events. This implicitly assumes a single agent has full control over the timing of events. The extension of Simple Temporal Networks with Uncertainty (STNU) introduces uncertainty into the timing of some events. Two main approaches to the feasibility of STNUs involve (1) where a single schedule works irrespective of the duration outcomes, called Strong Controllability, and (2) whether a strategy exists to schedule future events based on the outcomes of past events, called Dynamic Controllability. Case (1) essentially assumes the timing of uncertain events cannot be observed by the agent while case (2) assumes full observability.The formalism of Partially Observable Simple Temporal Networks with Uncertainty (POSTNU) provides an intermediate stance between these two extremes, where a known subset of the uncertain events can be observed when they occur. A sound and complete polynomial algorithm to determining the Dynamic Controllability of POSTNUs has not previously been known; we present one in this paper. This answers an open problem that has been posed in the literature.The approach we take factors the problem into Strong Controllability micro-problems in an overall Dynamic Controllability macro-problem framework. It generalizes the notion of labeled distance graph from STNUs. The generalized labels are expressed as max/min expressions involving the observables. The paper introduces sound generalized reduction rules that act on the generalized labels. These incorporate tightenings based on observability that preserve dynamic viable strategies. It is shown that if the generalized reduction rules reach quiescence without exposing an inconsistency, then the POSTNU is Dynamically Controllable (DC). The paper also presents algorithms that apply the reduction rules in an organized way and reach quiescence in a polynomial number of steps if the POSTNU is Dynamically Controllable.Remarkably, the generalized perspective leads to a simpler and more uniform framework that applies also to the STNU special case. It helps illuminate the previous methods inasmuch as the max/min label representation is more semantically clear than the ad-hoc upper/lower case labels previously used.
简单时态网络(STNs)的形式化为评估时态规划的可行性提供了方法。基本的形式主义处理计划事件的定量时间需求的一致性。这隐含地假设单个代理完全控制事件的时间。具有不确定性的简单时间网络(STNU)的扩展将不确定性引入到某些事件的时序中。STNUs可行性的两种主要方法包括:(1)单个调度不考虑持续结果,称为强可控性;(2)是否存在基于过去事件的结果来调度未来事件的策略,称为动态可控性。情形(1)本质上假定不确定事件的时间不能被agent观察到,而情形(2)假定完全可观察。具有不确定性的部分可观察简单时间网络(POSTNU)的形式化提供了这两个极端之间的中间立场,其中当不确定事件发生时可以观察到已知子集。一种完善的多项式算法来确定POSTNUs的动态可控性,这在以前是未知的;我们在本文中提出了一个。这回答了文献中提出的一个开放性问题。该方法在整体动态可控性宏观问题框架中,将问题分解为强可控性微观问题。它推广了标记距离图的概念。广义标签表示为涉及可观测值的max/min表达式。本文介绍了作用于广义标签的健全的广义约简规则。这些措施包括基于可观察性的收紧措施,以保持动态可行的策略。证明了如果广义约简规则达到静止状态而不暴露不一致性,则POSTNU是动态可控的(DC)。本文还提出了在POSTNU动态可控的情况下,有组织地应用约简规则的算法,并在多项式步数内达到静态。值得注意的是,广义的观点导致了一个更简单、更统一的框架,也适用于上海师范大学的特殊情况。它有助于阐明以前的方法,因为最大/最小标签表示比以前使用的特别的大写/小写标签在语义上更清晰。
{"title":"Dynamic Controllability of Temporal Plans in Uncertain and Partially Observable Environments","authors":"Arthur Bit-Monnot, P. Morris","doi":"10.1613/jair.1.13065","DOIUrl":"https://doi.org/10.1613/jair.1.13065","url":null,"abstract":"The formalism of Simple Temporal Networks (STNs) provides methods for evaluating the feasibility of temporal plans. The basic formalism deals with the consistency of quantitative temporal requirements on scheduled events. This implicitly assumes a single agent has full control over the timing of events. The extension of Simple Temporal Networks with Uncertainty (STNU) introduces uncertainty into the timing of some events. Two main approaches to the feasibility of STNUs involve (1) where a single schedule works irrespective of the duration outcomes, called Strong Controllability, and (2) whether a strategy exists to schedule future events based on the outcomes of past events, called Dynamic Controllability. Case (1) essentially assumes the timing of uncertain events cannot be observed by the agent while case (2) assumes full observability.\u0000The formalism of Partially Observable Simple Temporal Networks with Uncertainty (POSTNU) provides an intermediate stance between these two extremes, where a known subset of the uncertain events can be observed when they occur. A sound and complete polynomial algorithm to determining the Dynamic Controllability of POSTNUs has not previously been known; we present one in this paper. This answers an open problem that has been posed in the literature.\u0000The approach we take factors the problem into Strong Controllability micro-problems in an overall Dynamic Controllability macro-problem framework. It generalizes the notion of labeled distance graph from STNUs. The generalized labels are expressed as max/min expressions involving the observables. The paper introduces sound generalized reduction rules that act on the generalized labels. These incorporate tightenings based on observability that preserve dynamic viable strategies. It is shown that if the generalized reduction rules reach quiescence without exposing an inconsistency, then the POSTNU is Dynamically Controllable (DC). The paper also presents algorithms that apply the reduction rules in an organized way and reach quiescence in a polynomial number of steps if the POSTNU is Dynamically Controllable.\u0000Remarkably, the generalized perspective leads to a simpler and more uniform framework that applies also to the STNU special case. It helps illuminate the previous methods inasmuch as the max/min label representation is more semantically clear than the ad-hoc upper/lower case labels previously used.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88387302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Program Synthesis with Best-First Bottom-Up Search 基于最佳优先自底向上搜索的程序综合
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-08-02 DOI: 10.1613/jair.1.14394
Saqib Ameen, Levi H. S. Lelis
Cost-guided bottom-up search (BUS) algorithms use a cost function to guide the search to solve program synthesis tasks. In this paper, we show that current state-of-the-art cost-guided BUS algorithms suffer from a common problem: they can lose useful information given by the model and fail to perform the search in a best-first order according to a cost function. We introduce a novel best-first bottom-up search algorithm, which we call Bee Search, that does not suffer information loss and is able to perform cost-guided bottom-up synthesis in a best-first manner. Importantly, Bee Search performs best-first search with respect to the generation of programs, i.e., it does not even create in memory programs that are more expensive than the solution program. It attains best-first ordering with respect to generation by performing a search in an abstract space of program costs. We also introduce a new cost function that better uses the information provided by an existing cost model. Empirical results on string manipulation and bit-vector tasks show that Bee Search can outperform existing cost-guided BUS approaches when employing more complex domain-specific languages (DSLs); Bee Search and previous approaches perform equally well with simpler DSLs. Furthermore, our new cost function with Bee Search outperforms previous cost functions on string manipulation tasks.
成本导向自底向上搜索(BUS)算法使用成本函数来引导搜索以解决程序合成任务。在本文中,我们证明了当前最先进的成本导向总线算法存在一个常见问题:它们可能会丢失模型给出的有用信息,并且无法根据成本函数以最佳优先顺序执行搜索。我们引入了一种新颖的最佳优先自下而上搜索算法,我们称之为蜜蜂搜索,它不会遭受信息丢失,并且能够以最佳优先的方式执行成本引导的自下而上合成。重要的是,Bee Search在程序生成方面执行最佳优先搜索,也就是说,它甚至不会在内存中创建比解决方案程序更昂贵的程序。它通过在程序成本的抽象空间中执行搜索来获得关于生成的最佳优先排序。我们还引入了一个新的成本函数,它可以更好地利用现有成本模型提供的信息。字符串操作和位向量任务的实证结果表明,当使用更复杂的领域特定语言(dsl)时,Bee Search可以优于现有的成本导向总线方法;蜜蜂搜索和以前的方法在更简单的dsl中表现同样出色。此外,我们使用Bee Search的新成本函数在字符串操作任务上优于以前的成本函数。
{"title":"Program Synthesis with Best-First Bottom-Up Search","authors":"Saqib Ameen, Levi H. S. Lelis","doi":"10.1613/jair.1.14394","DOIUrl":"https://doi.org/10.1613/jair.1.14394","url":null,"abstract":"Cost-guided bottom-up search (BUS) algorithms use a cost function to guide the search to solve program synthesis tasks. In this paper, we show that current state-of-the-art cost-guided BUS algorithms suffer from a common problem: they can lose useful information given by the model and fail to perform the search in a best-first order according to a cost function. We introduce a novel best-first bottom-up search algorithm, which we call Bee Search, that does not suffer information loss and is able to perform cost-guided bottom-up synthesis in a best-first manner. Importantly, Bee Search performs best-first search with respect to the generation of programs, i.e., it does not even create in memory programs that are more expensive than the solution program. It attains best-first ordering with respect to generation by performing a search in an abstract space of program costs. We also introduce a new cost function that better uses the information provided by an existing cost model. Empirical results on string manipulation and bit-vector tasks show that Bee Search can outperform existing cost-guided BUS approaches when employing more complex domain-specific languages (DSLs); Bee Search and previous approaches perform equally well with simpler DSLs. Furthermore, our new cost function with Bee Search outperforms previous cost functions on string manipulation tasks.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83884478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complexity of Computing the Shapley Value in Partition Function Form Games 配分函数形式博弈中Shapley值计算的复杂性
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-08-02 DOI: 10.1613/jair.1.14648
Oskar Skibski
We study the complexity of computing the Shapley value in partition function form games. We focus on two representations based on marginal contribution nets (embedded MC-nets and weighted MC-nets) and five extensions of the Shapley value. Our results show that while weighted MC-nets are more concise than embedded MC-nets, they have slightly worse computational properties when it comes to computing the Shapley value: two out of five extensions can be computed in polynomial time for embedded MC-nets and only one for weighted MC-nets.
研究了配分函数形式博弈中Shapley值的计算复杂性。我们重点讨论了基于边际贡献网的两种表示(嵌入式MC-nets和加权MC-nets)和Shapley值的五种扩展。我们的研究结果表明,虽然加权MC-nets比嵌入式MC-nets更简洁,但在计算Shapley值时,它们的计算性能略差:嵌入式MC-nets可以在多项式时间内计算5个扩展中的2个,而加权MC-nets只能在多项式时间内计算1个。
{"title":"Complexity of Computing the Shapley Value in Partition Function Form Games","authors":"Oskar Skibski","doi":"10.1613/jair.1.14648","DOIUrl":"https://doi.org/10.1613/jair.1.14648","url":null,"abstract":"We study the complexity of computing the Shapley value in partition function form games. We focus on two representations based on marginal contribution nets (embedded MC-nets and weighted MC-nets) and five extensions of the Shapley value. Our results show that while weighted MC-nets are more concise than embedded MC-nets, they have slightly worse computational properties when it comes to computing the Shapley value: two out of five extensions can be computed in polynomial time for embedded MC-nets and only one for weighted MC-nets.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72378789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Select and Augment: Enhanced Dense Retrieval Knowledge Graph Augmentation 选择和增强:增强的密集检索知识图增强
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-07-28 DOI: 10.48550/arXiv.2307.15776
Micheal Abaho, Yousef H. Alfaifi
Injecting textual information into knowledge graph (KG) entity representations has been a worthwhile expedition in terms of improving performance in KG oriented tasks within the NLP community. External knowledge often adopted to enhance KG embeddings ranges from semantically rich lexical dependency parsed features to a set of relevant key words to entire text descriptions supplied from an external corpus such as wikipedia and many more. Despite the gains this innovation (Text-enhanced KG embeddings) has made, the proposal in this work suggests that it can be improved even further. Instead of using a single text description (which would not sufficiently represent an entity because of the inherent lexical ambiguity of text), we propose a multi-task framework that jointly selects a set of text descriptions relevant to KG entities as well as align or augment KG embeddings with text descriptions. Different from prior work that plugs formal entity descriptions declared in knowledge bases, this framework leverages a retriever model to selectively identify richer or highly relevant text descriptions to use in augmenting entities. Furthermore, the framework treats the number of descriptions to use in augmentation process as a parameter, which allows the flexibility of enumerating across several numbers before identifying an appropriate number. Experiment results for Link Prediction demonstrate a 5.5% and 3.5% percentage increase in the Mean Reciprocal Rank (MRR) and Hits@10 scores respectively, in comparison to text-enhanced knowledge graph augmentation methods using traditional CNNs.
在NLP社区中,将文本信息注入知识图(KG)实体表示是提高面向KG任务性能的一项有价值的探索。通常用于增强KG嵌入的外部知识范围从语义丰富的词汇依赖解析特征到一组相关关键字,再到外部语料库(如wikipedia等)提供的完整文本描述。尽管这种创新(文本增强的KG嵌入)已经取得了进展,但这项工作中的建议表明它可以进一步改进。代替使用单一的文本描述(由于文本固有的词汇歧义,不能充分代表实体),我们提出了一个多任务框架,共同选择一组与KG实体相关的文本描述,并将KG嵌入与文本描述对齐或增强。与之前插入知识库中声明的正式实体描述的工作不同,该框架利用检索器模型有选择地识别更丰富或高度相关的文本描述,以用于扩展实体。此外,框架将在增强过程中使用的描述数量作为参数,这允许在确定适当的数字之前枚举多个数字的灵活性。实验结果表明,与使用传统cnn的文本增强知识图增强方法相比,链接预测的平均倒数秩(MRR)和Hits@10分数分别提高了5.5%和3.5%。
{"title":"Select and Augment: Enhanced Dense Retrieval Knowledge Graph Augmentation","authors":"Micheal Abaho, Yousef H. Alfaifi","doi":"10.48550/arXiv.2307.15776","DOIUrl":"https://doi.org/10.48550/arXiv.2307.15776","url":null,"abstract":"Injecting textual information into knowledge graph (KG) entity representations has been a worthwhile expedition in terms of improving performance in KG oriented tasks within the NLP community. External knowledge often adopted to enhance KG embeddings ranges from semantically rich lexical dependency parsed features to a set of relevant key words to entire text descriptions supplied from an external corpus such as wikipedia and many more. Despite the gains this innovation (Text-enhanced KG embeddings) has made, the proposal in this work suggests that it can be improved even further. Instead of using a single text description (which would not sufficiently represent an entity because of the inherent lexical ambiguity of text), we propose a multi-task framework that jointly selects a set of text descriptions relevant to KG entities as well as align or augment KG embeddings with text descriptions. Different from prior work that plugs formal entity descriptions declared in knowledge bases, this framework leverages a retriever model to selectively identify richer or highly relevant text descriptions to use in augmenting entities. Furthermore, the framework treats the number of descriptions to use in augmentation process as a parameter, which allows the flexibility of enumerating across several numbers before identifying an appropriate number. Experiment results for Link Prediction demonstrate a 5.5% and 3.5% percentage increase in the Mean Reciprocal Rank (MRR) and Hits@10 scores respectively, in comparison to text-enhanced knowledge graph augmentation methods using traditional CNNs.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84279768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Information Lattice Learning 信息点阵学习
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-07-13 DOI: 10.1613/jair.1.14277
Haizi Yu, James A. Evans, L. Varshney
We propose Information Lattice Learning (ILL) as a general framework to learn rules of a signal (e.g., an image or a probability distribution). In our definition, a rule is a coarsened signal used to help us gain one interpretable insight about the original signal. To make full sense of what might govern the signal’s intrinsic structure, we seek multiple disentangled rules arranged in a hierarchy, called a lattice. Compared to representation/rule-learning models optimized for a specific task (e.g., classification), ILL focuses on explainability: it is designed to mimic human experiential learning and discover rules akin to those humans can distill and comprehend. This paper details the math and algorithms of ILL, and illustrates how it addresses the fundamental question “what makes X an X” by creating rule-based explanations designed to help humans understand. Our focus is on explaining X rather than (re)generating it. We present applications in knowledge discovery, using ILL to distill music theory from scores and chemical laws from molecules and further revealing connections between them. We show ILL’s efficacy and interpretability on benchmarks and assessments, as well as a demonstration of ILL-enhanced classifiers achieving human-level digit recognition using only one or a few MNIST training examples (1–10 per class).
我们提出信息点阵学习(Information Lattice Learning, ILL)作为学习信号(如图像或概率分布)规则的一般框架。在我们的定义中,规则是一个粗化的信号,用于帮助我们获得关于原始信号的一个可解释的见解。为了充分理解是什么控制了信号的内在结构,我们寻找排列成层次结构的多个不纠缠的规则,称为晶格。与针对特定任务(例如分类)优化的表示/规则学习模型相比,ILL侧重于可解释性:它旨在模仿人类体验式学习,并发现类似于人类可以提炼和理解的规则。本文详细介绍了人工智能的数学和算法,并说明了它如何通过创建旨在帮助人类理解的基于规则的解释来解决“什么使X成为X”的基本问题。我们的重点是解释X,而不是(重新)生成它。我们展示了在知识发现方面的应用,使用ILL从乐谱中提取音乐理论,从分子中提取化学定律,并进一步揭示它们之间的联系。我们在基准和评估中展示了ILL的有效性和可解释性,并演示了仅使用一个或几个MNIST训练示例(每类1-10个)就可以实现人类水平的数字识别的ILL增强分类器。
{"title":"Information Lattice Learning","authors":"Haizi Yu, James A. Evans, L. Varshney","doi":"10.1613/jair.1.14277","DOIUrl":"https://doi.org/10.1613/jair.1.14277","url":null,"abstract":"We propose Information Lattice Learning (ILL) as a general framework to learn rules of a signal (e.g., an image or a probability distribution). In our definition, a rule is a coarsened signal used to help us gain one interpretable insight about the original signal. To make full sense of what might govern the signal’s intrinsic structure, we seek multiple disentangled rules arranged in a hierarchy, called a lattice. Compared to representation/rule-learning models optimized for a specific task (e.g., classification), ILL focuses on explainability: it is designed to mimic human experiential learning and discover rules akin to those humans can distill and comprehend. This paper details the math and algorithms of ILL, and illustrates how it addresses the fundamental question “what makes X an X” by creating rule-based explanations designed to help humans understand. Our focus is on explaining X rather than (re)generating it. We present applications in knowledge discovery, using ILL to distill music theory from scores and chemical laws from molecules and further revealing connections between them. We show ILL’s efficacy and interpretability on benchmarks and assessments, as well as a demonstration of ILL-enhanced classifiers achieving human-level digit recognition using only one or a few MNIST training examples (1–10 per class).","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87214528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
SAlign: A Graph Neural Attention Framework for Aligning Structurally Heterogeneous Networks SAlign:一种结构异构网络对齐的图神经注意框架
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-07-12 DOI: 10.1613/jair.1.14427
S. Saxena, Joydeep Chandra
Network alignment techniques that map the same entities across multiple networks assume that the mapping nodes in two different networks have similar attributes and neighborhood proximity. However, real-world networks often violate such assumptions, having diverse attributes and structural properties. Node mapping across such structurally heterogeneous networks remains a challenge. Although capturing the nodes’ entire neighborhood (in low-dimensional embeddings) may help deal with these characteristic differences, the issue of over-smoothing in the representations that come from higherorder learning still remains a major problem. To address the above concerns, we propose SAlign: a supervised graph neural attention framework for aligning structurally heterogeneous networks that learns the correlation of structural properties of mapping nodes using a set of labeled (mapped) anchor nodes. SAlign incorporates nodes’ graphlet information with a novel structure-aware cross-network attention mechanism that transfers the required higher-order structure information across networks. The information exchanged across networks helps in enhancing the expressivity of the graph neural network, thereby handling any potential over-smoothing problem. Extensive experiments on three real datasets demonstrate that SAlign consistently outperforms the state-of-the-art network alignment methods by at least 1.3-8% in terms of accuracy score. The code is available at https://github.com/shruti400/SAlign for reproducibility.
跨多个网络映射相同实体的网络对齐技术假设两个不同网络中的映射节点具有相似的属性和邻域接近性。然而,现实世界的网络往往违反这样的假设,具有不同的属性和结构属性。跨这种结构异构网络的节点映射仍然是一个挑战。尽管捕获节点的整个邻域(在低维嵌入中)可能有助于处理这些特征差异,但来自高阶学习的表示中的过度平滑问题仍然是一个主要问题。为了解决上述问题,我们提出了SAlign:一个用于对齐结构异构网络的监督图神经注意框架,该框架使用一组标记(映射)锚节点学习映射节点结构属性的相关性。SAlign将节点的graphlet信息与一种新颖的结构感知跨网络注意机制相结合,该机制可以跨网络传输所需的高阶结构信息。网络间的信息交换有助于增强图神经网络的表达能力,从而处理任何潜在的过平滑问题。在三个真实数据集上进行的大量实验表明,SAlign在准确率得分方面始终优于最先进的网络对齐方法至少1.3-8%。该代码可在https://github.com/shruti400/SAlign上获得,以获得再现性。
{"title":"SAlign: A Graph Neural Attention Framework for Aligning Structurally Heterogeneous Networks","authors":"S. Saxena, Joydeep Chandra","doi":"10.1613/jair.1.14427","DOIUrl":"https://doi.org/10.1613/jair.1.14427","url":null,"abstract":"Network alignment techniques that map the same entities across multiple networks assume that the mapping nodes in two different networks have similar attributes and neighborhood proximity. However, real-world networks often violate such assumptions, having diverse attributes and structural properties. Node mapping across such structurally heterogeneous networks remains a challenge. Although capturing the nodes’ entire neighborhood (in low-dimensional embeddings) may help deal with these characteristic differences, the issue of over-smoothing in the representations that come from higherorder learning still remains a major problem. To address the above concerns, we propose SAlign: a supervised graph neural attention framework for aligning structurally heterogeneous networks that learns the correlation of structural properties of mapping nodes using a set of labeled (mapped) anchor nodes. SAlign incorporates nodes’ graphlet information with a novel structure-aware cross-network attention mechanism that transfers the required higher-order structure information across networks. The information exchanged across networks helps in enhancing the expressivity of the graph neural network, thereby handling any potential over-smoothing problem. Extensive experiments on three real datasets demonstrate that SAlign consistently outperforms the state-of-the-art network alignment methods by at least 1.3-8% in terms of accuracy score. The code is available at https://github.com/shruti400/SAlign for reproducibility.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81554085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Dynamics in Structured Argumentation Formalisms 论结构化论证形式主义的动态性
3区 计算机科学 Q2 Computer Science Pub Date : 2023-06-29 DOI: 10.1613/jair.1.14481
Anna Rapberger, Markus Ulbricht
This paper is a contribution to the research on dynamics in assumption-based argumentation (ABA). We investigate situations where a given knowledge base undergoes certain changes. We show that two frequently investigated problems, namely enforcement of a given target atom and deciding strong equivalence of two given ABA frameworks, are intractable in general. Notably, these problems are both tractable for abstract argumentation frameworks (AFs) which admit a close correspondence to ABA by constructing semanticspreserving instances. Inspired by this observation, we search for tractable fragments for ABA frameworks by means of the instantiated AFs. We argue that the usual instantiation procedure is not suitable for the investigation of dynamic scenarios since too much information is lost when constructing the abstract framework. We thus consider an extension of AFs, called cvAFs, equipping arguments with conclusions and vulnerabilities in order to better anticipate their role after the underlying knowledge base is extended. We investigate enforcement and strong equivalence for cvAFs and present syntactic conditions to decide them. We show that the correspondence between cvAFs and ABA frameworks is close enough to capture dynamics in ABA. This yields the desired tractable fragment. We furthermore discuss consequences for the corresponding problems for logic programs.
本文是对基于假设的论证动力学研究的一项贡献。我们研究给定知识库发生某些变化的情况。我们证明了两个经常研究的问题,即给定目标原子的强制执行和确定两个给定ABA框架的强等效性,通常是难以解决的。值得注意的是,对于抽象论证框架(AFs)来说,这些问题都是可处理的,抽象论证框架通过构造语义保留实例来承认与ABA的密切对应。受到这一观察结果的启发,我们通过实例化的AFs来搜索ABA框架的可处理片段。我们认为通常的实例化过程不适合动态场景的研究,因为在构建抽象框架时丢失了太多的信息。因此,我们考虑了AFs的扩展,称为cvAFs,为论证配备结论和漏洞,以便在基础知识库扩展后更好地预测它们的作用。我们研究了cvAFs的强制和强等价性,并给出了决定它们的语法条件。我们表明,cvAFs和ABA框架之间的对应关系足够接近,可以捕获ABA中的动态。这将产生所需的可处理片段。我们进一步讨论了逻辑程序相应问题的结果。
{"title":"On Dynamics in Structured Argumentation Formalisms","authors":"Anna Rapberger, Markus Ulbricht","doi":"10.1613/jair.1.14481","DOIUrl":"https://doi.org/10.1613/jair.1.14481","url":null,"abstract":"This paper is a contribution to the research on dynamics in assumption-based argumentation (ABA). We investigate situations where a given knowledge base undergoes certain changes. We show that two frequently investigated problems, namely enforcement of a given target atom and deciding strong equivalence of two given ABA frameworks, are intractable in general. Notably, these problems are both tractable for abstract argumentation frameworks (AFs) which admit a close correspondence to ABA by constructing semanticspreserving instances. Inspired by this observation, we search for tractable fragments for ABA frameworks by means of the instantiated AFs. We argue that the usual instantiation procedure is not suitable for the investigation of dynamic scenarios since too much information is lost when constructing the abstract framework. We thus consider an extension of AFs, called cvAFs, equipping arguments with conclusions and vulnerabilities in order to better anticipate their role after the underlying knowledge base is extended. We investigate enforcement and strong equivalence for cvAFs and present syntactic conditions to decide them. We show that the correspondence between cvAFs and ABA frameworks is close enough to capture dynamics in ABA. This yields the desired tractable fragment. We furthermore discuss consequences for the corresponding problems for logic programs.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134997936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Markov Framework for Learning and Reasoning About Strategies in Professional Soccer 职业足球策略学习与推理的马尔可夫框架
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-06-19 DOI: 10.1613/jair.1.13934
Maaike Van Roy, Pieter Robberechts, Wen-Chi Yang, L. D. Raedt, Jesse Davis
Strategy-optimization is a fundamental element of dynamic and complex team sports such as soccer, American football, and basketball. As the amount of data that is collected from matches in these sports has increased, so has the demand for data-driven decisionmaking support. If alternative strategies need to be balanced, a data-driven approach can uncover insights that are not available from qualitative analysis. This could tremendously aid teams in their match preparations. In this work, we propose a novel Markov modelbased framework for soccer that allows reasoning about the specific strategies teams use in order to gain insights into the efficiency of each strategy. The framework consists of two components: (1) a learning component, which entails modeling a team’s offensive behavior by learning a Markov decision process (MDP) from event data that is collected from the team’s matches, and (2) a reasoning component, which involves a novel application of probabilistic model checking to reason about the efficacy of the learned strategies of each team. In this paper, we provide an overview of this framework and illustrate it on several use cases using real-world event data from three leagues. Our results show that the framework can be used to reason about the shot decision-making of teams and to optimise the defensive strategies used when playing against a particular team. The general ideas presented in this framework can easily be extended to other sports.
策略优化是动态和复杂团队运动(如足球、美式足球和篮球)的基本元素。随着从这些运动的比赛中收集的数据量的增加,对数据驱动的决策支持的需求也在增加。如果需要平衡备选策略,那么数据驱动的方法可以揭示定性分析无法获得的见解。这可以极大地帮助球队准备比赛。在这项工作中,我们提出了一个新的基于马尔可夫模型的足球框架,该框架允许对球队使用的特定策略进行推理,以便深入了解每种策略的效率。该框架由两个部分组成:(1)学习部分,它需要通过从团队比赛中收集的事件数据中学习马尔可夫决策过程(MDP)来建模团队的进攻行为;(2)推理部分,它涉及到概率模型检查的新应用,以推理每个团队学习策略的有效性。在本文中,我们提供了该框架的概述,并使用来自三个联盟的真实事件数据在几个用例中说明了它。我们的研究结果表明,该框架可以用来推理球队的射门决策,并优化与特定球队比赛时使用的防守策略。在这个框架中提出的一般思想可以很容易地扩展到其他运动。
{"title":"A Markov Framework for Learning and Reasoning About Strategies in Professional Soccer","authors":"Maaike Van Roy, Pieter Robberechts, Wen-Chi Yang, L. D. Raedt, Jesse Davis","doi":"10.1613/jair.1.13934","DOIUrl":"https://doi.org/10.1613/jair.1.13934","url":null,"abstract":"Strategy-optimization is a fundamental element of dynamic and complex team sports such as soccer, American football, and basketball. As the amount of data that is collected from matches in these sports has increased, so has the demand for data-driven decisionmaking support. If alternative strategies need to be balanced, a data-driven approach can uncover insights that are not available from qualitative analysis. This could tremendously aid teams in their match preparations. In this work, we propose a novel Markov modelbased framework for soccer that allows reasoning about the specific strategies teams use in order to gain insights into the efficiency of each strategy. The framework consists of two components: (1) a learning component, which entails modeling a team’s offensive behavior by learning a Markov decision process (MDP) from event data that is collected from the team’s matches, and (2) a reasoning component, which involves a novel application of probabilistic model checking to reason about the efficacy of the learned strategies of each team. In this paper, we provide an overview of this framework and illustrate it on several use cases using real-world event data from three leagues. Our results show that the framework can be used to reason about the shot decision-making of teams and to optimise the defensive strategies used when playing against a particular team. The general ideas presented in this framework can easily be extended to other sports.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76839783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stackelberg Security Games with Contagious Attacks on a Network: Reallocation to the Rescue 网络上具有传染性攻击的Stackelberg安全游戏:重新分配救援
IF 5 3区 计算机科学 Q2 Computer Science Pub Date : 2023-06-19 DOI: 10.1613/jair.1.14563
Rufan Bai, Haoxing Lin, Xinyu Yang, Xiaowei Wu, Minming Li, Weijia Jia
In the classic network security games, the defender distributes defending resources to the nodes of the network, and the attacker attacks a node, with the objective of maximizing the damage caused. In this paper, we consider the network defending problem against contagious attacks, e.g., the attack at a node u spreads to the neighbors of u and can cause damage at multiple nodes. Existing works that study shared resources assume that the resource allocated to a node can be shared or duplicated between neighboring nodes. However, in the real world, sharing resource naturally leads to a decrease in defending power of the source node, especially when defending against contagious attacks. Therefore, we study the model in which resources allocated to a node can only be transferred to its neighboring nodes, which we refer to as a reallocation process. We show that the problem of computing optimal defending strategy is NP-hard even for some very special cases. For positive results, we give a mixed integer linear program formulation for the problem and a bi-criteria approximation algorithm. Our experimental results demonstrate that the allocation and reallocation strategies our algorithm computes perform well in terms of minimizing the damage due to contagious attacks.
在经典的网络安全博弈中,防御方将防御资源分配到网络的节点上,攻击方攻击某一个节点,目的是将造成的损害最大化。在本文中,我们考虑网络对传染性攻击的防御问题,例如节点u上的攻击会扩散到u的邻居,并可能造成多个节点的破坏。现有研究共享资源的工作假设分配给一个节点的资源可以在相邻节点之间共享或复制。然而,在现实世界中,资源共享自然会导致源节点的防御能力下降,尤其是在防御传染性攻击时。因此,我们研究分配给一个节点的资源只能转移到相邻节点的模型,我们称之为再分配过程。我们证明,即使在一些非常特殊的情况下,计算最优防御策略的问题也是np困难的。对于积极的结果,我们给出了问题的混合整数线性规划形式和双准则逼近算法。我们的实验结果表明,我们的算法计算的分配和再分配策略在最小化传染性攻击造成的损害方面表现良好。
{"title":"Stackelberg Security Games with Contagious Attacks on a Network: Reallocation to the Rescue","authors":"Rufan Bai, Haoxing Lin, Xinyu Yang, Xiaowei Wu, Minming Li, Weijia Jia","doi":"10.1613/jair.1.14563","DOIUrl":"https://doi.org/10.1613/jair.1.14563","url":null,"abstract":"In the classic network security games, the defender distributes defending resources to the nodes of the network, and the attacker attacks a node, with the objective of maximizing the damage caused. In this paper, we consider the network defending problem against contagious attacks, e.g., the attack at a node u spreads to the neighbors of u and can cause damage at multiple nodes. Existing works that study shared resources assume that the resource allocated to a node can be shared or duplicated between neighboring nodes. However, in the real world, sharing resource naturally leads to a decrease in defending power of the source node, especially when defending against contagious attacks. Therefore, we study the model in which resources allocated to a node can only be transferred to its neighboring nodes, which we refer to as a reallocation process. We show that the problem of computing optimal defending strategy is NP-hard even for some very special cases. For positive results, we give a mixed integer linear program formulation for the problem and a bi-criteria approximation algorithm. Our experimental results demonstrate that the allocation and reallocation strategies our algorithm computes perform well in terms of minimizing the damage due to contagious attacks.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2023-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89190543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Artificial Intelligence Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1