Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems最新文献

英文中文

Convergence of Datalog over (Pre-) Semirings 数据在(预)半环上的收敛性

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2021-05-30 DOI: 10.1145/3517804.3524140

Mahmoud Abo Khamis, H. Ngo, R. Pichler, Dan Suciu, Y. Wang

Recursive queries have been traditionally studied in the framework of datalog, a language that restricts recursion to monotone queries over sets, which is guaranteed to converge in polynomial time in the size of the input. But modern big data systems require recursive computations beyond the Boolean space. In this paper we study the convergence of datalog when it is interpreted over an arbitrary semiring. We consider an ordered semiring, define the semantics of a datalog program as a least fixpoint in this semiring, and study the number of steps required to reach that fixpoint, if ever. We identify algebraic properties of the semiring that correspond to certain convergence properties of datalog programs. Finally, we describe a class of ordered semirings on which one can use the semi-naive evaluation algorithm on any datalog program.

递归查询传统上是在datalog的框架下研究的，这种语言将递归限制为对集合的单调查询，保证在输入大小的多项式时间内收敛。但现代大数据系统需要超越布尔空间的递归计算。本文研究了在任意半环上解释数据表时的收敛性。我们考虑一个有序半环，将数据程序的语义定义为该半环中的最小不动点，并研究到达该不动点所需的步骤数(如果有的话)。我们确定了与数据规划的某些收敛性质相对应的半环的代数性质。最后，我们描述了一类有序半环，在这些半环上可以对任意数据程序使用半朴素求值算法。

引用次数: 2

A Nearly Instance-optimal Differentially Private Mechanism for Conjunctive Queries 一种近乎实例最优的联合查询差分私有机制

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2021-05-12 DOI: 10.1145/3517804.3524143

Wei Dong, K. Yi

Releasing the result size of conjunctive queries and graph pattern queries under differential privacy (DP) has received considerable attention in the literature, but existing solutions do not offer any optimality guarantees. We provide the first DP mechanism for this problem with a fairly strong notion of optimality, which can be considered as a natural relaxation of instance-optimality to a constant.

在差分隐私(DP)下释放联合查询和图模式查询的结果大小在文献中受到了相当大的关注，但是现有的解决方案并没有提供任何最优性保证。我们为这个问题提供了具有相当强的最优性概念的第一种DP机制，它可以被认为是实例最优性的自然松弛到一个常数。

引用次数: 12

Approximately Counting Answers to Conjunctive Queries with Disequalities and Negations 带有不等式和否定的连接查询的近似计数答案

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2021-03-23 DOI: 10.1145/3517804.3526231

Jacob Focke, L. A. Goldberg, M. Roth, Stanislav Živný

We study the complexity of approximating the number of answers to a small query φ in a large database D. We establish an exhaustive classification into tractable and intractable cases if φ is a conjunctive query possibly including disequalities and negations: - If there is a constant bound on the arity of φ, and if the randomised Exponential Time Hypothesis (rETH) holds, then the problem has a fixed-parameter tractable approximation scheme (FPTRAS) if and only if the treewidth of φ is bounded. - If the arity is unbounded and φ does not have negations, then the problem has an FPTRAS if and only if the adaptive width of φ (a width measure strictly more general than treewidth) is bounded; the lower bound relies on the rETH as well. Additionally we show that our results cannot be strengthened to achieve a fully polynomial randomised approximation scheme (FPRAS): We observe that, unless NP=RP, there is no FPRAS even if the treewidth (and the adaptive width) is 1. However, if there are neither disequalities nor negations, we prove the existence of an FPRAS for queries of bounded fractional hypertreewidth, strictly generalising the recently established FPRAS for conjunctive queries with bounded hypertreewidth due to Arenas, Croquevielle, Jayaram and Riveros (STOC 2021).

我们研究的复杂性近似回答一个小数量的大型数据库中查询φd我们建立一个详尽的分类成容易处理的和棘手的案件如果φ是连接查询可能包括disequalities和否定:——如果有一个常数φ的参数数量,如果随机指数时间假说(rETH)成立,那么问题有一个固定参数可近似方案(FPTRAS)当且仅当的treewidthφ是有界的。-如果arity是无界的，φ没有负数，那么问题有一个FPTRAS当且仅当φ的自适应宽度(一个宽度测量严格比treewidth更一般)是有界的;下限也依赖于rETH。此外，我们表明我们的结果不能被加强以实现完全多项式随机化近似方案(FPRAS):我们观察到，除非NP=RP，否则即使树宽(和自适应宽度)为1，也没有FPRAS。然而，如果既不存在不等式也不存在否定，我们证明了有界分数超树宽查询的FPRAS的存在性，严格推广了最近由于Arenas, Croquevielle, Jayaram和Riveros (STOC 2021)而建立的具有有界超树宽的联合查询的FPRAS。

{"title":"Approximately Counting Answers to Conjunctive Queries with Disequalities and Negations","authors":"Jacob Focke, L. A. Goldberg, M. Roth, Stanislav Živný","doi":"10.1145/3517804.3526231","DOIUrl":"https://doi.org/10.1145/3517804.3526231","url":null,"abstract":"We study the complexity of approximating the number of answers to a small query φ in a large database D. We establish an exhaustive classification into tractable and intractable cases if φ is a conjunctive query possibly including disequalities and negations: - If there is a constant bound on the arity of φ, and if the randomised Exponential Time Hypothesis (rETH) holds, then the problem has a fixed-parameter tractable approximation scheme (FPTRAS) if and only if the treewidth of φ is bounded. - If the arity is unbounded and φ does not have negations, then the problem has an FPTRAS if and only if the adaptive width of φ (a width measure strictly more general than treewidth) is bounded; the lower bound relies on the rETH as well. Additionally we show that our results cannot be strengthened to achieve a fully polynomial randomised approximation scheme (FPRAS): We observe that, unless NP=RP, there is no FPRAS even if the treewidth (and the adaptive width) is 1. However, if there are neither disequalities nor negations, we prove the existence of an FPRAS for queries of bounded fractional hypertreewidth, strictly generalising the recently established FPRAS for conjunctive queries with bounded hypertreewidth due to Arenas, Croquevielle, Jayaram and Riveros (STOC 2021).","PeriodicalId":230606,"journal":{"name":"Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121511856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

On the Parameterized Complexity of Learning First-Order Logic 一阶逻辑学习的参数化复杂度研究

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2021-02-24 DOI: 10.1145/3517804.3524151

Steffen van Bergerem, Martin Grohe, Martin Ritzert

We analyse the complexity of learning first-order queries in a model-theoretic framework for supervised learning introduced by (Grohe and Turán, TOCS 2004). Previous research on the complexity of learning in this framework focussed on the question of when learning is possible in time sublinear in the background structure. Here we study the parameterized complexity of the learning problem. We have two main results. The first is a hardness result, showing that learning first-order queries is at least as hard as the corresponding model-checking problem, which implies that on general structures it is hard for the parameterized complexity class AW[*]. Our second main contribution is a fixed-parameter tractable agnostic PAC learning algorithm for first-order queries over sparse relational data (more precisely, over nowhere dense background structures).

我们分析了由(Grohe和Turán, TOCS 2004)引入的模型理论框架中学习一阶查询的复杂性。在此框架下，以往关于学习复杂性的研究主要集中在时间亚线性背景结构下何时学习是可能的。这里我们研究了学习问题的参数化复杂度。我们有两个主要的结果。第一个是硬度结果，表明学习一阶查询至少与相应的模型检查问题一样困难，这意味着在一般结构上，参数化复杂性类AW[*]很难。我们的第二个主要贡献是针对稀疏关系数据(更准确地说，针对无处密集的背景结构)的一阶查询的固定参数可处理不可知PAC学习算法。

引用次数: 5

A Journey to the Frontiers of Query Rewritability 查询可重写性的前沿之旅

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2020-12-21 DOI: 10.1145/3517804.3524163

Piotr Ostropolski-Nalewaja, J. Marcinkowski, David Carral, S. Rudolph

We consider (first-order) query rewritability in the context of theory-mediated query answering. The starting point of our journey is the FUS/FES conjecture, which states that any theory that is a finite expansion set (FES) and admits query rewriting (BDD, FUS) must be uniformly bounded. We show that this conjecture holds for a large class of BDD theories, which we call "local". Upon investigating how "non-local" BDD theories can actually get, we discover unexpected phenomena that, we think, are at odds with prevailing intuitions about BDD theories.

我们在理论中介查询应答的背景下考虑(一阶)查询可重写性。我们旅程的起点是FUS/FES猜想，它指出任何是有限扩展集(FES)并允许查询重写(BDD, FUS)的理论必须是一致有界的。我们证明这个猜想适用于一大类BDD理论，我们称之为“局部”。在调查“非局部”BDD理论如何实际得到时，我们发现了意想不到的现象，我们认为这些现象与关于BDD理论的普遍直觉不一致。

引用次数: 2

Linear-Delay Enumeration for Minimal Steiner Problems 最小Steiner问题的线性延迟枚举

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2020-10-22 DOI: 10.1145/3517804.3524148

Yasuaki Kobayashi, Kazuhiro Kurita, Kunihiro Wasa

Kimelfeld and Sagiv [Kimelfeld and Sagiv, PODS 2006], [Kimelfeld and Sagiv, Inf. Syst. 2008] pointed out that the problem of enumerating K-fragments is of great importance in a keyword search on data graphs. In a graph-theoretic term, the problem corresponds to enumerating minimal Steiner trees in (directed) graphs. In this paper, we propose a linear-delay and polynomial-space algorithm for enumerating all minimal Steiner trees, improving on a previous result in [Kimelfeld and Sagiv, Inf. Syst. 2008]. Our enumeration algorithm can be extended to other Steiner problems, such as minimal Steiner forests, minimal terminal Steiner trees, and minimal directed Steiner trees. As another variant of the minimal Steiner tree enumeration problem, we study the problem of enumerating minimal induced Steiner subgraphs. We propose a polynomial-delay and exponential-space enumeration algorithm of minimal induced Steiner subgraphs on claw-free graphs. Contrary to these tractable results, we show that the problem of enumerating minimal group Steiner trees is at least as hard as the minimal transversal enumeration problem on hypergraphs.

Kimelfeld和Sagiv [Kimelfeld and Sagiv, PODS 2006]， [Kimelfeld and Sagiv, Inf. Syst. 2008]指出，在数据图的关键字搜索中，k片段的枚举问题是非常重要的。在图论术语中，这个问题对应于(有向)图中最小斯坦纳树的枚举。在本文中，我们提出了一种线性延迟和多项式空间算法，用于枚举所有最小Steiner树，改进了先前在[Kimelfeld和Sagiv, Inf. Syst. 2008]中的结果。我们的枚举算法可以推广到其他的斯坦纳问题，如最小斯坦纳森林、最小终端斯坦纳树和最小有向斯坦纳树。作为最小Steiner树枚举问题的另一个变体，我们研究了最小诱导Steiner子图的枚举问题。提出了无爪图上最小诱导Steiner子图的多项式延迟和指数空间枚举算法。与这些容易处理的结果相反，我们证明了枚举最小群斯坦纳树的问题至少与超图上的最小横向枚举问题一样难。

{"title":"Linear-Delay Enumeration for Minimal Steiner Problems","authors":"Yasuaki Kobayashi, Kazuhiro Kurita, Kunihiro Wasa","doi":"10.1145/3517804.3524148","DOIUrl":"https://doi.org/10.1145/3517804.3524148","url":null,"abstract":"Kimelfeld and Sagiv [Kimelfeld and Sagiv, PODS 2006], [Kimelfeld and Sagiv, Inf. Syst. 2008] pointed out that the problem of enumerating K-fragments is of great importance in a keyword search on data graphs. In a graph-theoretic term, the problem corresponds to enumerating minimal Steiner trees in (directed) graphs. In this paper, we propose a linear-delay and polynomial-space algorithm for enumerating all minimal Steiner trees, improving on a previous result in [Kimelfeld and Sagiv, Inf. Syst. 2008]. Our enumeration algorithm can be extended to other Steiner problems, such as minimal Steiner forests, minimal terminal Steiner trees, and minimal directed Steiner trees. As another variant of the minimal Steiner tree enumeration problem, we study the problem of enumerating minimal induced Steiner subgraphs. We propose a polynomial-delay and exponential-space enumeration algorithm of minimal induced Steiner subgraphs on claw-free graphs. Contrary to these tractable results, we show that the problem of enumerating minimal group Steiner trees is at least as hard as the minimal transversal enumeration problem on hypergraphs.","PeriodicalId":230606,"journal":{"name":"Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125753450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Optimal Bounds for Approximate Counting 近似计数的最优界

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 2020-10-05 DOI: 10.1145/3517804.3526225

Jelani Nelson, Huacheng Yu

Storing a counter incremented N times would naively consume O(log N) bits of memory. In 1978 Morris described the very first streaming algorithm: the "Morris Counter" [15]. His algorithm's space bound is a random variable, and it has been shown to be O(log log N + log(1/ε) + log(1/δ)) bits in expectation to provide a (1+ε)-approximation with probability $1-δ to the counter's value. We provide a new simple algorithm with a simple analysis showing that randomized space O(log log N + log(1/ε) + log log(1/δ)) bits suffice for the same task, i.e. an exponentially improved dependence on the inverse failure probability. We then provide a new analysis showing that the original Morris Counter itself, after a minor but necessary tweak, actually also enjoys this same improved upper bound. Lastly, we prove a new lower bound for this task showing optimality of our upper bound. We thus completely resolve the asymptotic space complexity of approximate counting. Furthermore all our constants are explicit, and our lower bound and tightest upper bound differ by a multiplicative factor of at most 3+o(1).

存储递增N次的计数器将天真地消耗O(log N)位内存。1978年，莫里斯描述了第一个流媒体算法:“莫里斯计数器”[15]。他的算法的空间边界是一个随机变量，它已经被证明是O(log log N + log(1/ε) + log(1/δ))位，期望提供一个(1+ε)-近似，概率为$1-δ。我们提供了一种新的简单算法，通过简单的分析表明，随机化空间O(log log N + log(1/ε) + log log(1/δ))位足以满足相同的任务，即对逆失效概率的依赖性呈指数级提高。然后，我们提供了一个新的分析，表明原来的莫里斯计数器本身，经过一个小但必要的调整，实际上也享受相同的改进上界。最后，我们证明了这个任务的一个新的下界，显示了上界的最优性。从而完全解决了近似计数的渐近空间复杂度问题。此外，我们所有的常数都是显式的，我们的下界和最紧上界相差一个乘因子，最多为3+ 0(1)。

引用次数: 9

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems 第41届ACM SIGMOD-SIGACT-SIGAI数据库系统原理研讨会论文集

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

Pub Date : 1900-01-01 DOI: 10.1145/3517804

引用次数: 1

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀