首页 > 最新文献

Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing最新文献

英文 中文
General strong polarization 一般强极化
Pub Date : 2018-02-08 DOI: 10.1145/3188745.3188816
Jarosław Błasiok, V. Guruswami, Preetum Nakkiran, A. Rudra, M. Sudan
Arikan’s exciting discovery of polar codes has provided an altogether new way to efficiently achieve Shannon capacity. Given a (constant-sized) invertible matrix M, a family of polar codes can be associated with this matrix and its ability to approach capacity follows from the polarization of an associated [0,1]-bounded martingale, namely its convergence in the limit to either 0 or 1 with probability 1. Arikan showed appropriate polarization of the martingale associated with the matrix G2 = ( [complex formula not displayed] ) to get capacity achieving codes. His analysis was later extended to all matrices M which satisfy an obvious necessary condition for polarization. While Arikan’s theorem does not guarantee that the codes achieve capacity at small blocklengths (specifically in length which is a polynomial in 1/є where є is the difference between the capacity of a channel and the rate of the code), it turns out that a “strong” analysis of the polarization of the underlying martingale would lead to such constructions. Indeed for the martingale associated with G2 such a strong polarization was shown in two independent works ([Guruswami and Xia, IEEE IT ’15] and [Hassani et al., IEEE IT’14]), thereby resolving a major theoretical challenge associated with the efficient attainment of Shannon capacity. In this work we extend the result above to cover martingales associated with all matrices that satisfy the necessary condition for (weak) polarization. In addition to being vastly more general, our proofs of strong polarization are (in our view) also much simpler and modular. Key to our proof is a notion of local polarization that only depends on the evolution of the martingale in a single time step. We show that local polarization always implies strong polarization. We then apply relatively simple reasoning about conditional entropies to prove local polarization in very general settings. Specifically, our result shows strong polarization over all prime fields and leads to efficient capacity-achieving source codes for compressing arbitrary i.i.d. sources, and capacity-achieving channel codes for arbitrary symmetric memoryless channels.
阿里坎令人兴奋的极性码的发现提供了一种全新的方法来有效地实现香农容量。给定一个(常数大小的)可逆矩阵M,可以与该矩阵关联一组极码,其接近容量的能力来自于关联的[0,1]有界鞅的极化,即它在极限收敛于0或1的概率为1。Arikan对矩阵G2 =([复公式未显示])相关的鞅进行适当极化,得到容量实现码。他的分析后来推广到所有满足一个明显的极化必要条件的矩阵M。虽然Arikan定理并不能保证代码在小块长度下实现容量(特别是长度是1/ k的多项式,其中k是信道容量和代码速率之间的差),但事实证明,对底层鞅的极化的“强”分析将导致这样的结构。事实上,对于与G2相关的鞅,这种强烈的极化在两个独立的著作([Guruswami and Xia, IEEE IT ' 15]和[Hassani et al., IEEE IT ' 14])中得到了证明,从而解决了与有效实现香农容量相关的主要理论挑战。在这项工作中,我们扩展了上述结果,以涵盖与满足(弱)极化必要条件的所有矩阵相关的鞅。除了更通用之外,我们的强极化证明(在我们看来)也更简单和模块化。我们证明的关键是局部极化的概念,它只依赖于鞅在单个时间步长的演化。我们证明了局部极化总是意味着强极化。然后,我们应用相对简单的关于条件熵的推理来证明在非常一般的情况下的局部极化。具体地说,我们的结果显示了所有素场上的强极化,并导致压缩任意i.i.d源的有效容量实现源代码和压缩任意对称无内存信道的容量实现信道代码。
{"title":"General strong polarization","authors":"Jarosław Błasiok, V. Guruswami, Preetum Nakkiran, A. Rudra, M. Sudan","doi":"10.1145/3188745.3188816","DOIUrl":"https://doi.org/10.1145/3188745.3188816","url":null,"abstract":"Arikan’s exciting discovery of polar codes has provided an altogether new way to efficiently achieve Shannon capacity. Given a (constant-sized) invertible matrix M, a family of polar codes can be associated with this matrix and its ability to approach capacity follows from the polarization of an associated [0,1]-bounded martingale, namely its convergence in the limit to either 0 or 1 with probability 1. Arikan showed appropriate polarization of the martingale associated with the matrix G2 = ( [complex formula not displayed] ) to get capacity achieving codes. His analysis was later extended to all matrices M which satisfy an obvious necessary condition for polarization. While Arikan’s theorem does not guarantee that the codes achieve capacity at small blocklengths (specifically in length which is a polynomial in 1/є where є is the difference between the capacity of a channel and the rate of the code), it turns out that a “strong” analysis of the polarization of the underlying martingale would lead to such constructions. Indeed for the martingale associated with G2 such a strong polarization was shown in two independent works ([Guruswami and Xia, IEEE IT ’15] and [Hassani et al., IEEE IT’14]), thereby resolving a major theoretical challenge associated with the efficient attainment of Shannon capacity. In this work we extend the result above to cover martingales associated with all matrices that satisfy the necessary condition for (weak) polarization. In addition to being vastly more general, our proofs of strong polarization are (in our view) also much simpler and modular. Key to our proof is a notion of local polarization that only depends on the evolution of the martingale in a single time step. We show that local polarization always implies strong polarization. We then apply relatively simple reasoning about conditional entropies to prove local polarization in very general settings. Specifically, our result shows strong polarization over all prime fields and leads to efficient capacity-achieving source codes for compressing arbitrary i.i.d. sources, and capacity-achieving channel codes for arbitrary symmetric memoryless channels.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"61 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84570942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Algorithmic polynomials 算法的多项式
Pub Date : 2018-01-14 DOI: 10.1145/3188745.3188958
Alexander A. Sherstov
The approximate degree of a Boolean function f(x1,x2,…,xn) is the minimum degree of a real polynomial that approximates f pointwise within 1/3. Upper bounds on approximate degree have a variety of applications in learning theory, differential privacy, and algorithm design in general. Nearly all known upper bounds on approximate degree arise in an existential manner from bounds on quantum query complexity. We develop a first-principles, classical approach to the polynomial approximation of Boolean functions. We use it to give the first constructive upper bounds on the approximate degree of several fundamental problems: (i) O(n3/4−1/(4(2k−1))) for the k-element distinctness problem; (ii) O(n1−1/(k+1)) for the k-subset sum problem; (iii) O(n1−1/(k+1)) for any k-DNF or k-CNF formula; (iv) O(n3/4) for the surjectivity problem. In all cases, we obtain explicit, closed-form approximating polynomials that are unrelated to the quantum arguments from previous work. Our first three results match the bounds from quantum query complexity. Our fourth result improves polynomially on the Θ(n) quantum query complexity of the problem and refutes the conjecture by several experts that surjectivity has approximate degree Ω(n). In particular, we exhibit the first natural problem with a polynomial gap between approximate degree and quantum query complexity.
布尔函数f(x1,x2,…,xn)的近似度数是在1/3以内逐点逼近f的实多项式的最小度数。近似度的上界通常在学习理论、微分隐私和算法设计中有各种各样的应用。几乎所有已知的近似度的上界都是由量子查询复杂度的上界产生的。我们开发了一个第一性原理,经典的方法来多项式逼近布尔函数。我们利用它给出了几个基本问题的近似度的第一建设性上界:(i)对于k元明显性问题的O(n3/4−1/(4(2k−1)));(ii)对于k子集和问题,O(n1−1/(k+1));(iii)对于任何k- dnf或k- cnf公式,O(n1−1/(k+1));(iv) O(n3/4)对于满性问题。在所有的情况下,我们得到显式的,封闭形式的近似多项式,这些多项式与以前工作中的量子参数无关。我们的前三个结果符合量子查询复杂度的界限。我们的第四个结果多项式地提高了问题的Θ(n)量子查询复杂度,并反驳了几位专家关于满性近似度Ω(n)的猜想。特别地,我们展示了在近似度和量子查询复杂度之间具有多项式差距的第一个自然问题。
{"title":"Algorithmic polynomials","authors":"Alexander A. Sherstov","doi":"10.1145/3188745.3188958","DOIUrl":"https://doi.org/10.1145/3188745.3188958","url":null,"abstract":"The approximate degree of a Boolean function f(x1,x2,…,xn) is the minimum degree of a real polynomial that approximates f pointwise within 1/3. Upper bounds on approximate degree have a variety of applications in learning theory, differential privacy, and algorithm design in general. Nearly all known upper bounds on approximate degree arise in an existential manner from bounds on quantum query complexity. We develop a first-principles, classical approach to the polynomial approximation of Boolean functions. We use it to give the first constructive upper bounds on the approximate degree of several fundamental problems: (i) O(n3/4−1/(4(2k−1))) for the k-element distinctness problem; (ii) O(n1−1/(k+1)) for the k-subset sum problem; (iii) O(n1−1/(k+1)) for any k-DNF or k-CNF formula; (iv) O(n3/4) for the surjectivity problem. In all cases, we obtain explicit, closed-form approximating polynomials that are unrelated to the quantum arguments from previous work. Our first three results match the bounds from quantum query complexity. Our fourth result improves polynomially on the Θ(n) quantum query complexity of the problem and refutes the conjecture by several experts that surjectivity has approximate degree Ω(n). In particular, we exhibit the first natural problem with a polynomial gap between approximate degree and quantum query complexity.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"290 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79457747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Operator scaling with specified marginals 具有指定边际的算子缩放
Pub Date : 2018-01-01 DOI: 10.1145/3188745.3188932
Cole Franks
The completely positive maps, a generalization of the nonnegative matrices, are a well-studied class of maps from n× n matrices to m× m matrices. The existence of the operator analogues of doubly stochastic scalings of matrices, the study of which is known as operator scaling, is equivalent to a multitude of problems in computer science and mathematics such rational identity testing in non-commuting variables, noncommutative rank of symbolic matrices, and a basic problem in invariant theory (Garg et. al., 2016). We study operator scaling with specified marginals, which is the operator analogue of scaling matrices to specified row and column sums (or marginals). We characterize the operators which can be scaled to given marginals, much in the spirit of the Gurvits’ algorithmic characterization of the operators that can be scaled to doubly stochastic (Gurvits, 2004). Our algorithm, which is a modified version of Gurvits’ algorithm, produces approximate scalings in time poly(n,m) whenever scalings exist. A central ingredient in our analysis is a reduction from operator scaling with specified marginals to operator scaling in the doubly stochastic setting. Instances of operator scaling with specified marginals arise in diverse areas of study such as the Brascamp-Lieb inequalities, communication complexity, eigenvalues of sums of Hermitian matrices, and quantum information theory. Some of the known theorems in these areas, several of which had no algorithmic proof, are straightforward consequences of our characterization theorem. For instance, we obtain a simple algorithm to find, when it exists, a tuple of Hermitian matrices with given spectra whose sum has a given spectrum. We also prove new theorems such as a generalization of Forster’s theorem (Forster, 2002) concerning radial isotropic position.
完全正映射是对非负矩阵的推广,是一类从n× n矩阵到m× m矩阵的映射。矩阵的双随机缩放算子类似物的存在,其研究被称为算子缩放,相当于计算机科学和数学中的许多问题,如非交换变量的有理恒等检验,符号矩阵的非交换秩,以及不变理论中的一个基本问题(Garg et. al., 2016)。我们研究了具有指定边际的算子缩放,这是将矩阵缩放到指定行和和(或边际)的算子模拟。我们描述了可以缩放到给定边缘的算子,这与Gurvits的算法描述可以缩放到双重随机的算子的精神非常相似(Gurvits, 2004)。我们的算法是Gurvits算法的改进版本,只要存在标量,就会在时间多边形(n,m)中产生近似的标量。在我们的分析中,一个中心成分是从具有指定边际的算子标度到双随机设置中的算子标度的减少。具有特定边际的算子缩放实例出现在不同的研究领域,如Brascamp-Lieb不等式、通信复杂性、厄米矩阵和的特征值和量子信息论。这些领域的一些已知定理,其中一些没有算法证明,是我们的表征定理的直接结果。例如,我们得到了一个简单的算法,当它存在时,求出具有给定谱的厄米矩阵元组,其和具有给定谱。我们还证明了一些新的定理,如关于径向各向同性位置的Forster定理的推广(Forster, 2002)。
{"title":"Operator scaling with specified marginals","authors":"Cole Franks","doi":"10.1145/3188745.3188932","DOIUrl":"https://doi.org/10.1145/3188745.3188932","url":null,"abstract":"The completely positive maps, a generalization of the nonnegative matrices, are a well-studied class of maps from n× n matrices to m× m matrices. The existence of the operator analogues of doubly stochastic scalings of matrices, the study of which is known as operator scaling, is equivalent to a multitude of problems in computer science and mathematics such rational identity testing in non-commuting variables, noncommutative rank of symbolic matrices, and a basic problem in invariant theory (Garg et. al., 2016). We study operator scaling with specified marginals, which is the operator analogue of scaling matrices to specified row and column sums (or marginals). We characterize the operators which can be scaled to given marginals, much in the spirit of the Gurvits’ algorithmic characterization of the operators that can be scaled to doubly stochastic (Gurvits, 2004). Our algorithm, which is a modified version of Gurvits’ algorithm, produces approximate scalings in time poly(n,m) whenever scalings exist. A central ingredient in our analysis is a reduction from operator scaling with specified marginals to operator scaling in the doubly stochastic setting. Instances of operator scaling with specified marginals arise in diverse areas of study such as the Brascamp-Lieb inequalities, communication complexity, eigenvalues of sums of Hermitian matrices, and quantum information theory. Some of the known theorems in these areas, several of which had no algorithmic proof, are straightforward consequences of our characterization theorem. For instance, we obtain a simple algorithm to find, when it exists, a tuple of Hermitian matrices with given spectra whose sum has a given spectrum. We also prove new theorems such as a generalization of Forster’s theorem (Forster, 2002) concerning radial isotropic position.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88710893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
A PSPACE construction of a hitting set for the closure of small algebraic circuits 小代数电路闭包命中集的PSPACE构造
Pub Date : 2017-12-28 DOI: 10.1145/3188745.3188792
Michael A. Forbes, Amir Shpilka
In this paper we study the complexity of constructing a hitting set for VP, the class of polynomials that can be infinitesimally approximated by polynomials that are computed by polynomial sized algebraic circuits, over the real or complex numbers. Specifically, we show that there is a PSPACE algorithm that given n,s,r in unary outputs a set of rational n-tuples of size poly(n,s,r), with poly(n,s,r) bit complexity, that hits all n-variate polynomials of degree r that are the limit of size s algebraic circuits. Previously it was known that a random set of this size is a hitting set, but a construction that is certified to work was only known in EXPSPACE (or EXPH assuming the generalized Riemann hypothesis). As a corollary we get that a host of other algebraic problems such as Noether Normalization Lemma, can also be solved in PSPACE deterministically, where earlier only randomized algorithms and EXPSPACE algorithms (or EXPH assuming the generalized Riemann hypothesis) were known. The proof relies on the new notion of a robust hitting set which is a set of inputs such that any nonzero polynomial that can be computed by a polynomial size algebraic circuit, evaluates to a not too small value on at least one element of the set. Proving the existence of such a robust hitting set is the main technical difficulty in the proof. Our proof uses anti-concentration results for polynomials, basic tools from algebraic geometry and the existential theory of the reals.
本文研究了在实数或复数上构造VP的命中集的复杂性,VP是一类多项式,它可以被多项式大小的代数电路计算的多项式无限逼近。具体地说,我们证明了存在一个PSPACE算法,该算法在一元中给定n,s,r,输出一个大小为poly(n,s,r)的有理n元组集合,具有poly(n,s,r)位复杂度,它命中所有的n变量多项式,次数为r,是大小为s的代数电路的极限。以前我们知道这种大小的随机集是命中集,但是证明有效的构造只在EXPSPACE(或假设广义黎曼假设的EXPH)中已知。作为一个推论,我们得到许多其他代数问题,如Noether归一化引理,也可以在PSPACE中确定性地解决,其中早期只有随机算法和EXPSPACE算法(或假设广义黎曼假设的EXPH)是已知的。该证明依赖于鲁棒命中集的新概念,鲁棒命中集是输入的集合,使得任何可以由多项式大小的代数电路计算的非零多项式,在集合的至少一个元素上求值不太小。证明这种鲁棒命中集的存在性是证明中的主要技术难点。我们的证明使用多项式的反集中结果,代数几何的基本工具和实数的存在论。
{"title":"A PSPACE construction of a hitting set for the closure of small algebraic circuits","authors":"Michael A. Forbes, Amir Shpilka","doi":"10.1145/3188745.3188792","DOIUrl":"https://doi.org/10.1145/3188745.3188792","url":null,"abstract":"In this paper we study the complexity of constructing a hitting set for VP, the class of polynomials that can be infinitesimally approximated by polynomials that are computed by polynomial sized algebraic circuits, over the real or complex numbers. Specifically, we show that there is a PSPACE algorithm that given n,s,r in unary outputs a set of rational n-tuples of size poly(n,s,r), with poly(n,s,r) bit complexity, that hits all n-variate polynomials of degree r that are the limit of size s algebraic circuits. Previously it was known that a random set of this size is a hitting set, but a construction that is certified to work was only known in EXPSPACE (or EXPH assuming the generalized Riemann hypothesis). As a corollary we get that a host of other algebraic problems such as Noether Normalization Lemma, can also be solved in PSPACE deterministically, where earlier only randomized algorithms and EXPSPACE algorithms (or EXPH assuming the generalized Riemann hypothesis) were known. The proof relies on the new notion of a robust hitting set which is a set of inputs such that any nonzero polynomial that can be computed by a polynomial size algebraic circuit, evaluates to a not too small value on at least one element of the set. Proving the existence of such a robust hitting set is the main technical difficulty in the proof. Our proof uses anti-concentration results for polynomials, basic tools from algebraic geometry and the existential theory of the reals.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"161 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85415254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Improved distributed algorithms for exact shortest paths 改进的精确最短路径分布式算法
Pub Date : 2017-12-25 DOI: 10.1145/3188745.3188948
M. Ghaffari, Jason Li
Computing shortest paths is one of the central problems in the theory of distributed computing. For the last few years, substantial progress has been made on the approximate single source shortest paths problem, culminating in an algorithm of Henzinger, Krinninger, and Nanongkai [STOC’16] which deterministically computes (1+o(1))-approximate shortest paths in Õ(D+√n) time, where D is the hop-diameter of the graph. Up to logarithmic factors, this time complexity is optimal, matching the lower bound of Elkin [STOC’04]. The question of exact shortest paths however saw no algorithmic progress for decades, until the recent breakthrough of Elkin [STOC’17], which established a sublinear-time algorithm for exact single source shortest paths on undirected graphs. Shortly after, Huang et al. [FOCS’17] provided improved algorithms for exact all pairs shortest paths problem on directed graphs. In this paper, we provide an alternative single-source shortest path algorithm with complexity Õ(n3/4D1/4). For polylogarithmic D, this improves on Elkin’s Õ(n5/6) bound and gets closer to the Ω(n1/2) lower bound of Elkin [STOC’04]. For larger values of D, we present an improved variant of our algorithm which achieves complexity Õ(max{ n3/4+o(1) , n3/4D1/6} + D ), and thus compares favorably with Elkin’s bound of Õ(max{ n5/6, n2/3D1/3} + D ) in essentially the entire range of parameters. This algorithm provides also a qualitative improvement, because it works for the more challenging case of directed graph (i.e., graphs where the two directions of an edge can have different weights), constituting the first sublinear-time algorithm for directed graphs. Our algorithm also extends to the case of exact r-source shortest paths, in which we provide the fastest algorithm for moderately small r and D, improving on those of Huang et al.
最短路径计算是分布式计算理论的核心问题之一。在过去几年中,在近似单源最短路径问题上取得了实质性进展,最终产生了Henzinger, Krinninger和Nanongkai [STOC ' 16]的算法,该算法在Õ(D+√n)时间内确定性地计算(1+o(1))-近似最短路径,其中D是图的跳径。在对数因子范围内,该时间复杂度是最优的,符合Elkin的下界[STOC ' 04]。然而,精确最短路径问题几十年来一直没有任何算法进展,直到最近Elkin [STOC ' 17]的突破,他建立了一种针对无向图上精确单源最短路径的亚线性时间算法。不久之后,Huang等人[FOCS ' 17]提供了有向图上精确全对最短路径问题的改进算法。在本文中,我们提供了一个替代的单源最短路径算法,其复杂度为Õ(n3/4D1/4)。对于多对数D,这改进了Elkin的Õ(n5/6)界,更接近Elkin的Ω(n1/2)下界[STOC ' 04]。对于较大的D值,我们提出了一种改进的算法,其复杂度为Õ(max{n3/4+o(1), n3/4D1/6} + D),因此在整个参数范围内与Elkin的界Õ(max{n5/6, n2/ 3d /3} + D)相比具有优势。该算法也提供了一个质的改进,因为它适用于更具挑战性的有向图(即,一条边的两个方向可以具有不同权重的图),构成了有向图的第一个次线性时间算法。我们的算法还扩展到精确r源最短路径的情况,在中等大小的r和D下,我们提供了最快的算法,改进了Huang等人的算法。
{"title":"Improved distributed algorithms for exact shortest paths","authors":"M. Ghaffari, Jason Li","doi":"10.1145/3188745.3188948","DOIUrl":"https://doi.org/10.1145/3188745.3188948","url":null,"abstract":"Computing shortest paths is one of the central problems in the theory of distributed computing. For the last few years, substantial progress has been made on the approximate single source shortest paths problem, culminating in an algorithm of Henzinger, Krinninger, and Nanongkai [STOC’16] which deterministically computes (1+o(1))-approximate shortest paths in Õ(D+√n) time, where D is the hop-diameter of the graph. Up to logarithmic factors, this time complexity is optimal, matching the lower bound of Elkin [STOC’04]. The question of exact shortest paths however saw no algorithmic progress for decades, until the recent breakthrough of Elkin [STOC’17], which established a sublinear-time algorithm for exact single source shortest paths on undirected graphs. Shortly after, Huang et al. [FOCS’17] provided improved algorithms for exact all pairs shortest paths problem on directed graphs. In this paper, we provide an alternative single-source shortest path algorithm with complexity Õ(n3/4D1/4). For polylogarithmic D, this improves on Elkin’s Õ(n5/6) bound and gets closer to the Ω(n1/2) lower bound of Elkin [STOC’04]. For larger values of D, we present an improved variant of our algorithm which achieves complexity Õ(max{ n3/4+o(1) , n3/4D1/6} + D ), and thus compares favorably with Elkin’s bound of Õ(max{ n5/6, n2/3D1/3} + D ) in essentially the entire range of parameters. This algorithm provides also a qualitative improvement, because it works for the more challenging case of directed graph (i.e., graphs where the two directions of an edge can have different weights), constituting the first sublinear-time algorithm for directed graphs. Our algorithm also extends to the case of exact r-source shortest paths, in which we provide the fastest algorithm for moderately small r and D, improving on those of Huang et al.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72734494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
Stochastic localization + Stieltjes barrier = tight bound for log-Sobolev 随机定位+ Stieltjes势垒= log-Sobolev的紧界
Pub Date : 2017-12-05 DOI: 10.1145/3188745.3188866
Y. Lee, S. Vempala
Logarithmic Sobolev inequalities are a powerful way to estimate the rate of convergence of Markov chains and to derive concentration inequalities on distributions. We prove that the log-Sobolev constant of any isotropic logconcave density in Rn with support of diameter D is Ω(1/D), resolving a question posed by Frieze and Kannan in 1997. This is asymptotically the best possible estimate and improves on the previous bound of Ω(1/D2) by Kannan-Lovász-Montenegro. It follows that for any isotropic logconcave density, the ball walk with step size δ=Θ(1/√n) mixes in O*(n2D) proper steps from any starting point. This improves on the previous best bound of O*(n2D2) and is also asymptotically tight. The new bound leads to the following refined large deviation inequality for an L-Lipschitz function g over an isotropic logconcave density p: for any t>0, [complex formula not displayed] where ḡ is the median or mean of g for x∼ p; this improves on previous bounds by Paouris and by Guedon-Milman. Our main proof is based on stochastic localization together with a Stieltjes-type barrier function.
对数索博列夫不等式是估计马尔可夫链收敛速度和推导分布上的集中不等式的有力方法。我们证明了在直径为D的支持下,Rn中任何各向同性对数凹密度的log-Sobolev常数为Ω(1/D),解决了Frieze和Kannan在1997年提出的问题。这是渐近的最佳估计,并且在Ω(1/D2)的上一个边界上改进了Kannan-Lovász-Montenegro。由此可见,对于任何各向同性对数凹密度,步长δ=Θ(1/√n)的球步从任何起点混合为O*(n2D)步长。这改进了先前的O*(n2D2)的最佳界,并且也是渐近紧的。新的界导致了L-Lipschitz函数g在各向同性对数凹密度p上的以下改进的大偏差不等式:对于任何t>0,[未显示的复杂公式],其中,z为x ~ p的g的中位数或平均值;这改进了Paouris和Guedon-Milman之前的边界。我们的主要证明是基于随机局部化和stieltjess型势垒函数。
{"title":"Stochastic localization + Stieltjes barrier = tight bound for log-Sobolev","authors":"Y. Lee, S. Vempala","doi":"10.1145/3188745.3188866","DOIUrl":"https://doi.org/10.1145/3188745.3188866","url":null,"abstract":"Logarithmic Sobolev inequalities are a powerful way to estimate the rate of convergence of Markov chains and to derive concentration inequalities on distributions. We prove that the log-Sobolev constant of any isotropic logconcave density in Rn with support of diameter D is Ω(1/D), resolving a question posed by Frieze and Kannan in 1997. This is asymptotically the best possible estimate and improves on the previous bound of Ω(1/D2) by Kannan-Lovász-Montenegro. It follows that for any isotropic logconcave density, the ball walk with step size δ=Θ(1/√n) mixes in O*(n2D) proper steps from any starting point. This improves on the previous best bound of O*(n2D2) and is also asymptotically tight. The new bound leads to the following refined large deviation inequality for an L-Lipschitz function g over an isotropic logconcave density p: for any t>0, [complex formula not displayed] where ḡ is the median or mean of g for x∼ p; this improves on previous bounds by Paouris and by Guedon-Milman. Our main proof is based on stochastic localization together with a Stieltjes-type barrier function.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"51 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87846042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
(Gap/S)ETH hardness of SVP (Gap/S) SVP的ETH硬度
Pub Date : 2017-12-04 DOI: 10.1145/3188745.3188840
Divesh Aggarwal, Noah Stephens-Davidowitz
We prove the following quantitative hardness results for the Shortest Vector Problem in the ℓp norm (SVP_p), where n is the rank of the input lattice. For “almost all” p > p0 ≈ 2.1397, there is no 2n/Cp-time algorithm for SVP_p for some explicit (easily computable) constant Cp > 0 unless the (randomized) Strong Exponential Time Hypothesis (SETH) is false. (E.g., for p ≥ 3, Cp < 1 + (p+3) 2−p + 10 p2 2−2p.) For any 1 ≤ p ≤ ∞, there is no 2o(n)-time algorithm for SVP_p unless the non-uniform Gap-Exponential Time Hypothesis (Gap-ETH) is false. Furthermore, for each such p, there exists a constant γp > 1 such that the same result holds even for γp-approximate SVP_p. For p > 2, the above statement holds under the weaker assumption of randomized Gap-ETH. I.e., there is no 2o(n)-time algorithm for γp-approximate SVP_p unless randomized Gap-ETH is false. See http://arxiv.org/abs/1712.00942 for a complete exposition.
我们证明了在p范数(SVP_p)中最短向量问题的定量硬度结果,其中n是输入格的秩。对于“几乎所有”p > p0≈2.1397,对于某些显式(易于计算的)常数Cp > 0, SVP_p不存在2n/Cp- Time算法,除非(随机化)强指数时间假设(SETH)为假。(例如,p≥3,Cp < 1 + (p + 3) 2−p + 10 p2 2−2 p。)对于任意1≤p≤∞,除非非均匀间隙-指数时间假设(Gap-ETH)为假,否则SVP_p不存在20 (n)时间算法。此外,对于每一个这样的p,都存在一个常数γp > 1,使得同样的结果对γp近似的SVP_p也成立。对于p > 2,在随机化Gap-ETH的弱假设下,上述表述成立。即,除非随机化Gap-ETH为假,否则不存在20 (n)时间的γ - p近似SVP_p算法。参见http://arxiv.org/abs/1712.00942获得完整的说明。
{"title":"(Gap/S)ETH hardness of SVP","authors":"Divesh Aggarwal, Noah Stephens-Davidowitz","doi":"10.1145/3188745.3188840","DOIUrl":"https://doi.org/10.1145/3188745.3188840","url":null,"abstract":"We prove the following quantitative hardness results for the Shortest Vector Problem in the ℓp norm (SVP_p), where n is the rank of the input lattice. For “almost all” p > p0 ≈ 2.1397, there is no 2n/Cp-time algorithm for SVP_p for some explicit (easily computable) constant Cp > 0 unless the (randomized) Strong Exponential Time Hypothesis (SETH) is false. (E.g., for p ≥ 3, Cp < 1 + (p+3) 2−p + 10 p2 2−2p.) For any 1 ≤ p ≤ ∞, there is no 2o(n)-time algorithm for SVP_p unless the non-uniform Gap-Exponential Time Hypothesis (Gap-ETH) is false. Furthermore, for each such p, there exists a constant γp > 1 such that the same result holds even for γp-approximate SVP_p. For p > 2, the above statement holds under the weaker assumption of randomized Gap-ETH. I.e., there is no 2o(n)-time algorithm for γp-approximate SVP_p unless randomized Gap-ETH is false. See http://arxiv.org/abs/1712.00942 for a complete exposition.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84109845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
A generalized Turán problem and its applications 一个广义Turán问题及其应用
Pub Date : 2017-12-03 DOI: 10.1145/3188745.3188778
Lior Gishboliner, A. Shapira
Our first theorem in this paper is a hierarchy theorem for the query complexity of testing graph properties with 1-sided error; more precisely, we show that for every sufficiently fast-growing function f, there is a graph property whose 1-sided-error query complexity is precisely f(Θ(1/ε)). No result of this type was previously known for any f which is super-polynomial. Goldreich [ECCC 2005] asked to exhibit a graph property whose query complexity is 2Θ(1/ε). Our hierarchy theorem partially resolves this problem by exhibiting a property whose 1-sided-error query complexity is 2Θ(1/ε). We also use our hierarchy theorem in order to resolve a problem raised by the second author and Alon [STOC 2005] regarding testing relaxed versions of bipartiteness. Our second theorem states that for any function f there is a graph property whose 1-sided-error query complexity is f(Θ(1/ε)) while its 2-sided-error query complexity is only poly(1/ε). This is the first indication of the surprising power that 2-sided-error testing algorithms have over 1-sided-error ones, even when restricted to properties that are testable with 1-sided error. Again, no result of this type was previously known for any f that is super polynomial. The above theorems are derived from a graph theoretic result which we think is of independent interest, and might have further applications. Alon and Shikhelman [JCTB 2016] introduced the following generalized Turán problem: for fixed graphs H and T, and an integer n, what is the maximum number of copies of T, denoted by ex(n,T,H), that can appear in an n-vertex H-free graph? This problem received a lot of attention recently, with an emphasis on ex(n,C3,C2ℓ +1). Our third theorem in this paper gives tight bounds for ex(n,Ck,Cℓ) for all the remaining values of k and ℓ.
本文的第一个定理是具有单侧误差的图属性测试查询复杂度的层次定理;更准确地说,我们证明了对于每一个足够快速的函数f,存在一个图属性,其单边错误查询复杂度恰好是f(Θ(1/ε))。对于任何超多项式f,以前都不知道这种类型的结果。Goldreich [ECCC 2005]要求展示一个查询复杂度为2Θ(1/ε)的图属性。我们的层次定理通过展示一个单侧错误查询复杂度为2Θ(1/ε)的属性,部分地解决了这个问题。我们还使用我们的层次定理来解决由第二作者和Alon [STOC 2005]提出的关于测试放宽版本的双方性的问题。我们的第二个定理表明,对于任何函数f,存在一个图属性,其单边错误查询复杂度为f(Θ(1/ε)),而其双向错误查询复杂度仅为poly(1/ε)。这是双侧错误测试算法比单侧错误测试算法的惊人威力的第一个迹象,即使局限于可以用单侧错误测试的属性。同样,对于任何f是超多项式的情况,以前都不知道这种类型的结果。以上定理是由图论的一个结果推导出来的,我们认为这个结果有独立的意义,并且可能有进一步的应用。Alon和Shikhelman [JCTB 2016]引入了以下广义Turán问题:对于固定图H和T,以及整数n,在一个n顶点的无H图中,T的最大副本数(用ex(n,T,H)表示)是多少?这个问题最近受到了很多关注,重点是ex(n,C3,C2, r +1)。本文的第三个定理对于k和r的所有剩余值给出了ex(n,Ck,C, r)的紧界。
{"title":"A generalized Turán problem and its applications","authors":"Lior Gishboliner, A. Shapira","doi":"10.1145/3188745.3188778","DOIUrl":"https://doi.org/10.1145/3188745.3188778","url":null,"abstract":"Our first theorem in this paper is a hierarchy theorem for the query complexity of testing graph properties with 1-sided error; more precisely, we show that for every sufficiently fast-growing function f, there is a graph property whose 1-sided-error query complexity is precisely f(Θ(1/ε)). No result of this type was previously known for any f which is super-polynomial. Goldreich [ECCC 2005] asked to exhibit a graph property whose query complexity is 2Θ(1/ε). Our hierarchy theorem partially resolves this problem by exhibiting a property whose 1-sided-error query complexity is 2Θ(1/ε). We also use our hierarchy theorem in order to resolve a problem raised by the second author and Alon [STOC 2005] regarding testing relaxed versions of bipartiteness. Our second theorem states that for any function f there is a graph property whose 1-sided-error query complexity is f(Θ(1/ε)) while its 2-sided-error query complexity is only poly(1/ε). This is the first indication of the surprising power that 2-sided-error testing algorithms have over 1-sided-error ones, even when restricted to properties that are testable with 1-sided error. Again, no result of this type was previously known for any f that is super polynomial. The above theorems are derived from a graph theoretic result which we think is of independent interest, and might have further applications. Alon and Shikhelman [JCTB 2016] introduced the following generalized Turán problem: for fixed graphs H and T, and an integer n, what is the maximum number of copies of T, denoted by ex(n,T,H), that can appear in an n-vertex H-free graph? This problem received a lot of attention recently, with an emphasis on ex(n,C3,C2ℓ +1). Our third theorem in this paper gives tight bounds for ex(n,Ck,Cℓ) for all the remaining values of k and ℓ.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74692540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
On the parameterized complexity of approximating dominating set 关于逼近支配集的参数化复杂度
Pub Date : 2017-11-29 DOI: 10.1145/3188745.3188896
S. KarthikC., Bundit Laekhanukit, Pasin Manurangsi
We study the parameterized complexity of approximating the k-Dominating Set (domset) problem where an integer k and a graph G on n vertices are given as input, and the goal is to find a dominating set of size at most F(k) · k whenever the graph G has a dominating set of size k. When such an algorithm runs in time T(k)poly(n) (i.e., FPT-time) for some computable function T, it is said to be an F(k)-FPT-approximation algorithm for k-domset. Whether such an algorithm exists is listed in the seminal book of Downey and Fellows (2013) as one of the ”most infamous” open problems in Parameterized Complexity. This work gives an almost complete answer to this question by showing the non-existence of such an algorithm under W[1]≠FPT and further providing tighter running time lower bounds under stronger hypotheses. Specifically, we prove the following for every computable functions T, F and every constant ε > 0: (i) Assuming W[1]≠FPT, there is no F(k)-FPT-approximation algorithm for k-domset, (ii) Assuming the Exponential Time Hypothesis (ETH), there is no F(k)-approximation algorithm for k-domset that runs in T(k)no(k) time, (iii) Assuming the Strong Exponential Time Hypothesis (SETH), for every integer k ≥ 2, there is no F(k)-approximation algorithm for k-domset that runs in T(k)nk − ε time, (iv) Assuming the k-sum Hypothesis, for every integer k ≥ 3, there is no F(k)-approximation algorithm for k-domset that runs in T(k) n⌈ k/2 ⌉ − ε time. Previously, only constant ratio FPT-approximation algorithms were ruled out under W[1]≠FPT and (log1/4 − ε k)-FPT-approximation algorithms were ruled out under ETH [Chen and Lin, FOCS 2016]. Recently, the non-existence of an F(k)-FPT-approximation algorithm for any function F was shown under gapETH [Chalermsook et al., FOCS 2017]. Note that, to the best of our knowledge, no running time lower bound of the form nδ k for any absolute constant δ > 0 was known before even for any constant factor inapproximation ratio. Our results are obtained by establishing a connection between communication complexity and hardness of approximation, generalizing the ideas from a recent breakthrough work of Abboud et al. [FOCS 2017]. Specifically, we show that to prove hardness of approximation of a certain parameterized variant of the label cover problem, it suffices to devise a specific protocol for a communication problem that depends on which hypothesis we rely on. Each of these communication problems turns out to be either a well studied problem or a variant of one; this allows us to easily apply known techniques to solve them.
研究了参数化近似k-Dominating集(domset)问题的复杂性,整数k和给出图G在n顶点作为输入,目标是找到一组主导的大小最多F·k (k)每当图G有一组主导的k大小。当这样的一个算法在时间T (k)聚(n)(例如,FPT-time)对于一些可计算函数T,据说是一个F (k) -FPT-approximation k-domset算法。这种算法是否存在,在唐尼和费罗斯(2013)的开创性著作中被列为参数化复杂性中“最臭名昭著”的开放问题之一。本文给出了在W[1]≠FPT条件下不存在这种算法的基本答案,并在更强的假设条件下提供了更严格的运行时间下界。具体地说,我们证明了对于每一个可计算函数T, F和每一个ε > 0的常数:(i)假设W[1]≠FPT,不存在k-domset的F(k)-FPT逼近算法;(ii)假设指数时间假设(ETH),不存在运行在T(k)no(k)时间内的k-domset的F(k)-逼近算法;(iii)假设强指数时间假设(SETH),对于每一个整数k≥2,不存在运行在T(k)nk−ε时间内的k-domset的F(k)-逼近算法;(iv)假设k和假设,对于每一个整数k≥3,对于k-domset,不存在运行在T(k) n≤k/2≤ε时间内的F(k)逼近算法。在此之前,在W[1]≠FPT条件下,只排除了常数比FPT近似算法,在ETH条件下排除了(log1/4−ε k)-FPT近似算法[Chen and Lin, FOCS 2016]。最近,在gapETH下证明了任何函数F的F(k)- fpt逼近算法的不存在性[Chalermsook et al., FOCS 2017]。请注意,据我们所知,对于任何绝对常数δ > 0,甚至对于任何常数因子不近似比,之前都没有已知形式为nδ k的运行时间下界。我们的结果是通过建立通信复杂性和近似硬度之间的联系获得的,推广了Abboud等人最近突破性工作的想法[FOCS 2017]。具体地说,我们表明,为了证明标签覆盖问题的某个参数化变体的逼近的硬度,它足以为通信问题设计一个特定的协议,这取决于我们所依赖的假设。这些沟通问题中的每一个要么是一个研究得很好的问题,要么是一个问题的变体;这使得我们可以很容易地应用已知的技术来解决它们。
{"title":"On the parameterized complexity of approximating dominating set","authors":"S. KarthikC., Bundit Laekhanukit, Pasin Manurangsi","doi":"10.1145/3188745.3188896","DOIUrl":"https://doi.org/10.1145/3188745.3188896","url":null,"abstract":"We study the parameterized complexity of approximating the k-Dominating Set (domset) problem where an integer k and a graph G on n vertices are given as input, and the goal is to find a dominating set of size at most F(k) · k whenever the graph G has a dominating set of size k. When such an algorithm runs in time T(k)poly(n) (i.e., FPT-time) for some computable function T, it is said to be an F(k)-FPT-approximation algorithm for k-domset. Whether such an algorithm exists is listed in the seminal book of Downey and Fellows (2013) as one of the ”most infamous” open problems in Parameterized Complexity. This work gives an almost complete answer to this question by showing the non-existence of such an algorithm under W[1]≠FPT and further providing tighter running time lower bounds under stronger hypotheses. Specifically, we prove the following for every computable functions T, F and every constant ε > 0: (i) Assuming W[1]≠FPT, there is no F(k)-FPT-approximation algorithm for k-domset, (ii) Assuming the Exponential Time Hypothesis (ETH), there is no F(k)-approximation algorithm for k-domset that runs in T(k)no(k) time, (iii) Assuming the Strong Exponential Time Hypothesis (SETH), for every integer k ≥ 2, there is no F(k)-approximation algorithm for k-domset that runs in T(k)nk − ε time, (iv) Assuming the k-sum Hypothesis, for every integer k ≥ 3, there is no F(k)-approximation algorithm for k-domset that runs in T(k) n⌈ k/2 ⌉ − ε time. Previously, only constant ratio FPT-approximation algorithms were ruled out under W[1]≠FPT and (log1/4 − ε k)-FPT-approximation algorithms were ruled out under ETH [Chen and Lin, FOCS 2016]. Recently, the non-existence of an F(k)-FPT-approximation algorithm for any function F was shown under gapETH [Chalermsook et al., FOCS 2017]. Note that, to the best of our knowledge, no running time lower bound of the form nδ k for any absolute constant δ > 0 was known before even for any constant factor inapproximation ratio. Our results are obtained by establishing a connection between communication complexity and hardness of approximation, generalizing the ideas from a recent breakthrough work of Abboud et al. [FOCS 2017]. Specifically, we show that to prove hardness of approximation of a certain parameterized variant of the label cover problem, it suffices to devise a specific protocol for a communication problem that depends on which hypothesis we rely on. Each of these communication problems turns out to be either a well studied problem or a variant of one; this allows us to easily apply known techniques to solve them.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91022715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 82
Mixture models, robustness, and sum of squares proofs 混合模型,鲁棒性和平方和证明
Pub Date : 2017-11-20 DOI: 10.1145/3188745.3188748
Samuel B. Hopkins, Jerry Li
We use the Sum of Squares method to develop new efficient algorithms for learning well-separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that substantially improve upon the statistical guarantees achieved by previous efficient algorithms. Our contributions are: Mixture models with separated means: We study mixtures of poly(k)-many k-dimensional distributions where the means of every pair of distributions are separated by at least kε. In the special case of spherical Gaussian mixtures, we give a kO(1/ε)-time algorithm that learns the means assuming separation at least kε, for any ε> 0. This is the first algorithm to improve on greedy (“single-linkage”) and spectral clustering, breaking a long-standing barrier for efficient algorithms at separation k1/4. Robust estimation: When an unknown (1−ε)-fraction of X1,…,Xn are chosen from a sub-Gaussian distribution with mean µ but the remaining points are chosen adversarially, we give an algorithm recovering µ to error ε1−1/t in time kO(t), so long as sub-Gaussian-ness up to O(t) moments can be certified by a Sum of Squares proof. This is the first polynomial-time algorithm with guarantees approaching the information-theoretic limit for non-Gaussian distributions. Previous algorithms could not achieve error better than ε1/2. As a corollary, we achieve similar results for robust covariance estimation. Both of these results are based on a unified technique. Inspired by recent algorithms of Diakonikolas et al. in robust statistics, we devise an SDP based on the Sum of Squares method for the following setting: given X1,…,Xn ∈ ℝk for large k and n = poly(k) with the promise that a subset of X1,…,Xn were sampled from a probability distribution with bounded moments, recover some information about that distribution.
我们使用平方和方法来开发新的高效算法,用于学习良好分离的高斯混合和鲁棒平均估计,两者都是在高维上,大大提高了以前高效算法所实现的统计保证。我们的贡献是:具有分离均值的混合模型:我们研究多(k)-许多k维分布的混合,其中每对分布的均值至少相隔kε。在球形高斯混合的特殊情况下,我们给出了一个kO(1/ε)时间算法,该算法学习了假设分离至少为kε的均值,对于任何ε> 0。这是第一个改进贪婪(“单链接”)和谱聚类的算法,打破了在分离k1/4时高效算法的长期障碍。鲁棒性估计:当从均值为μ的亚高斯分布中选取未知的(1−ε)分数X1,…,Xn,而其余的点都是逆向选取时,我们给出了一种在kO(t)时间内恢复μ to误差ε1−1/t的算法,只要在O(t)阶矩以内的亚高斯性可以通过平方和证明得到证明。这是第一个多项式时间算法,保证接近非高斯分布的信息论极限。以往的算法均不能达到优于ε1/2的误差。作为推论,我们在稳健协方差估计上也得到了类似的结果。这两个结果都是基于一个统一的技术。受Diakonikolas等人在鲁棒统计中的最新算法的启发,我们设计了一种基于平方和方法的SDP,用于以下设置:给定X1,…,Xn∈∈k(大k), n = poly(k),并承诺从具有有界矩的概率分布中采样X1,…,Xn的子集,恢复该分布的一些信息。
{"title":"Mixture models, robustness, and sum of squares proofs","authors":"Samuel B. Hopkins, Jerry Li","doi":"10.1145/3188745.3188748","DOIUrl":"https://doi.org/10.1145/3188745.3188748","url":null,"abstract":"We use the Sum of Squares method to develop new efficient algorithms for learning well-separated mixtures of Gaussians and robust mean estimation, both in high dimensions, that substantially improve upon the statistical guarantees achieved by previous efficient algorithms. Our contributions are: Mixture models with separated means: We study mixtures of poly(k)-many k-dimensional distributions where the means of every pair of distributions are separated by at least kε. In the special case of spherical Gaussian mixtures, we give a kO(1/ε)-time algorithm that learns the means assuming separation at least kε, for any ε> 0. This is the first algorithm to improve on greedy (“single-linkage”) and spectral clustering, breaking a long-standing barrier for efficient algorithms at separation k1/4. Robust estimation: When an unknown (1−ε)-fraction of X1,…,Xn are chosen from a sub-Gaussian distribution with mean µ but the remaining points are chosen adversarially, we give an algorithm recovering µ to error ε1−1/t in time kO(t), so long as sub-Gaussian-ness up to O(t) moments can be certified by a Sum of Squares proof. This is the first polynomial-time algorithm with guarantees approaching the information-theoretic limit for non-Gaussian distributions. Previous algorithms could not achieve error better than ε1/2. As a corollary, we achieve similar results for robust covariance estimation. Both of these results are based on a unified technique. Inspired by recent algorithms of Diakonikolas et al. in robust statistics, we devise an SDP based on the Sum of Squares method for the following setting: given X1,…,Xn ∈ ℝk for large k and n = poly(k) with the promise that a subset of X1,…,Xn were sampled from a probability distribution with bounded moments, recover some information about that distribution.","PeriodicalId":20593,"journal":{"name":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","volume":"108 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81633100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 158
期刊
Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1