On approximating functions of the singular values in a stream

Proceedings of the forty-eighth annual ACM symposium on Theory of Computing Pub Date : 2016-04-29 DOI:10.1145/2897518.2897581

Yi Li, David P. Woodruff

{"title":"On approximating functions of the singular values in a stream","authors":"Yi Li, David P. Woodruff","doi":"10.1145/2897518.2897581","DOIUrl":null,"url":null,"abstract":"For any real number p > 0, we nearly completely characterize the space complexity of estimating ||A||pp = ∑i=1n σip for n × n matrices A in which each row and each column has O(1) non-zero entries and whose entries are presented one at a time in a data stream model. Here the σi are the singular values of A, and when p ≥ 1, ||A||pp is the p-th power of the Schatten p-norm. We show that when p is not an even integer, to obtain a (1+є)-approximation to ||A||pp with constant probability, any 1-pass algorithm requires n1−g(є) bits of space, where g(є) → 0 as є → 0 and є > 0 is a constant independent of n. However, when p is an even integer, we give an upper bound of n1−2/p (є−1logn) bits of space, which holds even in the turnstile data stream model. The latter is optimal up to (є−1 logn) factors. Our results considerably strengthen lower bounds in previous work for arbitrary (not necessarily sparse) matrices A: the previous best lower bound was Ω(logn) for p∈ (0,1), Ω(n1/p−1/2/logn) for p∈ [1,2) and Ω(n1−2/p) for p∈ (2,∞). We note for p ∈ (2, ∞), while our lower bound for even integers is the same, for other p in this range our lower bound is n1−g(є), which is considerably stronger than the previous n1−2/p for small enough constant є > 0. We obtain similar near-linear lower bounds for Ky-Fan norms, eigenvalue shrinkers, and M-estimators, many of which could have been solvable in logarithmic space prior to our work.","PeriodicalId":442965,"journal":{"name":"Proceedings of the forty-eighth annual ACM symposium on Theory of Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the forty-eighth annual ACM symposium on Theory of Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2897518.2897581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 35

Abstract

For any real number p > 0, we nearly completely characterize the space complexity of estimating ||A||pp = ∑i=1n σip for n × n matrices A in which each row and each column has O(1) non-zero entries and whose entries are presented one at a time in a data stream model. Here the σi are the singular values of A, and when p ≥ 1, ||A||pp is the p-th power of the Schatten p-norm. We show that when p is not an even integer, to obtain a (1+є)-approximation to ||A||pp with constant probability, any 1-pass algorithm requires n1−g(є) bits of space, where g(є) → 0 as є → 0 and є > 0 is a constant independent of n. However, when p is an even integer, we give an upper bound of n1−2/p (є−1logn) bits of space, which holds even in the turnstile data stream model. The latter is optimal up to (є−1 logn) factors. Our results considerably strengthen lower bounds in previous work for arbitrary (not necessarily sparse) matrices A: the previous best lower bound was Ω(logn) for p∈ (0,1), Ω(n1/p−1/2/logn) for p∈ [1,2) and Ω(n1−2/p) for p∈ (2,∞). We note for p ∈ (2, ∞), while our lower bound for even integers is the same, for other p in this range our lower bound is n1−g(є), which is considerably stronger than the previous n1−2/p for small enough constant є > 0. We obtain similar near-linear lower bounds for Ky-Fan norms, eigenvalue shrinkers, and M-estimators, many of which could have been solvable in logarithmic space prior to our work.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

流中奇异值的近似函数

对于任意实数p > 0，我们几乎完全刻画了对于n × n矩阵A估计|A| pp =∑i=1n σip的空间复杂度，其中每一行和每一列都有O(1)个非零项，并且在数据流模型中每次呈现一个项。其中σi为A的奇异值，当p≥1时，σi为Schatten p-范数的p次幂。我们证明了当p不是偶数时，为了以恒定的概率获得对|| a ||pp的(1+ _)-近似，任何1-pass算法都需要n1−g(_)位空间，其中g(_)→0作为_→0并且_ > 0是与n无关的常数。然而，当p是偶数时，我们给出了n1−2/p(_−1logn)位空间的上界，这在turnstile数据流模型中是偶数。后者是最优的(−1 logn)因子。我们的结果大大加强了先前工作中任意(不一定稀疏)矩阵A的下界:先前的最佳下界为p∈(0,1)的Ω(logn)， p∈[1,2]的Ω(n1/p−1/2/logn)和p∈(2，∞)的Ω(n1−2/p)。我们注意到对于p∈(2，∞)，虽然我们对于偶数的下界是相同的，对于这个范围内的其他p，我们的下界是n1 - g(k)，这比之前的n1 - 2/p要强得多，因为足够小的常数k > 0。我们获得了key - fan范数、特征值收缩器和m -估计器的近似线性下界，其中许多在我们的工作之前可以在对数空间中求解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the forty-eighth annual ACM symposium on Theory of Computing

自引率

0.00%

发文量

期刊最新文献

Exponential separation of communication and external information Proceedings of the forty-eighth annual ACM symposium on Theory of Computing Explicit two-source extractors and resilient functions Constant-rate coding for multiparty interactive communication is impossible Approximating connectivity domination in weighted bounded-genus graphs