Sample Efficient Estimation and Recovery in Sparse FFT via Isolation on Average

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) Pub Date : 2017-08-15 DOI:10.1109/FOCS.2017.66

M. Kapralov

{"title":"Sample Efficient Estimation and Recovery in Sparse FFT via Isolation on Average","authors":"M. Kapralov","doi":"10.1109/FOCS.2017.66","DOIUrl":null,"url":null,"abstract":"The problem of computing the Fourier Transform of a signal whose spectrum is dominated by a small number k of frequencies quickly and using a small number of samples of the signal in time domain (the Sparse FFT problem) has received significant attention recently. It is known how to approximately compute the k-sparse Fourier transform in \\approx k\\log^2 n time [Hassanieh et alSTOC12], or using the optimal number O(k\\log n) of samples [Indyk et alFOCS14] in time domain, or come within (\\log\\log n)^{O(1)} factors of both these bounds simultaneously, but no algorithm achieving the optimal O(k\\log n) bound in sublinear time is known.At a high level, sublinear time Sparse FFT algorithms operate by hashing the spectrum of the input signal into \\approx k buckets, identifying frequencies that are isolated in their buckets, subtracting them from the signal and repeating until the entire signal is recovered. The notion of isolation in a bucket, inspired by applications of hashing in sparse recovery with arbitrary linear measurements, has been the main tool in the analysis of Fourier hashing schemes in the literature. However, Fourier hashing schemes, which are implemented via filtering, tend to be noisy in the sense that a frequency that hashes into a bucket contributes a non-negligible amount to neighboring buckets. This leakage to neighboring buckets makes identification and estimation challenging, and the standard analysis based on isolation becomes difficult to use without losing Ω(1) factors in sample complexity.In this paper we propose a new technique for analysing noisy hashing schemes that arise in Sparse FFT, which we refer to as isolation on average}. We apply this technique to two problems in Sparse FFT: estimating the values of a list of frequencies using few samples and computing Sparse FFT itself, achieving sample-optimal results in k\\log^{O(1)} n time for both. We feel that our approach will likely be of interest in designing Fourier sampling schemes for more general settings (e.g. model based Sparse FFT).","PeriodicalId":311592,"journal":{"name":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FOCS.2017.66","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

Abstract

The problem of computing the Fourier Transform of a signal whose spectrum is dominated by a small number k of frequencies quickly and using a small number of samples of the signal in time domain (the Sparse FFT problem) has received significant attention recently. It is known how to approximately compute the k-sparse Fourier transform in \approx k\log^2 n time [Hassanieh et alSTOC12], or using the optimal number O(k\log n) of samples [Indyk et alFOCS14] in time domain, or come within (\log\log n)^{O(1)} factors of both these bounds simultaneously, but no algorithm achieving the optimal O(k\log n) bound in sublinear time is known.At a high level, sublinear time Sparse FFT algorithms operate by hashing the spectrum of the input signal into \approx k buckets, identifying frequencies that are isolated in their buckets, subtracting them from the signal and repeating until the entire signal is recovered. The notion of isolation in a bucket, inspired by applications of hashing in sparse recovery with arbitrary linear measurements, has been the main tool in the analysis of Fourier hashing schemes in the literature. However, Fourier hashing schemes, which are implemented via filtering, tend to be noisy in the sense that a frequency that hashes into a bucket contributes a non-negligible amount to neighboring buckets. This leakage to neighboring buckets makes identification and estimation challenging, and the standard analysis based on isolation becomes difficult to use without losing Ω(1) factors in sample complexity.In this paper we propose a new technique for analysing noisy hashing schemes that arise in Sparse FFT, which we refer to as isolation on average}. We apply this technique to two problems in Sparse FFT: estimating the values of a list of frequencies using few samples and computing Sparse FFT itself, achieving sample-optimal results in k\log^{O(1)} n time for both. We feel that our approach will likely be of interest in designing Fourier sampling schemes for more general settings (e.g. model based Sparse FFT).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于平均隔离的稀疏FFT样本效率估计与恢复

在时域中使用少量的信号样本快速计算由少量k个频率占主导的信号的傅里叶变换(稀疏FFT问题)是近年来备受关注的问题。已知如何在\approx k \log ^ 2n时间内近似计算k-稀疏傅里叶变换[Hassanieh et alSTOC12]，或在时域内使用样本的最优数O(k \log n) [Indyk et alFOCS14]，或同时在这两个界的(\log\log n{)^O(1)}个因子内，但没有已知的算法在亚线性时间内实现最优O(k \log n)界。在高层次上，亚线性时间稀疏FFT算法通过将输入信号的频谱散列到\approx k个桶中，识别在其桶中隔离的频率，从信号中减去它们并重复，直到整个信号被恢复。桶中隔离的概念，受到散列在任意线性测量的稀疏恢复中的应用的启发，已经成为文献中分析傅里叶散列方案的主要工具。然而，通过滤波实现的傅里叶哈希方案往往是有噪声的，因为哈希到一个桶中的频率对相邻桶的贡献不可忽略。这种对相邻桶的泄漏使得识别和估计具有挑战性，并且在不丢失样本复杂性因素的情况下，基于隔离的标准分析变得难以使用。在本文中，我们提出了一种新的技术来分析稀疏FFT中出现的噪声哈希方案，我们称之为平均隔离。我们将这种技术应用于稀疏FFT中的两个问题:使用少量样本估计频率列表的值和计算稀疏FFT本身，在k \log ^{O(1)} n时间内实现样本最优结果。我们觉得我们的方法可能会对设计更一般设置的傅里叶采样方案感兴趣(例如，基于模型的稀疏FFT)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助