Smoothed Analysis with Adaptive Adversaries

IF 2.5 · Zone 2 (Computer Science) · Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE · Journal of the ACM · Pub Date: 2024-04-13 · DOI: 10.1145/3656638

Nika Haghtalab, Tim Roughgarden, Abhishek Shetty


Citations: 0

Abstract

Smoothed Analysis with Adaptive Adversaries

We prove novel algorithmic guarantees for several online problems in the smoothed analysis model. In this model, at each time step an adversary chooses an input distribution with density function bounded above pointwise by \(\tfrac{1}{\sigma}\) times that of the uniform distribution; nature then samples an input from this distribution. Here, σ is a parameter that interpolates between the extremes of worst-case and average-case analysis. Crucially, our results hold for adaptive adversaries that can base their choice of an input distribution on the decisions of the algorithm and the realizations of the inputs in the previous time steps. An adaptive adversary can nontrivially correlate inputs at different time steps with each other and with the algorithm’s current state; this appears to rule out the standard proof approaches in smoothed analysis.
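The interaction described above can be made concrete with a small simulation. The following is only an illustrative sketch under stated assumptions, not the paper's construction: the domain is taken to be [0, 1], each σ-smooth distribution is a uniform window of length σ (so its density is exactly 1/σ on its support), and the names `sample_sigma_smooth`, `adaptive_adversary`, and `run`, the adversary's rule, and the toy running-mean learner are all hypothetical.

```python
import random


def sample_sigma_smooth(center, sigma, rng):
    """Draw from the uniform distribution on a length-sigma window of [0, 1].

    The density is 1/sigma on the window and 0 elsewhere, so it is bounded
    pointwise by 1/sigma times the uniform density on [0, 1].
    Assumes 0 < sigma <= 1.
    """
    # Clamp the window's left endpoint so the window stays inside [0, 1].
    lo = min(max(center - sigma / 2, 0.0), 1.0 - sigma)
    return lo + sigma * rng.random()


def adaptive_adversary(history, last_decision):
    """Hypothetical adaptive rule: aim the window at the learner's last decision."""
    return last_decision if history else 0.5


def run(T, sigma, seed=0):
    """Simulate T rounds; the adversary sees past inputs and past decisions."""
    rng = random.Random(seed)
    history, decision = [], 0.5
    for _ in range(T):
        center = adaptive_adversary(history, decision)  # adversary picks a distribution
        x = sample_sigma_smooth(center, sigma, rng)      # nature samples the input
        history.append(x)
        decision = sum(history) / len(history)           # toy learner: running mean
    return history
```

With `sigma = 1` the window is all of [0, 1] and the adversary has no influence (the average-case extreme); as `sigma` shrinks toward 0 the adversary can place each input almost exactly where it wants (approaching the worst case).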

This paper presents a general technique for proving smoothed algorithmic guarantees against adaptive adversaries, in effect reducing the setting of an adaptive adversary to the much simpler case of an oblivious adversary (i.e., an adversary that commits in advance to the entire sequence of input distributions). We apply this technique to prove strong smoothed guarantees for three different problems:

(1)	Online learning: We consider the online prediction problem, where instances are generated from an adaptive sequence of σ-smooth distributions and the hypothesis class has VC dimension d. We bound the regret by \(\tilde{O}\big (\sqrt {T d\ln (1/\sigma)} + d\ln (T/\sigma) \big) \) and provide a near-matching lower bound. Our result shows that under smoothed analysis, learnability against adaptive adversaries is characterized by the finiteness of the VC dimension. This is as opposed to the worst-case analysis, where online learnability is characterized by Littlestone dimension (which is infinite even in the extremely restricted case of one-dimensional threshold functions). Our results fully answer an open question of Rakhlin et al. [64].
(2)	Online discrepancy minimization: We consider the setting of the online Komlós problem, where the input is generated from an adaptive sequence of σ-smooth and isotropic distributions on the \(\ell_2\) unit ball. We bound the \(\ell_\infty\) norm of the discrepancy vector by \(\tilde{O}\big (\ln ^2\big (\frac{nT}{\sigma }\big) \big) \). This is as opposed to the worst-case analysis, where the tight discrepancy bound is \(\Theta (\sqrt {T/n}) \). We show such polylog(nT/σ) discrepancy guarantees are not achievable for non-isotropic σ-smooth distributions.
(3)	Dispersion in online optimization: We consider online optimization with piecewise Lipschitz functions where functions with ℓ discontinuities are chosen by a smoothed adaptive adversary and show that the resulting sequence is \(\big ({\sigma }/{\sqrt {T\ell }}, \tilde{O}\big (\sqrt {T\ell } \big)\big) \)-dispersed. That is, every ball of radius \({\sigma }/{\sqrt {T\ell }} \) is split by \(\tilde{O}\big (\sqrt {T\ell } \big) \) of the partitions made by these functions. This result matches the dispersion parameters of Balcan et al. [13] for oblivious smooth adversaries, up to logarithmic factors. On the other hand, worst-case sequences are trivially (0, T)-dispersed.
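To illustrate the dispersion notion in item (3), the sketch below computes, for one-dimensional functions, the largest number of functions whose discontinuities fall inside a single ball of a given radius. It assumes the reading of the definition given above (a sequence is (w, k)-dispersed if every ball of radius w is split by at most k functions) and restricts to the real line; the function name `max_split_count` and the input layout (one list of discontinuity locations per function) are hypothetical choices made for this example.

```python
from collections import Counter


def max_split_count(discontinuities, w):
    """Max number of functions with a discontinuity inside one ball of radius w (1-D).

    discontinuities[t] is the list of discontinuity locations of function t.
    A ball of radius w is a closed interval of width 2 * w.
    """
    # Flatten to (location, function index) pairs, sorted by location.
    pts = sorted((x, t) for t, xs in enumerate(discontinuities) for x in xs)
    in_window = Counter()  # function index -> number of its points in the window
    best = left = 0
    for x, t in pts:
        in_window[t] += 1
        # Shrink from the left until the window fits in a ball of radius w.
        while x - pts[left][0] > 2 * w:
            t_left = pts[left][1]
            in_window[t_left] -= 1
            if in_window[t_left] == 0:
                del in_window[t_left]
            left += 1
        best = max(best, len(in_window))
    return best
```

A sequence is then (w, k)-dispersed in this one-dimensional sense when `max_split_count(discontinuities, w) <= k`. If all T functions share the same discontinuity, the count is T even at radius 0, which is the trivial (0, T) case mentioned above.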

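Source journal

Journal of the ACM (Engineering & Technology, Computer Science: Theory & Methods)

CiteScore: 7.50
Self-citation rate: 0.00%
Articles published: (no value given)
Review time: 3 months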

About the journal: The best indicator of the scope of the journal is provided by the areas covered by its Editorial Board. These areas change from time to time, as the field evolves. The following areas are currently covered by a member of the Editorial Board: Algorithms and Combinatorial Optimization; Algorithms and Data Structures; Algorithms, Combinatorial Optimization, and Games; Artificial Intelligence; Complexity Theory; Computational Biology; Computational Geometry; Computer Graphics and Computer Vision; Computer-Aided Verification; Cryptography and Security; Cyber-Physical, Embedded, and Real-Time Systems; Database Systems and Theory; Distributed Computing; Economics and Computation; Information Theory; Logic and Computation; Logic, Algorithms, and Complexity; Machine Learning and Computational Learning Theory; Networking; Parallel Computing and Architecture; Programming Languages; Quantum Computing; Randomized Algorithms and Probabilistic Analysis of Algorithms; Scientific Computing and High Performance Computing; Software Engineering; Web Algorithms and Data Mining
