{"title":"Perseus:变分法不等式的简单优化高阶方法","authors":"Tianyi Lin, Michael I. Jordan","doi":"10.1007/s10107-024-02075-2","DOIUrl":null,"url":null,"abstract":"<p>This paper settles an open and challenging question pertaining to the design of simple and optimal high-order methods for solving smooth and monotone variational inequalities (VIs). A VI involves finding <span>\\(x^\\star \\in {\\mathcal {X}}\\)</span> such that <span>\\(\\langle F(x), x - x^\\star \\rangle \\ge 0\\)</span> for all <span>\\(x \\in {\\mathcal {X}}\\)</span>. We consider the setting in which <span>\\(F: {\\mathbb {R}}^d \\rightarrow {\\mathbb {R}}^d\\)</span> is smooth with up to <span>\\((p-1)^{\\text {th}}\\)</span>-order derivatives. For <span>\\(p = 2\\)</span>, the cubic regularization of Newton’s method has been extended to VIs with a global rate of <span>\\(O(\\epsilon ^{-1})\\)</span> (Nesterov in Cubic regularization of Newton’s method for convex problems with constraints, Tech. rep., Université catholique de Louvain, Center for Operations Research and Econometrics (CORE), 2006). An improved rate of <span>\\(O(\\epsilon ^{-2/3}\\log \\log (1/\\epsilon ))\\)</span> can be obtained via an alternative second-order method, but this method requires a nontrivial line-search procedure as an inner loop. Similarly, the existing high-order methods based on line-search procedures have been shown to achieve a rate of <span>\\(O(\\epsilon ^{-2/(p+1)}\\log \\log (1/\\epsilon ))\\)</span> (Bullins and Lai in SIAM J Optim 32(3):2208–2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex–concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353–2382, 2023). As emphasized by Nesterov (Lectures on convex optimization, vol 137, Springer, Berlin, 2018), however, such procedures do not necessarily imply the practical applicability in large-scale applications, and it is desirable to complement these results with a simple high-order VI method that retains the optimality of the more complex methods. We propose a <span>\\(p^{\\text {th}}\\)</span>-order method that does <i>not</i> require any line search procedure and provably converges to a weak solution at a rate of <span>\\(O(\\epsilon ^{-2/(p+1)})\\)</span>. We prove that our <span>\\(p^{\\text {th}}\\)</span>-order method is optimal in the monotone setting by establishing a lower bound of <span>\\(\\Omega (\\epsilon ^{-2/(p+1)})\\)</span> under a generalized linear span assumption. A restarted version of our <span>\\(p^{\\text {th}}\\)</span>-order method attains a linear rate for smooth and <span>\\(p^{\\text {th}}\\)</span>-order uniformly monotone VIs and another restarted version of our <span>\\(p^{\\text {th}}\\)</span>-order method attains a local superlinear rate for smooth and strongly monotone VIs. Further, the similar <span>\\(p^{\\text {th}}\\)</span>-order method achieves a global rate of <span>\\(O(\\epsilon ^{-2/p})\\)</span> for solving smooth and nonmonotone VIs satisfying the Minty condition. Two restarted versions attain a global linear rate under additional <span>\\(p^{\\text {th}}\\)</span>-order uniform Minty condition and a local superlinear rate under additional strong Minty condition.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"16 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Perseus: a simple and optimal high-order method for variational inequalities\",\"authors\":\"Tianyi Lin, Michael I. Jordan\",\"doi\":\"10.1007/s10107-024-02075-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper settles an open and challenging question pertaining to the design of simple and optimal high-order methods for solving smooth and monotone variational inequalities (VIs). A VI involves finding <span>\\\\(x^\\\\star \\\\in {\\\\mathcal {X}}\\\\)</span> such that <span>\\\\(\\\\langle F(x), x - x^\\\\star \\\\rangle \\\\ge 0\\\\)</span> for all <span>\\\\(x \\\\in {\\\\mathcal {X}}\\\\)</span>. We consider the setting in which <span>\\\\(F: {\\\\mathbb {R}}^d \\\\rightarrow {\\\\mathbb {R}}^d\\\\)</span> is smooth with up to <span>\\\\((p-1)^{\\\\text {th}}\\\\)</span>-order derivatives. For <span>\\\\(p = 2\\\\)</span>, the cubic regularization of Newton’s method has been extended to VIs with a global rate of <span>\\\\(O(\\\\epsilon ^{-1})\\\\)</span> (Nesterov in Cubic regularization of Newton’s method for convex problems with constraints, Tech. rep., Université catholique de Louvain, Center for Operations Research and Econometrics (CORE), 2006). An improved rate of <span>\\\\(O(\\\\epsilon ^{-2/3}\\\\log \\\\log (1/\\\\epsilon ))\\\\)</span> can be obtained via an alternative second-order method, but this method requires a nontrivial line-search procedure as an inner loop. Similarly, the existing high-order methods based on line-search procedures have been shown to achieve a rate of <span>\\\\(O(\\\\epsilon ^{-2/(p+1)}\\\\log \\\\log (1/\\\\epsilon ))\\\\)</span> (Bullins and Lai in SIAM J Optim 32(3):2208–2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex–concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353–2382, 2023). As emphasized by Nesterov (Lectures on convex optimization, vol 137, Springer, Berlin, 2018), however, such procedures do not necessarily imply the practical applicability in large-scale applications, and it is desirable to complement these results with a simple high-order VI method that retains the optimality of the more complex methods. We propose a <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order method that does <i>not</i> require any line search procedure and provably converges to a weak solution at a rate of <span>\\\\(O(\\\\epsilon ^{-2/(p+1)})\\\\)</span>. We prove that our <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order method is optimal in the monotone setting by establishing a lower bound of <span>\\\\(\\\\Omega (\\\\epsilon ^{-2/(p+1)})\\\\)</span> under a generalized linear span assumption. A restarted version of our <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order method attains a linear rate for smooth and <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order uniformly monotone VIs and another restarted version of our <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order method attains a local superlinear rate for smooth and strongly monotone VIs. Further, the similar <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order method achieves a global rate of <span>\\\\(O(\\\\epsilon ^{-2/p})\\\\)</span> for solving smooth and nonmonotone VIs satisfying the Minty condition. Two restarted versions attain a global linear rate under additional <span>\\\\(p^{\\\\text {th}}\\\\)</span>-order uniform Minty condition and a local superlinear rate under additional strong Minty condition.</p>\",\"PeriodicalId\":18297,\"journal\":{\"name\":\"Mathematical Programming\",\"volume\":\"16 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Programming\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s10107-024-02075-2\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Programming","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10107-024-02075-2","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
摘要
本文解决了一个开放且具有挑战性的问题,即设计简单且最优的高阶方法来求解平滑且单调的变分不等式(VIs)。变分不等式涉及找到 \(x^\star\in {\mathcal {X}}) such that \(angle F(x), x - x^\star\rangle \ge 0\) for all \(x \in {\mathcal {X}}\).我们考虑这样一种情况:F: {\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\) 是光滑的,最多有((p-1)^{text {th}})阶导数。对于 \(p = 2\), 牛顿方法的立方正则化已经扩展到 VIs,其全局速率为 \(O(\epsilon ^{-1})\)(Nesterov 在 Cubic regularization of Newton's method for convex problems with constraints, Tech. rep.通过另一种二阶方法可以得到一个改进的速率(O(\epsilon ^{-2/3}\log \log (1/\epsilon ))\) ,但这种方法需要一个非线性的线性搜索过程作为内循环。同样,现有的基于线性搜索过程的高阶方法已经被证明可以达到 \(O(\epsilon ^{-2/(p+1)}\log \log (1/\epsilon ))\) (Bullins 和 Lai 在 SIAM J Optim 32(3):2208-2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex-concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353-2382, 2023)。然而,正如涅斯捷罗夫(Lectures on convex optimization, vol 137, Springer, Berlin, 2018)所强调的,这些程序并不一定意味着在大规模应用中的实际适用性,我们希望用一种简单的高阶 VI 方法来补充这些结果,同时保留更复杂方法的最优性。我们提出了一种 \(p^{text {th}}\)阶方法,它不需要任何线性搜索过程,并能以 \(O(\epsilon ^{-2/(p+1)})\) 的速率收敛到弱解。我们通过在广义线性跨度假设下建立一个 \(\Omega (\epsilon ^{-2/(p+1)})\) 的下限,证明我们的 \(p^{text {th}}\)-order 方法在单调设置中是最优的。我们的\(p^{text {th}}\)阶方法的重启版本对于平滑和\(p^{text {th}}\)阶均匀单调VI达到了线性速率,我们的\(p^{text {th}}\)阶方法的另一个重启版本对于平滑和强单调VI达到了局部超线性速率。此外,类似的\(p^{text {th}}\)阶方法在求解满足Minty条件的平滑和非单调VI时达到了\(O(epsilon ^{-2/p})\)的全局速率。两个重启版本在附加的 \(p^{text {th}}\)-order uniform Minty 条件下达到了全局线性速率,在附加的强 Minty 条件下达到了局部超线性速率。
Perseus: a simple and optimal high-order method for variational inequalities
This paper settles an open and challenging question pertaining to the design of simple and optimal high-order methods for solving smooth and monotone variational inequalities (VIs). A VI involves finding \(x^\star \in {\mathcal {X}}\) such that \(\langle F(x), x - x^\star \rangle \ge 0\) for all \(x \in {\mathcal {X}}\). We consider the setting in which \(F: {\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\) is smooth with up to \((p-1)^{\text {th}}\)-order derivatives. For \(p = 2\), the cubic regularization of Newton’s method has been extended to VIs with a global rate of \(O(\epsilon ^{-1})\) (Nesterov in Cubic regularization of Newton’s method for convex problems with constraints, Tech. rep., Université catholique de Louvain, Center for Operations Research and Econometrics (CORE), 2006). An improved rate of \(O(\epsilon ^{-2/3}\log \log (1/\epsilon ))\) can be obtained via an alternative second-order method, but this method requires a nontrivial line-search procedure as an inner loop. Similarly, the existing high-order methods based on line-search procedures have been shown to achieve a rate of \(O(\epsilon ^{-2/(p+1)}\log \log (1/\epsilon ))\) (Bullins and Lai in SIAM J Optim 32(3):2208–2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex–concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353–2382, 2023). As emphasized by Nesterov (Lectures on convex optimization, vol 137, Springer, Berlin, 2018), however, such procedures do not necessarily imply the practical applicability in large-scale applications, and it is desirable to complement these results with a simple high-order VI method that retains the optimality of the more complex methods. We propose a \(p^{\text {th}}\)-order method that does not require any line search procedure and provably converges to a weak solution at a rate of \(O(\epsilon ^{-2/(p+1)})\). We prove that our \(p^{\text {th}}\)-order method is optimal in the monotone setting by establishing a lower bound of \(\Omega (\epsilon ^{-2/(p+1)})\) under a generalized linear span assumption. A restarted version of our \(p^{\text {th}}\)-order method attains a linear rate for smooth and \(p^{\text {th}}\)-order uniformly monotone VIs and another restarted version of our \(p^{\text {th}}\)-order method attains a local superlinear rate for smooth and strongly monotone VIs. Further, the similar \(p^{\text {th}}\)-order method achieves a global rate of \(O(\epsilon ^{-2/p})\) for solving smooth and nonmonotone VIs satisfying the Minty condition. Two restarted versions attain a global linear rate under additional \(p^{\text {th}}\)-order uniform Minty condition and a local superlinear rate under additional strong Minty condition.
期刊介绍:
Mathematical Programming publishes original articles dealing with every aspect of mathematical optimization; that is, everything of direct or indirect use concerning the problem of optimizing a function of many variables, often subject to a set of constraints. This involves theoretical and computational issues as well as application studies. Included, along with the standard topics of linear, nonlinear, integer, conic, stochastic and combinatorial optimization, are techniques for formulating and applying mathematical programming models, convex, nonsmooth and variational analysis, the theory of polyhedra, variational inequalities, and control and game theory viewed from the perspective of mathematical programming.