Single-Projection Procedure for Infinite Dimensional Convex Optimization Problems
Hoa T. Bui, Regina S. Burachik, Evgeni A. Nurminski, Matthew K. Tam
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1646-1678, June 2024. DOI: 10.1137/22m1530173.
Abstract. We consider a class of convex optimization problems in a Hilbert space that can be solved by performing a single projection, i.e., by projecting an infeasible point onto the feasible set. Our results improve those established for the linear programming setting in Nurminski (2015) by considering problems that (i) may have multiple solutions, (ii) do not satisfy strict complementarity conditions, and (iii) possess nonlinear convex constraints. As a by-product of our analysis, we provide a quantitative estimate on the required distance between the infeasible point and the feasible set in order for its projection to be a solution of the problem. Our analysis relies on a “sharpness” property of the constraint set, a new property we introduce here.
Exact Augmented Lagrangian Duality for Mixed Integer Convex Optimization
Avinash Bhardwaj, Vishnu Narayanan, Abhishek Pathapati
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1622-1645, June 2024. DOI: 10.1137/22m1526204.
Abstract. The augmented Lagrangian dual augments the classical Lagrangian dual with a nonnegative nonlinear penalty function of the violation of the relaxed/dualized constraints in order to reduce the duality gap. We investigate the cases in which mixed integer convex optimization problems have an exact penalty representation using sharp augmenting functions (norms as augmenting penalty functions). We present a generalizable constructive proof technique for proving the existence of exact penalty representations for mixed integer convex programs under specific conditions using the associated value functions. This generalizes the recent results for mixed integer linear programming [M. J. Feizollahi, S. Ahmed, and A. Sun, Math. Program., 161 (2017), pp. 365–387] and mixed integer quadratic programming [X. Gu, S. Ahmed, and S. S. Dey, SIAM J. Optim., 30 (2020), pp. 781–797], while also providing an alternative proof for the aforementioned results along with a quantification of the finite penalty parameter in these cases.
Frugal Splitting Operators: Representation, Minimal Lifting, and Convergence
Martin Morin, Sebastian Banert, Pontus Giselsson
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1595-1621, June 2024. DOI: 10.1137/22m1531105.
Abstract. We investigate frugal splitting operators for finite sum monotone inclusion problems. These operators utilize exactly one direct or resolvent evaluation of each operator of the sum, and the splitting operator’s output is dictated by linear combinations of these evaluations’ inputs and outputs. To facilitate analysis, we introduce a novel representation of frugal splitting operators via a generalized primal-dual resolvent. The representation is characterized by an index and four matrices, and we provide conditions on these that ensure equivalence between the classes of frugal splitting operators and generalized primal-dual resolvents. Our representation paves the way for new results regarding lifting numbers and the development of a unified convergence analysis for frugal splitting operator methods, contingent on the directly evaluated operators being cocoercive. The minimal lifting number is [math] where [math] is the number of monotone operators and [math] is the number of direct evaluations in the splitting. Notably, this lifting number is achievable only if the first and last operator evaluations are resolvent evaluations. These results generalize the minimal lifting results by Ryu and by Malitsky and Tam that consider frugal resolvent splittings. Building on our representation, we delineate a constructive method to design frugal splitting operators, exemplified in the design of a novel, convergent, and parallelizable frugal splitting operator with minimal lifting.
Graph and Distributed Extensions of the Douglas–Rachford Method
Kristian Bredies, Enis Chenchene, Emanuele Naldi
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1569-1594, June 2024. DOI: 10.1137/22m1535097.
Abstract. In this paper, we propose several graph-based extensions of the Douglas–Rachford splitting (DRS) method to solve monotone inclusion problems involving the sum of [math] maximal monotone operators. Our construction is based on the choice of two nested graphs, to which we associate a generalization of the DRS algorithm that presents a prescribed structure. The resulting schemes can be understood as unconditionally stable frugal resolvent splitting methods with minimal lifting in the sense of Ryu [Math. Program., 182 (2020), pp. 233–273] as well as instances of the (degenerate) preconditioned proximal point method, which provides robust convergence guarantees. We further describe how the graph-based extensions of the DRS method can be leveraged to design new fully distributed protocols. Applications to a congested optimal transport problem and to distributed support vector machines show interesting connections with the underlying graph topology and highly competitive performance compared with state-of-the-art distributed optimization approaches.
On Enhanced KKT Optimality Conditions for Smooth Nonlinear Optimization
Roberto Andreani, María L. Schuverdt, Leonardo D. Secchin
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1515-1539, June 2024. DOI: 10.1137/22m1539678.
Abstract. The Fritz John (FJ) and Karush–Kuhn–Tucker (KKT) conditions are fundamental tools for characterizing minimizers and form the basis of almost all methods for constrained optimization. Since the seminal works of Fritz John, Karush, Kuhn, and Tucker, FJ/KKT conditions have been enhanced by adding extra necessary conditions. Such an extension was initially proposed by Hestenes in the 1970s and later extensively studied by Bertsekas and collaborators. In this work, we revisit enhanced KKT stationarity for standard (smooth) nonlinear programming. We argue that every KKT point satisfies the usual enhanced versions found in the literature. Therefore, enhanced KKT stationarity only concerns the Lagrange multipliers. We then analyze some properties of the corresponding multipliers under the quasi-normality constraint qualification (QNCQ), showing in particular that the set of so-called quasinormal multipliers is compact under QNCQ. Also, we report some consequences of introducing an extra abstract constraint to the problem. Given that enhanced FJ/KKT concepts are obtained by aggregating sequential conditions to FJ/KKT, we discuss the relevance of our findings with respect to the well-known sequential optimality conditions, which have been crucial in generalizing the global convergence of a well-established safeguarded augmented Lagrangian method. Finally, we apply our theory to mathematical programs with complementarity constraints and multiobjective problems, improving and elucidating previous results in the literature.
Weighted Geometric Mean, Minimum Mediated Set, and Optimal Simple Second-Order Cone Representation
Jie Wang
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1490-1514, June 2024. DOI: 10.1137/22m1531257.
Abstract. We study optimal simple second-order cone representations (a particular subclass of second-order cone representations) for weighted geometric means, which turn out to be closely related to minimum mediated sets. Several lower bounds and upper bounds on the size of optimal simple second-order cone representations are proved. In the case of bivariate weighted geometric means (equivalently, one-dimensional mediated sets), we are able to prove the exact size of an optimal simple second-order cone representation and give an algorithm to compute one. In the general case, fast heuristic algorithms and traversal algorithms are proposed to compute an approximately optimal simple second-order cone representation. Finally, applications to polynomial optimization, matrix optimization, and quantum information are provided.
A Copositive Framework for Analysis of Hybrid Ising-Classical Algorithms
Robin Brown, David E. Bernal Neira, Davide Venturelli, Marco Pavone
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1455-1489, June 2024. DOI: 10.1137/22m1514581.
Abstract. Recent years have seen significant advances in quantum/quantum-inspired technologies capable of approximately searching for the ground state of Ising spin Hamiltonians. The promise of leveraging such technologies to accelerate the solution of difficult optimization problems has spurred an increased interest in exploring methods to integrate Ising problems as part of their solution process, with existing approaches ranging from direct transcription to hybrid quantum-classical approaches rooted in existing optimization algorithms. While it is widely acknowledged that quantum computers should augment classical computers, rather than replace them entirely, comparatively little attention has been directed toward deriving analytical characterizations of their interactions. In this paper, we present a formal analysis of hybrid algorithms in the context of solving mixed-binary quadratic programs (MBQP) via Ising solvers. By leveraging an existing completely positive reformulation of MBQPs, as well as a new strong-duality result, we show the exactness of the dual problem over the cone of copositive matrices, thus allowing the resulting reformulation to inherit the straightforward analysis of convex optimization. We propose to solve this reformulation with a hybrid quantum-classical cutting-plane algorithm. Using existing complexity results for convex cutting-plane algorithms, we deduce that the classical portion of this hybrid framework is guaranteed to be polynomial time. This suggests that when applied to NP-hard problems, the complexity of the solution is shifted onto the subroutine handled by the Ising solver.
Benign Landscapes of Low-Dimensional Relaxations for Orthogonal Synchronization on General Graphs
Andrew D. McRae, Nicolas Boumal
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1427-1454, June 2024. DOI: 10.1137/23m1584642.
Abstract. Orthogonal group synchronization is the problem of estimating [math] elements [math] from the [math] orthogonal group given some relative measurements [math]. The least-squares formulation is nonconvex. To avoid its local minima, a Shor-type convex relaxation squares the dimension of the optimization problem from [math] to [math]. Alternatively, Burer–Monteiro-type nonconvex relaxations have generic landscape guarantees at dimension [math]. For smaller relaxations, the problem structure matters. It has been observed in the robotics literature that, for simultaneous localization and mapping problems, it seems sufficient to increase the dimension by a small constant multiple over the original. We partially explain this. This also has implications for Kuramoto oscillators. Specifically, we minimize the least-squares cost function in terms of estimators [math]. For [math], each [math] is relaxed to the Stiefel manifold [math] of [math] matrices with orthonormal rows. The available measurements implicitly define a (connected) graph [math] on [math] vertices. In the noiseless case, we show that, for all connected graphs [math], second-order critical points are globally optimal as soon as [math]. (This implies that Kuramoto oscillators on [math] synchronize for all [math].) This result is the best possible for general graphs; the previous best known result requires [math]. For [math], our result is robust to modest amounts of noise (depending on [math] and [math]). Our proof uses a novel randomized choice of tangent direction to prove (near-)optimality of second-order critical points. Finally, we partially extend our noiseless landscape results to the complex case (unitary group); we show that there are no spurious local minima when [math].
A Gradient Complexity Analysis for Minimizing the Sum of Strongly Convex Functions with Varying Condition Numbers
Nuozhou Wang, Shuzhong Zhang
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1374-1401, June 2024. DOI: 10.1137/22m1503646.
Abstract. A popular approach to minimizing a finite sum of smooth convex functions is stochastic gradient descent (SGD) and its variants. Fundamental research questions associated with SGD include (i) how to find a lower bound on the number of times that the gradient oracle of each individual function must be assessed in order to find an [math]-minimizer of the overall objective; (ii) how to design algorithms that are guaranteed to find an [math]-minimizer of the overall objective in expectation while assessing the gradient oracle of each function no more than a certain number of times (in terms of [math]), i.e., an upper bound. If these two bounds are of the same order of magnitude, then the algorithms may be called optimal. Most existing results along this line of research typically assume that the functions in the objective share the same condition number. In this paper, the first model we study is the problem of minimizing the sum of finitely many strongly convex functions whose condition numbers are all different. We propose an SGD-based method for this model and show that it is optimal in gradient computations, up to a logarithmic factor. We then consider a constrained separate block optimization model and present lower and upper bounds for its gradient computation complexity. Next, we propose solving the Fenchel dual of the constrained block optimization model via the generalized SSNM introduced earlier in the paper, and show that it yields a lower iteration complexity than solving the original model by the ADMM-type approach. Finally, we extend the analysis to the general composite convex optimization model and obtain gradient-computation complexity results under certain conditions.
Stochastic Differential Equations for Modeling First Order Optimization Methods
M. Dambrine, Ch. Dossal, B. Puig, A. Rondepierre
SIAM Journal on Optimization, Volume 34, Issue 2, Pages 1402-1426, June 2024. DOI: 10.1137/21m1435665.
Abstract. In this article, a family of SDEs is derived as a tool to understand the behavior of numerical optimization methods under random evaluations of the gradient. Our objective is to transpose to the stochastic setting the approach of introducing continuous ODE versions of discrete optimization schemes in order to understand their asymptotic behavior. We consider a continuous version of the stochastic gradient scheme and of a stochastic inertial system. This article first studies the quality of the approximation of the discrete scheme by an SDE as the step size tends to 0. It then presents new asymptotic bounds on the values [math], where [math] is a solution of the SDE and [math], when [math] is convex and under integrability conditions on the noise. Results are provided under two sets of hypotheses: first considering [math] and convex functions, and then adding some geometrical properties of [math]. All of these results provide insight into the behavior of these inertial and perturbed algorithms in the setting of stochastic algorithms.