首页 > 最新文献

SIAM Journal on Optimization最新文献

英文 中文
A Copositive Framework for Analysis of Hybrid Ising-Classical Algorithms 分析伊辛-经典混合算法的共正框架
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-15 DOI: 10.1137/22m1514581
Robin Brown, David E. Bernal Neira, Davide Venturelli, Marco Pavone
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1455-1489, June 2024.
Abstract. Recent years have seen significant advances in quantum/quantum-inspired technologies capable of approximately searching for the ground state of Ising spin Hamiltonians. The promise of leveraging such technologies to accelerate the solution of difficult optimization problems has spurred an increased interest in exploring methods to integrate Ising problems as part of their solution process, with existing approaches ranging from direct transcription to hybrid quantum-classical approaches rooted in existing optimization algorithms. While it is widely acknowledged that quantum computers should augment classical computers, rather than replace them entirely, comparatively little attention has been directed toward deriving analytical characterizations of their interactions. In this paper, we present a formal analysis of hybrid algorithms in the context of solving mixed-binary quadratic programs (MBQP) via Ising solvers. By leveraging an existing completely positive reformulation of MBQPs, as well as a new strong-duality result, we show the exactness of the dual problem over the cone of copositive matrices, thus allowing the resulting reformulation to inherit the straightforward analysis of convex optimization. We propose to solve this reformulation with a hybrid quantum-classical cutting-plane algorithm. Using existing complexity results for convex cutting-plane algorithms, we deduce that the classical portion of this hybrid framework is guaranteed to be polynomial time. This suggests that when applied to NP-hard problems, the complexity of the solution is shifted onto the subroutine handled by the Ising solver.
SIAM 优化期刊》,第 34 卷第 2 期,第 1455-1489 页,2024 年 6 月。 摘要。近年来,能够近似搜索伊辛自旋哈密顿的基态的量子/量子启发技术取得了重大进展。利用这些技术加速解决困难的优化问题的前景,激发了人们对探索将伊辛问题作为其解决过程一部分的方法的更大兴趣,现有方法包括直接转录和植根于现有优化算法的量子-古典混合方法。虽然人们普遍认为量子计算机应该增强经典计算机的功能,而不是完全取代经典计算机,但人们却很少关注量子计算机相互作用的分析特征。在本文中,我们以通过伊辛求解器求解混合二元二次方程程序(MBQP)为背景,对混合算法进行了正式分析。通过利用 MBQPs 现有的完全正重构以及新的强对偶结果,我们展示了共正矩阵锥上对偶问题的精确性,从而使由此产生的重构继承了凸优化的直接分析。我们建议用混合量子经典切面算法来解决这个重构问题。利用凸切割平面算法的现有复杂性结果,我们推导出这个混合框架的经典部分保证是多项式时间。这表明,当应用于 NP-困难问题时,求解的复杂性会转移到伊辛求解器处理的子程序上。
{"title":"A Copositive Framework for Analysis of Hybrid Ising-Classical Algorithms","authors":"Robin Brown, David E. Bernal Neira, Davide Venturelli, Marco Pavone","doi":"10.1137/22m1514581","DOIUrl":"https://doi.org/10.1137/22m1514581","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1455-1489, June 2024. <br/> Abstract. Recent years have seen significant advances in quantum/quantum-inspired technologies capable of approximately searching for the ground state of Ising spin Hamiltonians. The promise of leveraging such technologies to accelerate the solution of difficult optimization problems has spurred an increased interest in exploring methods to integrate Ising problems as part of their solution process, with existing approaches ranging from direct transcription to hybrid quantum-classical approaches rooted in existing optimization algorithms. While it is widely acknowledged that quantum computers should augment classical computers, rather than replace them entirely, comparatively little attention has been directed toward deriving analytical characterizations of their interactions. In this paper, we present a formal analysis of hybrid algorithms in the context of solving mixed-binary quadratic programs (MBQP) via Ising solvers. By leveraging an existing completely positive reformulation of MBQPs, as well as a new strong-duality result, we show the exactness of the dual problem over the cone of copositive matrices, thus allowing the resulting reformulation to inherit the straightforward analysis of convex optimization. We propose to solve this reformulation with a hybrid quantum-classical cutting-plane algorithm. Using existing complexity results for convex cutting-plane algorithms, we deduce that the classical portion of this hybrid framework is guaranteed to be polynomial time. This suggests that when applied to NP-hard problems, the complexity of the solution is shifted onto the subroutine handled by the Ising solver.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benign Landscapes of Low-Dimensional Relaxations for Orthogonal Synchronization on General Graphs 通用图上正交同步的低维松弛的良性景观
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-11 DOI: 10.1137/23m1584642
Andrew D. McRae, Nicolas Boumal
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1427-1454, June 2024.
Abstract. Orthogonal group synchronization is the problem of estimating [math] elements [math] from the [math] orthogonal group given some relative measurements [math]. The least-squares formulation is nonconvex. To avoid its local minima, a Shor-type convex relaxation squares the dimension of the optimization problem from [math] to [math]. Alternatively, Burer–Monteiro-type nonconvex relaxations have generic landscape guarantees at dimension [math]. For smaller relaxations, the problem structure matters. It has been observed in the robotics literature that, for simultaneous localization and mapping problems, it seems sufficient to increase the dimension by a small constant multiple over the original. We partially explain this. This also has implications for Kuramoto oscillators. Specifically, we minimize the least-squares cost function in terms of estimators [math]. For [math], each [math] is relaxed to the Stiefel manifold [math] of [math] matrices with orthonormal rows. The available measurements implicitly define a (connected) graph [math] on [math] vertices. In the noiseless case, we show that, for all connected graphs [math], second-order critical points are globally optimal as soon as [math]. (This implies that Kuramoto oscillators on [math] synchronize for all [math].) This result is the best possible for general graphs; the previous best known result requires [math]. For [math], our result is robust to modest amounts of noise (depending on [math] and [math]). Our proof uses a novel randomized choice of tangent direction to prove (near-)optimality of second-order critical points. Finally, we partially extend our noiseless landscape results to the complex case (unitary group); we show that there are no spurious local minima when [math].
SIAM 优化期刊》,第 34 卷第 2 期,第 1427-1454 页,2024 年 6 月。摘要正交群同步是在给定一些相对测量值[数学]的情况下,从[数学]正交群中估计[数学]元素[数学]的问题。最小二乘公式是非凸的。为了避免出现局部极小值,肖尔型凸松弛将优化问题的维度从[数学]平方到[数学]。另外,Burer-Monteiro 类型的非凸松弛在维数 [math] 时有一般景观保证。对于较小的松弛,问题结构很重要。根据机器人学文献的观察,对于同时存在的定位和映射问题,似乎只需将维度提高到原来的一个小常数倍数即可。我们将对此做出部分解释。这对仓本振荡器也有影响。具体来说,我们要最小化估计器[math]的最小二乘成本函数。对于[math],每个[math]都被放宽到具有正交行的[math]矩阵的 Stiefel 流形[math]。可用的测量值隐含地定义了[math]顶点上的[math](连通)图。在无噪声情况下,我们证明,对于所有连通图[math],只要[math]的二阶临界点是全局最优的。(这意味着[math]上的仓本振荡器对所有[math]都是同步的)。这个结果是一般图的最佳结果;之前已知的最佳结果需要 [math]。对于 [math],我们的结果对适量噪声(取决于 [math] 和 [math])是稳健的。我们的证明使用了一种新颖的随机选择切线方向的方法来证明二阶临界点的(近)最优性。最后,我们将无噪声景观结果部分扩展到复数情况(单元群);我们证明了当 [math].
{"title":"Benign Landscapes of Low-Dimensional Relaxations for Orthogonal Synchronization on General Graphs","authors":"Andrew D. McRae, Nicolas Boumal","doi":"10.1137/23m1584642","DOIUrl":"https://doi.org/10.1137/23m1584642","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1427-1454, June 2024. <br/>Abstract. Orthogonal group synchronization is the problem of estimating [math] elements [math] from the [math] orthogonal group given some relative measurements [math]. The least-squares formulation is nonconvex. To avoid its local minima, a Shor-type convex relaxation squares the dimension of the optimization problem from [math] to [math]. Alternatively, Burer–Monteiro-type nonconvex relaxations have generic landscape guarantees at dimension [math]. For smaller relaxations, the problem structure matters. It has been observed in the robotics literature that, for simultaneous localization and mapping problems, it seems sufficient to increase the dimension by a small constant multiple over the original. We partially explain this. This also has implications for Kuramoto oscillators. Specifically, we minimize the least-squares cost function in terms of estimators [math]. For [math], each [math] is relaxed to the Stiefel manifold [math] of [math] matrices with orthonormal rows. The available measurements implicitly define a (connected) graph [math] on [math] vertices. In the noiseless case, we show that, for all connected graphs [math], second-order critical points are globally optimal as soon as [math]. (This implies that Kuramoto oscillators on [math] synchronize for all [math].) This result is the best possible for general graphs; the previous best known result requires [math]. For [math], our result is robust to modest amounts of noise (depending on [math] and [math]). Our proof uses a novel randomized choice of tangent direction to prove (near-)optimality of second-order critical points. Finally, we partially extend our noiseless landscape results to the complex case (unitary group); we show that there are no spurious local minima when [math].","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Gradient Complexity Analysis for Minimizing the Sum of Strongly Convex Functions with Varying Condition Numbers 最小化条件数强凸函数之和的梯度复杂性分析
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-11 DOI: 10.1137/22m1503646
Nuozhou Wang, Shuzhong Zhang
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1374-1401, June 2024.
Abstract. A popular approach to minimizing a finite sum of smooth convex functions is stochastic gradient descent (SGD) and its variants. Fundamental research questions associated with SGD include (i) how to find a lower bound on the number of times that the gradient oracle of each individual function must be assessed in order to find an [math]-minimizer of the overall objective; (ii) how to design algorithms which guarantee finding an [math]-minimizer of the overall objective in expectation no more than a certain number of times (in terms of [math]) that the gradient oracle of each function needs to be assessed (i.e., upper bound). If these two bounds are at the same order of magnitude, then the algorithms may be called optimal. Most existing results along this line of research typically assume that the functions in the objective share the same condition number. In this paper, the first model we study is the problem of minimizing the sum of finitely many strongly convex functions whose condition numbers are all different. We propose an SGD-based method for this model and show that it is optimal in gradient computations, up to a logarithmic factor. We then consider a constrained separate block optimization model and present lower and upper bounds for its gradient computation complexity. Next, we propose solving the Fenchel dual of the constrained block optimization model via generalized SSNM, which we introduce earlier, and show that it yields a lower iteration complexity than solving the original model by the ADMM-type approach. Finally, we extend the analysis to the general composite convex optimization model and obtain gradient-computation complexity results under certain conditions.
SIAM 优化期刊》,第 34 卷,第 2 期,第 1374-1401 页,2024 年 6 月。 摘要。随机梯度下降法(SGD)及其变体是最小化平滑凸函数有限和的一种常用方法。与 SGD 相关的基本研究问题包括:(i) 如何找到为找到总目标的[数学]最小值而必须评估每个单独函数的梯度oracle 的次数的下限;(ii) 如何设计算法,保证在期望值不超过每个函数的梯度oracle 需要评估的一定次数(以[数学]为单位)的情况下找到总目标的[数学]最小值(即上限)。如果这两个界限的数量级相同,那么这些算法就可以称为最优算法。沿着这一研究方向的大多数现有成果通常都假设目标中的函数具有相同的条件数。在本文中,我们研究的第一个模型是最小化条件数都不同的有限多个强凸函数之和的问题。我们针对该模型提出了一种基于 SGD 的方法,并证明该方法在梯度计算中是最优的,最大可达对数因子。然后,我们考虑了一个受约束的独立块优化模型,并提出了其梯度计算复杂度的下限和上限。接下来,我们提出通过广义 SSNM 来求解受限分块优化模型的 Fenchel 对偶,并证明这种方法的迭代复杂度低于通过 ADMM 类型方法求解原始模型的迭代复杂度。最后,我们将分析扩展到一般复合凸优化模型,并在一定条件下得到梯度计算复杂度结果。
{"title":"A Gradient Complexity Analysis for Minimizing the Sum of Strongly Convex Functions with Varying Condition Numbers","authors":"Nuozhou Wang, Shuzhong Zhang","doi":"10.1137/22m1503646","DOIUrl":"https://doi.org/10.1137/22m1503646","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1374-1401, June 2024. <br/> Abstract. A popular approach to minimizing a finite sum of smooth convex functions is stochastic gradient descent (SGD) and its variants. Fundamental research questions associated with SGD include (i) how to find a lower bound on the number of times that the gradient oracle of each individual function must be assessed in order to find an [math]-minimizer of the overall objective; (ii) how to design algorithms which guarantee finding an [math]-minimizer of the overall objective in expectation no more than a certain number of times (in terms of [math]) that the gradient oracle of each function needs to be assessed (i.e., upper bound). If these two bounds are at the same order of magnitude, then the algorithms may be called optimal. Most existing results along this line of research typically assume that the functions in the objective share the same condition number. In this paper, the first model we study is the problem of minimizing the sum of finitely many strongly convex functions whose condition numbers are all different. We propose an SGD-based method for this model and show that it is optimal in gradient computations, up to a logarithmic factor. We then consider a constrained separate block optimization model and present lower and upper bounds for its gradient computation complexity. Next, we propose solving the Fenchel dual of the constrained block optimization model via generalized SSNM, which we introduce earlier, and show that it yields a lower iteration complexity than solving the original model by the ADMM-type approach. Finally, we extend the analysis to the general composite convex optimization model and obtain gradient-computation complexity results under certain conditions.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Differential Equations for Modeling First Order Optimization Methods 一阶优化方法建模的随机微分方程
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-11 DOI: 10.1137/21m1435665
M. Dambrine, Ch. Dossal, B. Puig, A. Rondepierre
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1402-1426, June 2024.
Abstract. In this article, a family of SDEs are derived as a tool to understand the behavior of numerical optimization methods under random evaluations of the gradient. Our objective is to transpose the introduction of continuous versions through ODEs to understand the asymptotic behavior of a discrete optimization scheme to the stochastic setting. We consider a continuous version of the stochastic gradient scheme and of a stochastic inertial system. This article first studies the quality of the approximation of the discrete scheme by an SDE when the step size tends to 0. Then, it presents new asymptotic bounds on the values [math], where [math] is a solution of the SDE and [math], when [math] is convex and under integrability conditions on the noise. Results are provided under two sets of hypotheses: first considering [math] and convex functions and then adding some geometrical properties of [math]. All of these results provide insight on the behavior of these inertial and perturbed algorithms in the setting of stochastic algorithms.
SIAM 优化期刊》,第 34 卷第 2 期,第 1402-1426 页,2024 年 6 月。摘要本文导出了一系列 SDEs,作为理解梯度随机评估下数值优化方法行为的工具。我们的目的是通过 ODEs 将连续版本的引入转置到随机环境中,以理解离散优化方案的渐近行为。我们考虑了随机梯度方案的连续版本和随机惯性系统。本文首先研究了当步长趋近于 0 时,离散方案与 SDE 的近似质量。然后,本文提出了[math](其中[math]是 SDE 的一个解)和[math](其中[math]是凸的,并且在噪声的可整性条件下)值的新渐近约束。我们提供了两组假设下的结果:首先考虑 [math] 和凸函数,然后添加 [math] 的一些几何特性。所有这些结果都有助于深入了解这些惯性算法和扰动算法在随机算法环境下的行为。
{"title":"Stochastic Differential Equations for Modeling First Order Optimization Methods","authors":"M. Dambrine, Ch. Dossal, B. Puig, A. Rondepierre","doi":"10.1137/21m1435665","DOIUrl":"https://doi.org/10.1137/21m1435665","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1402-1426, June 2024. <br/>Abstract. In this article, a family of SDEs are derived as a tool to understand the behavior of numerical optimization methods under random evaluations of the gradient. Our objective is to transpose the introduction of continuous versions through ODEs to understand the asymptotic behavior of a discrete optimization scheme to the stochastic setting. We consider a continuous version of the stochastic gradient scheme and of a stochastic inertial system. This article first studies the quality of the approximation of the discrete scheme by an SDE when the step size tends to 0. Then, it presents new asymptotic bounds on the values [math], where [math] is a solution of the SDE and [math], when [math] is convex and under integrability conditions on the noise. Results are provided under two sets of hypotheses: first considering [math] and convex functions and then adding some geometrical properties of [math]. All of these results provide insight on the behavior of these inertial and perturbed algorithms in the setting of stochastic algorithms.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Certifying Optimality of Bell Inequality Violations: Noncommutative Polynomial Optimization through Semidefinite Programming and Local Optimization 认证贝尔不等式违反的最优性:通过半定量编程和局部优化实现非交换多项式优化
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-09 DOI: 10.1137/22m1473340
Timotej Hrga, Igor Klep, Janez Povh
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1341-1373, June 2024.
Abstract. Bell inequalities are pillars of quantum physics in that their violations imply that certain properties of quantum physics (e.g., entanglement) cannot be represented by any classical picture of physics. In this article Bell inequalities and their violations are considered through the lens of noncommutative polynomial optimization. Optimality of these violations is certified for a large majority of a set of standard Bell inequalities, denoted A2–A89 in the literature. The main techniques used in the paper include the NPA hierarchy, i.e., the noncommutative version of the Lasserre semidefinite programming (SDP) hierarchies based on the Helton–McCullough Positivstellensatz, the Gelfand–Naimark–Segal (GNS) construction with a novel use of the Artin–Wedderburn theory for rounding and projecting, and nonlinear programming (NLP). A new “Newton chip”-like technique for reducing sizes of SDPs arising in the constructed polynomial optimization problems is presented. This technique is based on conditional expectations. Finally, noncommutative Gröbner bases are exploited to certify when an optimizer (a solution yielding optimum violation) cannot be extracted from a dual SDP solution.
SIAM 优化期刊》,第 34 卷第 2 期,第 1341-1373 页,2024 年 6 月。 摘要:贝尔不等式是量子物理学的支柱。贝尔不等式是量子物理学的支柱,因为违反贝尔不等式意味着量子物理学的某些特性(如纠缠)无法用任何经典物理学图景来表示。本文通过非交换多项式优化的视角来研究贝尔不等式及其违反情况。这些违反行为的最优性得到了一组标准贝尔不等式(文献中称为 A2-A89)中绝大多数不等式的认证。论文中使用的主要技术包括 NPA 层次结构(即基于 Helton-McCullough Positivstellensatz 的 Lasserre 半定量编程(SDP)层次结构的非交换版本)、Gelfand-Naimark-Segal(GNS)结构(新颖地使用 Artin-Wedderburn 理论进行舍入和投影)以及非线性编程(NLP)。本文提出了一种类似 "牛顿芯片 "的新技术,用于减小构造多项式优化问题中出现的 SDP 的大小。该技术基于条件期望。最后,在无法从对偶 SDP 解决方案中提取优化器(产生最佳违规的解决方案)时,利用非交换格罗布纳基进行证明。
{"title":"Certifying Optimality of Bell Inequality Violations: Noncommutative Polynomial Optimization through Semidefinite Programming and Local Optimization","authors":"Timotej Hrga, Igor Klep, Janez Povh","doi":"10.1137/22m1473340","DOIUrl":"https://doi.org/10.1137/22m1473340","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1341-1373, June 2024. <br/> Abstract. Bell inequalities are pillars of quantum physics in that their violations imply that certain properties of quantum physics (e.g., entanglement) cannot be represented by any classical picture of physics. In this article Bell inequalities and their violations are considered through the lens of noncommutative polynomial optimization. Optimality of these violations is certified for a large majority of a set of standard Bell inequalities, denoted A2–A89 in the literature. The main techniques used in the paper include the NPA hierarchy, i.e., the noncommutative version of the Lasserre semidefinite programming (SDP) hierarchies based on the Helton–McCullough Positivstellensatz, the Gelfand–Naimark–Segal (GNS) construction with a novel use of the Artin–Wedderburn theory for rounding and projecting, and nonlinear programming (NLP). A new “Newton chip”-like technique for reducing sizes of SDPs arising in the constructed polynomial optimization problems is presented. This technique is based on conditional expectations. Finally, noncommutative Gröbner bases are exploited to certify when an optimizer (a solution yielding optimum violation) cannot be extracted from a dual SDP solution.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized Power Cones: Optimal Error Bounds and Automorphisms 广义幂锥:最佳误差界限和自动形态
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-09 DOI: 10.1137/22m1542921
Ying Lin, Scott B. Lindstrom, Bruno F. Lourenço, Ting Kei Pong
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1316-1340, June 2024.
Abstract. Error bounds are a requisite for trusting or distrusting solutions in an informed way. Until recently, provable error bounds in the absence of constraint qualifications were unattainable for many classes of cones that do not admit projections with known succinct expressions. We build such error bounds for the generalized power cones, using the recently developed framework of one-step facial residual functions. We also show that our error bounds are tight in the sense of that framework. Besides their utility for understanding solution reliability, the error bounds we discover have additional applications to the algebraic structure of the underlying cone, which we describe. In particular we use the error bounds to compute the automorphisms of the generalized power cones, and to identify a set of generalized power cones that are self-dual, irreducible, nonhomogeneous, and perfect.
SIAM 优化期刊》,第 34 卷第 2 期,第 1316-1340 页,2024 年 6 月。 摘要。误差边界是明智地信任或不信任解决方案的必要条件。直到最近,在没有约束条件的情况下,对于许多无法用已知简洁表达式进行投影的锥体类别来说,可证明的误差边界还无法实现。我们利用最近开发的一步面部残差函数框架,为广义幂锥建立了这样的误差边界。我们还证明,我们的误差边界在该框架的意义上是紧密的。除了对理解解的可靠性有用之外,我们发现的误差边界在底层锥的代数结构上也有额外的应用,我们将对此进行描述。特别是,我们利用误差边界计算广义幂锥的自形性,并找出一组自偶、不可还原、非同质和完美的广义幂锥。
{"title":"Generalized Power Cones: Optimal Error Bounds and Automorphisms","authors":"Ying Lin, Scott B. Lindstrom, Bruno F. Lourenço, Ting Kei Pong","doi":"10.1137/22m1542921","DOIUrl":"https://doi.org/10.1137/22m1542921","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1316-1340, June 2024. <br/> Abstract. Error bounds are a requisite for trusting or distrusting solutions in an informed way. Until recently, provable error bounds in the absence of constraint qualifications were unattainable for many classes of cones that do not admit projections with known succinct expressions. We build such error bounds for the generalized power cones, using the recently developed framework of one-step facial residual functions. We also show that our error bounds are tight in the sense of that framework. Besides their utility for understanding solution reliability, the error bounds we discover have additional applications to the algebraic structure of the underlying cone, which we describe. In particular we use the error bounds to compute the automorphisms of the generalized power cones, and to identify a set of generalized power cones that are self-dual, irreducible, nonhomogeneous, and perfect.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part II: Outer Approximation Algorithm 具有组合切换约束的抛物线最优控制问题,第二部分:外近似算法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-03 DOI: 10.1137/22m1490284
Christoph Buchheim, Alexandra Grütering, Christian Meyer
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1295-1315, June 2024.
Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. In a companion paper [C. Buchheim, A. Grütering, and C. Meyer, SIAM J. Optim., arXiv:2203.07121, 2024], we describe the [math]-closure of the convex hull of feasible switching patterns as the intersection of convex sets derived from finite-dimensional projections. In this paper, the resulting outer description is used for the construction of an outer approximation algorithm in function space, whose iterates are proven to converge strongly in [math] to the global minimizer of the convexified optimal control problem. The linear-quadratic subproblems arising in each iteration of the outer approximation algorithm are solved by means of a semismooth Newton method. A numerical example in two spatial dimensions illustrates the efficiency of the overall algorithm.
SIAM 优化期刊》第 34 卷第 2 期第 1295-1315 页,2024 年 6 月。 摘要。我们考虑的是偏微分方程的最优控制问题,其中控制取值为二进制,但随时间跨度而变化;因此可以将其视为动态开关。切换模式可能受到组合约束,例如切换总数的上限或两次切换之间时间的下限。在另一篇论文 [C. Buchheim, A. GrüglerBuchheim、A. Grütering 和 C. Meyer,SIAM J. Optim.,arXiv:2203.07121,2024]中,我们将可行切换模式凸壳的[数学]封闭描述为由有限维投影得出的凸集的交集。在本文中,所得到的外部描述被用于构造函数空间中的外部逼近算法,其迭代在[math]中被证明强烈收敛于凸化最优控制问题的全局最小值。外近似算法的每次迭代中出现的线性二次子问题都是通过半滑牛顿法求解的。在两个空间维度上的一个数值示例说明了整个算法的效率。
{"title":"Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part II: Outer Approximation Algorithm","authors":"Christoph Buchheim, Alexandra Grütering, Christian Meyer","doi":"10.1137/22m1490284","DOIUrl":"https://doi.org/10.1137/22m1490284","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1295-1315, June 2024. <br/> Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. In a companion paper [C. Buchheim, A. Grütering, and C. Meyer, SIAM J. Optim., arXiv:2203.07121, 2024], we describe the [math]-closure of the convex hull of feasible switching patterns as the intersection of convex sets derived from finite-dimensional projections. In this paper, the resulting outer description is used for the construction of an outer approximation algorithm in function space, whose iterates are proven to converge strongly in [math] to the global minimizer of the convexified optimal control problem. The linear-quadratic subproblems arising in each iteration of the outer approximation algorithm are solved by means of a semismooth Newton method. A numerical example in two spatial dimensions illustrates the efficiency of the overall algorithm.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decomposition Methods for Global Solution of Mixed-Integer Linear Programs 混合整数线性方程组全局求解的分解方法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-01 DOI: 10.1137/22m1487321
Kaizhao Sun, Mou Sun, Wotao Yin
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1206-1235, June 2024.
Abstract. This paper introduces two decomposition-based methods for two-block mixed-integer linear programs (MILPs), which aim to take advantage of separable structures of the original problem by solving a sequence of lower-dimensional MILPs. The first method is based on the [math]-augmented Lagrangian method, and the second one is based on a modified alternating direction method of multipliers. In the presence of certain block-angular structures, both methods create parallel subproblems in one block of variables and add nonconvex cuts to update the other block; they converge to globally optimal solutions of the original MILP under proper conditions. Numerical experiments on three classes of MILPs demonstrate the advantages of the proposed methods on structured problems over the state-of-the-art MILP solvers.
SIAM 优化期刊》,第 34 卷第 2 期,第 1206-1235 页,2024 年 6 月。 摘要本文介绍了两种基于分解的两块混合整数线性程序(MILPs)方法,旨在通过求解一系列低维 MILPs 来利用原问题的可分离结构。第一种方法基于[math]增量拉格朗日法,第二种方法基于改进的乘法交替方向法。在存在某些块-角结构的情况下,这两种方法都能在一个变量块中创建并行子问题,并添加非凸切口来更新另一个变量块;在适当条件下,它们都能收敛到原始 MILP 的全局最优解。对三类 MILP 的数值实验表明,与最先进的 MILP 求解器相比,建议的方法在结构化问题上更具优势。
{"title":"Decomposition Methods for Global Solution of Mixed-Integer Linear Programs","authors":"Kaizhao Sun, Mou Sun, Wotao Yin","doi":"10.1137/22m1487321","DOIUrl":"https://doi.org/10.1137/22m1487321","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1206-1235, June 2024. <br/> Abstract. This paper introduces two decomposition-based methods for two-block mixed-integer linear programs (MILPs), which aim to take advantage of separable structures of the original problem by solving a sequence of lower-dimensional MILPs. The first method is based on the [math]-augmented Lagrangian method, and the second one is based on a modified alternating direction method of multipliers. In the presence of certain block-angular structures, both methods create parallel subproblems in one block of variables and add nonconvex cuts to update the other block; they converge to globally optimal solutions of the original MILP under proper conditions. Numerical experiments on three classes of MILPs demonstrate the advantages of the proposed methods on structured problems over the state-of-the-art MILP solvers.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonasymptotic Upper Estimates for Errors of the Sample Average Approximation Method to Solve Risk-Averse Stochastic Programs 解决风险厌恶随机程序的样本平均逼近法误差的非渐近上限估计值
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-01 DOI: 10.1137/22m1535425
Volker Krätschmer
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1264-1294, June 2024.
Abstract. We study statistical properties of the optimal value of the sample average approximation (SAA). The focus is on the tail function of the absolute error induced by the SAA, deriving upper estimates of its outcomes dependent on the sample size. The estimates allow to conclude immediately convergence rates for the optimal value of the SAA. As a crucial point, the investigations are based on new types of conditions from the theory of empirical processes which do not rely on pathwise analytical properties of the goal functions. In particular, continuity in the parameter is not imposed in advance as often in the literature on the SAA method. It is also shown that the new condition is satisfied if the paths of the goal functions are Hölder continuous so that the main results carry over in this case. Moreover, the main results are applied to goal functions whose paths are piecewise Hölder continuous as, e.g., in two-stage mixed-integer programs. The main results are shown for classical risk-neutral stochastic programs, but we also demonstrate how to apply them to the sample average approximation of risk-averse stochastic programs. In this respect, we consider stochastic programs expressed in terms of mean upper semideviations and divergence risk measures.
SIAM 优化期刊》,第 34 卷第 2 期,第 1264-1294 页,2024 年 6 月。 摘要我们研究了样本平均近似(SAA)最优值的统计特性。重点是 SAA 引起的绝对误差的尾函数,推导出其结果取决于样本大小的上限估计值。通过这些估计值,可以立即得出 SAA 最佳值的收敛率。关键的一点是,研究基于经验过程理论中的新型条件,而这些条件并不依赖于目标函数的路径分析特性。特别是,没有像有关 SAA 方法的文献中经常提到的那样,事先强加参数的连续性。研究还表明,如果目标函数的路径是荷尔德连续的,那么新条件就会得到满足,因此主要结果在这种情况下也是如此。此外,主要结果还适用于路径为片断荷尔德连续的目标函数,例如两阶段混合整数程序。主要结果针对经典的风险中性随机程序,但我们也演示了如何将它们应用于风险规避随机程序的样本平均近似。在这方面,我们考虑了用均值上半偏差和发散风险度量表示的随机程序。
{"title":"Nonasymptotic Upper Estimates for Errors of the Sample Average Approximation Method to Solve Risk-Averse Stochastic Programs","authors":"Volker Krätschmer","doi":"10.1137/22m1535425","DOIUrl":"https://doi.org/10.1137/22m1535425","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1264-1294, June 2024. <br/> Abstract. We study statistical properties of the optimal value of the sample average approximation (SAA). The focus is on the tail function of the absolute error induced by the SAA, deriving upper estimates of its outcomes dependent on the sample size. The estimates allow to conclude immediately convergence rates for the optimal value of the SAA. As a crucial point, the investigations are based on new types of conditions from the theory of empirical processes which do not rely on pathwise analytical properties of the goal functions. In particular, continuity in the parameter is not imposed in advance as often in the literature on the SAA method. It is also shown that the new condition is satisfied if the paths of the goal functions are Hölder continuous so that the main results carry over in this case. Moreover, the main results are applied to goal functions whose paths are piecewise Hölder continuous as, e.g., in two-stage mixed-integer programs. The main results are shown for classical risk-neutral stochastic programs, but we also demonstrate how to apply them to the sample average approximation of risk-averse stochastic programs. In this respect, we consider stochastic programs expressed in terms of mean upper semideviations and divergence risk measures.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerated Forward-Backward Optimization Using Deep Learning 利用深度学习加速前向-后向优化
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-01 DOI: 10.1137/22m1532548
Sebastian Banert, Jevgenija Rudzusika, Ozan Öktem, Jonas Adler
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1236-1263, June 2024.
Abstract. We propose several deep-learning accelerated optimization solvers with convergence guarantees. We use ideas from the analysis of accelerated forward-backward schemes like FISTA, but instead of the classical approach of proving convergence for a choice of parameters, such as a step-size, we show convergence whenever the update is chosen in a specific set. Rather than picking a point in this set using some predefined method, we train a deep neural network to pick the best update within a given space. Finally, we show that the method is applicable to several cases of smooth and nonsmooth optimization and show superior results to established accelerated solvers.
SIAM 优化期刊》,第 34 卷第 2 期,第 1236-1263 页,2024 年 6 月。 摘要我们提出了几种具有收敛性保证的深度学习加速优化求解器。我们使用了对 FISTA 等加速前向后向方案的分析思路,但我们并没有采用经典的方法来证明步长等参数选择的收敛性,而是证明了在特定集合中选择更新时的收敛性。我们不是使用某种预定义的方法在这个集合中选取一个点,而是训练一个深度神经网络,在给定的空间内选取最佳更新。最后,我们证明该方法适用于平滑和非平滑优化的几种情况,并显示出优于现有加速求解器的结果。
{"title":"Accelerated Forward-Backward Optimization Using Deep Learning","authors":"Sebastian Banert, Jevgenija Rudzusika, Ozan Öktem, Jonas Adler","doi":"10.1137/22m1532548","DOIUrl":"https://doi.org/10.1137/22m1532548","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1236-1263, June 2024. <br/> Abstract. We propose several deep-learning accelerated optimization solvers with convergence guarantees. We use ideas from the analysis of accelerated forward-backward schemes like FISTA, but instead of the classical approach of proving convergence for a choice of parameters, such as a step-size, we show convergence whenever the update is chosen in a specific set. Rather than picking a point in this set using some predefined method, we train a deep neural network to pick the best update within a given space. Finally, we show that the method is applicable to several cases of smooth and nonsmooth optimization and show superior results to established accelerated solvers.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SIAM Journal on Optimization
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1