Certifying Optimality of Bell Inequality Violations: Noncommutative Polynomial Optimization through Semidefinite Programming and Local Optimization
Timotej Hrga, Igor Klep, Janez Povh
DOI: https://doi.org/10.1137/22m1473340
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1341-1373, June 2024. Abstract. Bell inequalities are pillars of quantum physics in that their violations imply that certain properties of quantum physics (e.g., entanglement) cannot be represented by any classical picture of physics. In this article Bell inequalities and their violations are considered through the lens of noncommutative polynomial optimization. Optimality of these violations is certified for a large majority of a set of standard Bell inequalities, denoted A2–A89 in the literature. The main techniques used in the paper include the NPA hierarchy, i.e., the noncommutative version of the Lasserre semidefinite programming (SDP) hierarchies based on the Helton–McCullough Positivstellensatz, the Gelfand–Naimark–Segal (GNS) construction with a novel use of the Artin–Wedderburn theory for rounding and projecting, and nonlinear programming (NLP). A new “Newton chip”-like technique for reducing sizes of SDPs arising in the constructed polynomial optimization problems is presented. This technique is based on conditional expectations. Finally, noncommutative Gröbner bases are exploited to certify when an optimizer (a solution yielding optimum violation) cannot be extracted from a dual SDP solution.
Generalized Power Cones: Optimal Error Bounds and Automorphisms
Ying Lin, Scott B. Lindstrom, Bruno F. Lourenço, Ting Kei Pong
DOI: https://doi.org/10.1137/22m1542921
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1316-1340, June 2024. Abstract. Error bounds are a requisite for trusting or distrusting solutions in an informed way. Until recently, provable error bounds in the absence of constraint qualifications were unattainable for many classes of cones that do not admit projections with known succinct expressions. We build such error bounds for the generalized power cones, using the recently developed framework of one-step facial residual functions. We also show that our error bounds are tight in the sense of that framework. Besides their utility for understanding solution reliability, the error bounds we discover have additional applications to the algebraic structure of the underlying cone, which we describe. In particular, we use the error bounds to compute the automorphisms of the generalized power cones, and to identify a set of generalized power cones that are self-dual, irreducible, nonhomogeneous, and perfect.
Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part II: Outer Approximation Algorithm
Christoph Buchheim, Alexandra Grütering, Christian Meyer
DOI: https://doi.org/10.1137/22m1490284
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1295-1315, June 2024. Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. In a companion paper [C. Buchheim, A. Grütering, and C. Meyer, SIAM J. Optim., arXiv:2203.07121, 2024], we describe the [math]-closure of the convex hull of feasible switching patterns as the intersection of convex sets derived from finite-dimensional projections. In this paper, the resulting outer description is used for the construction of an outer approximation algorithm in function space, whose iterates are proven to converge strongly in [math] to the global minimizer of the convexified optimal control problem. The linear-quadratic subproblems arising in each iteration of the outer approximation algorithm are solved by means of a semismooth Newton method. A numerical example in two spatial dimensions illustrates the efficiency of the overall algorithm.
Decomposition Methods for Global Solution of Mixed-Integer Linear Programs
Kaizhao Sun, Mou Sun, Wotao Yin
DOI: https://doi.org/10.1137/22m1487321
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1206-1235, June 2024. Abstract. This paper introduces two decomposition-based methods for two-block mixed-integer linear programs (MILPs), which aim to take advantage of separable structures of the original problem by solving a sequence of lower-dimensional MILPs. The first method is based on the [math]-augmented Lagrangian method, and the second one is based on a modified alternating direction method of multipliers. In the presence of certain block-angular structures, both methods create parallel subproblems in one block of variables and add nonconvex cuts to update the other block; they converge to globally optimal solutions of the original MILP under proper conditions. Numerical experiments on three classes of MILPs demonstrate the advantages of the proposed methods on structured problems over the state-of-the-art MILP solvers.
Nonasymptotic Upper Estimates for Errors of the Sample Average Approximation Method to Solve Risk-Averse Stochastic Programs
Volker Krätschmer
DOI: https://doi.org/10.1137/22m1535425
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1264-1294, June 2024. Abstract. We study statistical properties of the optimal value of the sample average approximation (SAA). The focus is on the tail function of the absolute error induced by the SAA, deriving upper estimates of its outcomes dependent on the sample size. The estimates allow one to immediately conclude convergence rates for the optimal value of the SAA. As a crucial point, the investigations are based on new types of conditions from the theory of empirical processes which do not rely on pathwise analytical properties of the goal functions. In particular, continuity in the parameter is not imposed in advance, as is often done in the literature on the SAA method. It is also shown that the new condition is satisfied if the paths of the goal functions are Hölder continuous, so that the main results carry over in this case. Moreover, the main results are applied to goal functions whose paths are piecewise Hölder continuous as, e.g., in two-stage mixed-integer programs. The main results are shown for classical risk-neutral stochastic programs, but we also demonstrate how to apply them to the sample average approximation of risk-averse stochastic programs. In this respect, we consider stochastic programs expressed in terms of mean upper semideviations and divergence risk measures.
Accelerated Forward-Backward Optimization Using Deep Learning
Sebastian Banert, Jevgenija Rudzusika, Ozan Öktem, Jonas Adler
DOI: https://doi.org/10.1137/22m1532548
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1236-1263, June 2024. Abstract. We propose several deep-learning accelerated optimization solvers with convergence guarantees. We use ideas from the analysis of accelerated forward-backward schemes like FISTA, but instead of the classical approach of proving convergence for a choice of parameters, such as a step-size, we show convergence whenever the update is chosen in a specific set. Rather than picking a point in this set using some predefined method, we train a deep neural network to pick the best update within a given space. Finally, we show that the method is applicable to several cases of smooth and nonsmooth optimization and show superior results to established accelerated solvers.
Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part I: Convex Relaxations
Christoph Buchheim, Alexandra Grütering, Christian Meyer
DOI: https://doi.org/10.1137/22m1490260
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1187-1205, June 2024. Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. While such combinatorial constraints are often seen as an additional complication that is treated in a heuristic postprocessing, the core of our approach is to investigate the convex hull of all feasible switching patterns in order to define a tight convex relaxation of the control problem. The convex relaxation is built by cutting planes derived from finite-dimensional projections, which can be studied by means of polyhedral combinatorics. A numerical example for the case of a bounded number of switchings shows that our approach can significantly improve the dual bounds given by the straightforward continuous relaxation, which is obtained by relaxing binarity constraints.
A Semismooth Newton Stochastic Proximal Point Algorithm with Variance Reduction
Andre Milzarek, Fabian Schaipp, Michael Ulbrich
DOI: https://doi.org/10.1137/22m1488181
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1157-1185, March 2024. Abstract. We develop an implementable stochastic proximal point (SPP) method for a class of weakly convex, composite optimization problems. The proposed stochastic proximal point algorithm incorporates a variance reduction mechanism and the resulting SPP updates are solved using an inexact semismooth Newton framework. We establish detailed convergence results that take the inexactness of the SPP steps into account and that are in accordance with existing convergence guarantees of (proximal) stochastic variance-reduced gradient methods. Numerical experiments show that the proposed algorithm competes favorably with other state-of-the-art methods and achieves higher robustness with respect to the step size selection.
Provably Accelerated Decentralized Gradient Methods Over Unbalanced Directed Graphs
Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan
DOI: https://doi.org/10.1137/22m148570x
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1131-1156, March 2024. Abstract. We consider the decentralized optimization problem, where a network of [math] agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph. To tackle this problem, we propose two accelerated gradient tracking methods, namely Accelerated Push-DIGing (APD) and APD-SC, for non-strongly convex and strongly convex objective functions, respectively. We show that APD and APD-SC converge at the rates [math] and [math], respectively, up to constant factors depending only on the mixing matrix. APD and APD-SC are the first decentralized methods over unbalanced directed graphs that achieve the same provable acceleration as centralized methods. Numerical experiments demonstrate the effectiveness of both methods.
Robust Accelerated Primal-Dual Methods for Computing Saddle Points
Xuan Zhang, Necdet Serhat Aybat, Mert Gürbüzbalaban
DOI: https://doi.org/10.1137/21m1462775
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1097-1130, March 2024. Abstract. We consider strongly-convex-strongly-concave saddle point problems assuming we have access to unbiased stochastic estimates of the gradients. We propose a stochastic accelerated primal-dual (SAPD) algorithm and show that the SAPD sequence, generated using constant primal-dual step sizes, linearly converges to a neighborhood of the unique saddle point. Interpreting the size of the neighborhood as a measure of robustness to gradient noise, we obtain explicit characterizations of robustness in terms of SAPD parameters and problem constants. Based on these characterizations, we develop computationally tractable techniques for optimizing the SAPD parameters, i.e., the primal and dual step sizes, and the momentum parameter, to achieve a desired trade-off between the convergence rate and robustness on the Pareto curve. This allows SAPD to enjoy fast convergence properties while being robust to noise as an accelerated method. SAPD admits convergence guarantees for the distance metric with a variance term optimal up to a logarithmic factor, which can be removed by employing a restarting strategy. We also discuss how convergence and robustness results extend to the merely-convex-merely-concave setting. Finally, we illustrate our framework on a distributionally robust logistic regression problem.