首页 > 最新文献

SIAM Journal on Optimization最新文献

英文 中文
Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part I: Convex Relaxations 具有组合切换约束条件的抛物线优化控制问题,第一部分:凸松弛
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-04-01 DOI: 10.1137/22m1490260
Christoph Buchheim, Alexandra Grütering, Christian Meyer
SIAM Journal on Optimization, Volume 34, Issue 2, Page 1187-1205, June 2024.
Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. While such combinatorial constraints are often seen as an additional complication that is treated in a heuristic postprocessing, the core of our approach is to investigate the convex hull of all feasible switching patterns in order to define a tight convex relaxation of the control problem. The convex relaxation is built by cutting planes derived from finite-dimensional projections, which can be studied by means of polyhedral combinatorics. A numerical example for the case of a bounded number of switchings shows that our approach can significantly improve the dual bounds given by the straightforward continuous relaxation, which is obtained by relaxing binarity constraints.
SIAM 优化期刊》,第 34 卷第 2 期,第 1187-1205 页,2024 年 6 月。 摘要。我们考虑的是偏微分方程的最优控制问题,其中控制取值为二进制,但随时间跨度而变化;因此可以将其视为动态开关。切换模式可能受到组合约束,例如切换总数的上限或两次切换之间时间的下限。这种组合约束通常被视为一种额外的复杂因素,在启发式后处理中加以处理,而我们方法的核心是研究所有可行切换模式的凸壳,从而定义控制问题的紧密凸松弛。凸松弛由有限维投影衍生的切割平面建立,可通过多面体组合学进行研究。一个关于有界切换次数的数值示例表明,我们的方法可以显著改善直接连续松弛法给出的对偶约束,而连续松弛法是通过松弛二值性约束得到的。
{"title":"Parabolic Optimal Control Problems with Combinatorial Switching Constraints, Part I: Convex Relaxations","authors":"Christoph Buchheim, Alexandra Grütering, Christian Meyer","doi":"10.1137/22m1490260","DOIUrl":"https://doi.org/10.1137/22m1490260","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 2, Page 1187-1205, June 2024. <br/> Abstract. We consider optimal control problems for partial differential equations where the controls take binary values but vary over the time horizon; they can thus be seen as dynamic switches. The switching patterns may be subject to combinatorial constraints such as, e.g., an upper bound on the total number of switchings or a lower bound on the time between two switchings. While such combinatorial constraints are often seen as an additional complication that is treated in a heuristic postprocessing, the core of our approach is to investigate the convex hull of all feasible switching patterns in order to define a tight convex relaxation of the control problem. The convex relaxation is built by cutting planes derived from finite-dimensional projections, which can be studied by means of polyhedral combinatorics. A numerical example for the case of a bounded number of switchings shows that our approach can significantly improve the dual bounds given by the straightforward continuous relaxation, which is obtained by relaxing binarity constraints.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Semismooth Newton Stochastic Proximal Point Algorithm with Variance Reduction 减少方差的半滑牛顿随机近点算法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-26 DOI: 10.1137/22m1488181
Andre Milzarek, Fabian Schaipp, Michael Ulbrich
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1157-1185, March 2024.
Abstract. We develop an implementable stochastic proximal point (SPP) method for a class of weakly convex, composite optimization problems. The proposed stochastic proximal point algorithm incorporates a variance reduction mechanism and the resulting SPP updates are solved using an inexact semismooth Newton framework. We establish detailed convergence results that take the inexactness of the SPP steps into account and that are in accordance with existing convergence guarantees of (proximal) stochastic variance-reduced gradient methods. Numerical experiments show that the proposed algorithm competes favorably with other state-of-the-art methods and achieves higher robustness with respect to the step size selection.
SIAM 优化期刊》,第 34 卷第 1 期,第 1157-1185 页,2024 年 3 月。 摘要。我们针对一类弱凸复合优化问题开发了一种可实现的随机近似点(SPP)方法。所提出的随机近似点算法结合了方差缩小机制,并使用不精确的半光滑牛顿框架求解 SPP 更新。我们建立了详细的收敛结果,这些结果考虑到了 SPP 步骤的不精确性,并且与(近点)随机方差缩小梯度方法的现有收敛保证相一致。数值实验表明,所提出的算法能与其他最先进的方法相媲美,并且在步长选择方面具有更高的鲁棒性。
{"title":"A Semismooth Newton Stochastic Proximal Point Algorithm with Variance Reduction","authors":"Andre Milzarek, Fabian Schaipp, Michael Ulbrich","doi":"10.1137/22m1488181","DOIUrl":"https://doi.org/10.1137/22m1488181","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1157-1185, March 2024. <br/> Abstract. We develop an implementable stochastic proximal point (SPP) method for a class of weakly convex, composite optimization problems. The proposed stochastic proximal point algorithm incorporates a variance reduction mechanism and the resulting SPP updates are solved using an inexact semismooth Newton framework. We establish detailed convergence results that take the inexactness of the SPP steps into account and that are in accordance with existing convergence guarantees of (proximal) stochastic variance-reduced gradient methods. Numerical experiments show that the proposed algorithm competes favorably with other state-of-the-art methods and achieves higher robustness with respect to the step size selection.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140302825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Provably Accelerated Decentralized Gradient Methods Over Unbalanced Directed Graphs 不平衡有向图上的可证明加速分散梯度法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-22 DOI: 10.1137/22m148570x
Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1131-1156, March 2024.
Abstract. We consider the decentralized optimization problem, where a network of [math] agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph. To tackle this problem, we propose two accelerated gradient tracking methods, namely Accelerated Push-DIGing (APD) and APD-SC, for non-strongly convex and strongly convex objective functions, respectively. We show that APD and APD-SC converge at the rates [math] and [math], respectively, up to constant factors depending only on the mixing matrix. APD and APD-SC are the first decentralized methods over unbalanced directed graphs that achieve the same provable acceleration as centralized methods. Numerical experiments demonstrate the effectiveness of both methods.
SIAM 优化期刊》,第 34 卷第 1 期,第 1131-1156 页,2024 年 3 月。 摘要。我们考虑了分散优化问题,即一个由[数学]代理组成的网络旨在通过有向图中的点对点通信,协同最小化其各自平滑凸目标函数的平均值。为了解决这个问题,我们提出了两种加速梯度跟踪方法,即加速推导法(APD)和 APD-SC,分别适用于非强凸目标函数和强凸目标函数。我们的研究表明,APD 和 APD-SC 分别以 [math] 和 [math] 的速率收敛,收敛率可达常数因子,仅取决于混合矩阵。APD 和 APD-SC 是第一种在不平衡有向图上实现与集中式方法相同的可证明加速度的分散式方法。数值实验证明了这两种方法的有效性。
{"title":"Provably Accelerated Decentralized Gradient Methods Over Unbalanced Directed Graphs","authors":"Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan","doi":"10.1137/22m148570x","DOIUrl":"https://doi.org/10.1137/22m148570x","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1131-1156, March 2024. <br/> Abstract. We consider the decentralized optimization problem, where a network of [math] agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph. To tackle this problem, we propose two accelerated gradient tracking methods, namely Accelerated Push-DIGing (APD) and APD-SC, for non-strongly convex and strongly convex objective functions, respectively. We show that APD and APD-SC converge at the rates [math] and [math], respectively, up to constant factors depending only on the mixing matrix. APD and APD-SC are the first decentralized methods over unbalanced directed graphs that achieve the same provable acceleration as centralized methods. Numerical experiments demonstrate the effectiveness of both methods.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140199146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Accelerated Primal-Dual Methods for Computing Saddle Points 用于计算鞍点的稳健加速原始双方法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-19 DOI: 10.1137/21m1462775
Xuan Zhang, Necdet Serhat Aybat, Mert Gürbüzbalaban
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1097-1130, March 2024.
Abstract. We consider strongly-convex-strongly-concave saddle point problems assuming we have access to unbiased stochastic estimates of the gradients. We propose a stochastic accelerated primal-dual (SAPD) algorithm and show that the SAPD sequence, generated using constant primal-dual step sizes, linearly converges to a neighborhood of the unique saddle point. Interpreting the size of the neighborhood as a measure of robustness to gradient noise, we obtain explicit characterizations of robustness in terms of SAPD parameters and problem constants. Based on these characterizations, we develop computationally tractable techniques for optimizing the SAPD parameters, i.e., the primal and dual step sizes, and the momentum parameter, to achieve a desired trade-off between the convergence rate and robustness on the Pareto curve. This allows SAPD to enjoy fast convergence properties while being robust to noise as an accelerated method. SAPD admits convergence guarantees for the distance metric with a variance term optimal up to a logarithmic factor, which can be removed by employing a restarting strategy. We also discuss how convergence and robustness results extend to the merely-convex-merely-concave setting. Finally, we illustrate our framework on a distributionally robust logistic regression problem.
SIAM 优化期刊》,第 34 卷,第 1 期,第 1097-1130 页,2024 年 3 月。 摘要。我们考虑了强凸-强凹鞍点问题,假设我们可以获得梯度的无偏随机估计。我们提出了一种随机加速初等二元算法(SAPD),并证明使用恒定初等二元步长生成的 SAPD 序列线性收敛于唯一鞍点的邻域。我们将邻域的大小解释为对梯度噪声的鲁棒性度量,并根据 SAPD 参数和问题常数获得了鲁棒性的明确特征。基于这些特征,我们开发了计算简单的技术,用于优化 SAPD 参数,即原始步长和对偶步长以及动量参数,从而在帕累托曲线上实现收敛速度和鲁棒性之间的理想权衡。这使得 SAPD 既能享受快速收敛特性,又能作为一种加速方法对噪声保持稳健。SAPD 可保证距离度量的收敛性,其方差项最优为对数因子,可通过采用重启策略消除。我们还讨论了收敛性和鲁棒性结果如何扩展到单纯凸-单纯凹设置。最后,我们在一个分布稳健的逻辑回归问题上说明了我们的框架。
{"title":"Robust Accelerated Primal-Dual Methods for Computing Saddle Points","authors":"Xuan Zhang, Necdet Serhat Aybat, Mert Gürbüzbalaban","doi":"10.1137/21m1462775","DOIUrl":"https://doi.org/10.1137/21m1462775","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1097-1130, March 2024. <br/> Abstract. We consider strongly-convex-strongly-concave saddle point problems assuming we have access to unbiased stochastic estimates of the gradients. We propose a stochastic accelerated primal-dual (SAPD) algorithm and show that the SAPD sequence, generated using constant primal-dual step sizes, linearly converges to a neighborhood of the unique saddle point. Interpreting the size of the neighborhood as a measure of robustness to gradient noise, we obtain explicit characterizations of robustness in terms of SAPD parameters and problem constants. Based on these characterizations, we develop computationally tractable techniques for optimizing the SAPD parameters, i.e., the primal and dual step sizes, and the momentum parameter, to achieve a desired trade-off between the convergence rate and robustness on the Pareto curve. This allows SAPD to enjoy fast convergence properties while being robust to noise as an accelerated method. SAPD admits convergence guarantees for the distance metric with a variance term optimal up to a logarithmic factor, which can be removed by employing a restarting strategy. We also discuss how convergence and robustness results extend to the merely-convex-merely-concave setting. Finally, we illustrate our framework on a distributionally robust logistic regression problem.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140172253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Integrality in Semidefinite Programming for Discrete Optimization 论离散优化半定量编程中的积分性
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-15 DOI: 10.1137/23m1580905
Frank de Meijer, Renata Sotirov
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1071-1096, March 2024.
Abstract. It is well known that by adding integrality constraints to the semidefinite programming (SDP) relaxation of the max-cut problem, the resulting integer semidefinite program is an exact formulation of the problem. In this paper we show similar results for a wide variety of discrete optimization problems for which SDP relaxations have been derived. Based on a comprehensive study on discrete positive semidefinite matrices, we introduce a generic approach to derive mixed-integer SDP (MISDP) formulations of binary quadratically constrained quadratic programs and binary quadratic matrix programs. Applying a problem-specific approach, we derive more compact MISDP formulations of several problems, such as the quadratic assignment problem, the graph partition problem, and the integer matrix completion problem. We also show that several structured problems allow for novel compact MISDP formulations through the notion of association schemes. Complementary to the recent advances on algorithmic aspects related to MISDP, this work opens new perspectives on solution approaches for the here considered problems.
SIAM 优化期刊》,第 34 卷,第 1 期,第 1071-1096 页,2024 年 3 月。 摘要众所周知,通过在最大割问题的半有限编程(SDP)松弛中添加积分约束,得到的整数半有限编程是该问题的精确表述。在本文中,我们对已得到 SDP 松弛的各种离散优化问题展示了类似的结果。基于对离散正半有限矩阵的全面研究,我们介绍了一种通用方法,用于推导二元二次约束二次方程程序和二元二次矩阵程序的混合整数 SDP (MISDP) 公式。应用针对具体问题的方法,我们推导出了一些问题更紧凑的 MISDP 公式,如二次赋值问题、图分割问题和整数矩阵完成问题。我们还表明,通过关联方案的概念,一些结构化问题可以得到新颖紧凑的 MISDP 公式。作为与 MISDP 相关的算法方面最新进展的补充,这项工作为本文所考虑问题的解决方法开辟了新的视角。
{"title":"On Integrality in Semidefinite Programming for Discrete Optimization","authors":"Frank de Meijer, Renata Sotirov","doi":"10.1137/23m1580905","DOIUrl":"https://doi.org/10.1137/23m1580905","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1071-1096, March 2024. <br/> Abstract. It is well known that by adding integrality constraints to the semidefinite programming (SDP) relaxation of the max-cut problem, the resulting integer semidefinite program is an exact formulation of the problem. In this paper we show similar results for a wide variety of discrete optimization problems for which SDP relaxations have been derived. Based on a comprehensive study on discrete positive semidefinite matrices, we introduce a generic approach to derive mixed-integer SDP (MISDP) formulations of binary quadratically constrained quadratic programs and binary quadratic matrix programs. Applying a problem-specific approach, we derive more compact MISDP formulations of several problems, such as the quadratic assignment problem, the graph partition problem, and the integer matrix completion problem. We also show that several structured problems allow for novel compact MISDP formulations through the notion of association schemes. Complementary to the recent advances on algorithmic aspects related to MISDP, this work opens new perspectives on solution approaches for the here considered problems.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140153146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Randomized Douglas–Rachford Methods for Linear Systems: Improved Accuracy and Efficiency 线性系统的随机化道格拉斯-拉赫福德方法:提高精度和效率
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-15 DOI: 10.1137/23m1567503
Deren Han, Yansheng Su, Jiaxin Xie
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1045-1070, March 2024.
Abstract. The Douglas–Rachford (DR) method is a widely used method for finding a point in the intersection of two closed convex sets (feasibility problem). However, the method converges weakly, and the associated rate of convergence is hard to analyze in general. In addition, the direct extension of the DR method for solving more-than-two-sets feasibility problems, called the [math]-sets-DR method, is not necessarily convergent. To improve the efficiency of the optimization algorithms, the introduction of randomization and the momentum technique has attracted increasing attention. In this paper, we propose the randomized [math]-sets-DR (RrDR) method for solving the feasibility problem derived from linear systems, showing the benefit of the randomization as it brings linear convergence in expectation to the otherwise divergent [math]-sets-DR method. Furthermore, the convergence rate does not depend on the dimension of the coefficient matrix. We also study RrDR with heavy ball momentum and establish its accelerated rate. Numerical experiments are provided to confirm our results and demonstrate the notable improvements in accuracy and efficiency of the DR method brought by the randomization and the momentum technique.
SIAM 优化期刊》,第 34 卷,第 1 期,第 1045-1070 页,2024 年 3 月。 摘要道格拉斯-拉克福德(Douglas-Rachford,DR)方法是一种广泛应用于寻找两个闭合凸集交点(可行性问题)的方法。然而,该方法的收敛性较弱,相关的收敛速率一般难以分析。此外,DR 方法的直接扩展用于求解多于两个集合的可行性问题,即[math]-sets-DR 方法,也不一定收敛。为了提高优化算法的效率,随机化和动量技术的引入引起了越来越多的关注。本文提出了随机化[math]-sets-DR(RrDR)方法,用于求解线性系统衍生的可行性问题,显示了随机化的好处,因为它给原本发散的[math]-sets-DR方法带来了期望值上的线性收敛。此外,收敛速度并不取决于系数矩阵的维度。我们还研究了重球动量下的 RrDR,并确定了其加速率。我们提供了数值实验来证实我们的结果,并证明随机化和动量技术显著提高了 DR 方法的精度和效率。
{"title":"Randomized Douglas–Rachford Methods for Linear Systems: Improved Accuracy and Efficiency","authors":"Deren Han, Yansheng Su, Jiaxin Xie","doi":"10.1137/23m1567503","DOIUrl":"https://doi.org/10.1137/23m1567503","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1045-1070, March 2024. <br/> Abstract. The Douglas–Rachford (DR) method is a widely used method for finding a point in the intersection of two closed convex sets (feasibility problem). However, the method converges weakly, and the associated rate of convergence is hard to analyze in general. In addition, the direct extension of the DR method for solving more-than-two-sets feasibility problems, called the [math]-sets-DR method, is not necessarily convergent. To improve the efficiency of the optimization algorithms, the introduction of randomization and the momentum technique has attracted increasing attention. In this paper, we propose the randomized [math]-sets-DR (RrDR) method for solving the feasibility problem derived from linear systems, showing the benefit of the randomization as it brings linear convergence in expectation to the otherwise divergent [math]-sets-DR method. Furthermore, the convergence rate does not depend on the dimension of the coefficient matrix. We also study RrDR with heavy ball momentum and establish its accelerated rate. Numerical experiments are provided to confirm our results and demonstrate the notable improvements in accuracy and efficiency of the DR method brought by the randomization and the momentum technique.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140152967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized Gradient Descent Maximization Method for Composite Nonconvex Strongly-Concave Minimax Problems 复合非凸强凹最小问题的分散梯度下降最大化方法
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-12 DOI: 10.1137/23m1558677
Yangyang Xu
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1006-1044, March 2024.
Abstract. Minimax problems have recently attracted a lot of research interests. A few efforts have been made to solve decentralized nonconvex strongly-concave (NCSC) minimax-structured optimization; however, all of them focus on smooth problems with at most a constraint on the maximization variable. In this paper, we make the first attempt on solving composite NCSC minimax problems that can have convex nonsmooth terms on both minimization and maximization variables. Our algorithm is designed based on a novel reformulation of the decentralized minimax problem that introduces a multiplier to absorb the dual consensus constraint. The removal of dual consensus constraint enables the most aggressive (i.e., local maximization instead of a gradient ascent step) dual update that leads to the benefit of taking a larger primal stepsize and better complexity results. In addition, the decoupling of the nonsmoothness and consensus on the dual variable eases the analysis of a decentralized algorithm; thus our reformulation creates a new way for interested researchers to design new (and possibly more efficient) decentralized methods on solving NCSC minimax problems. We show a global convergence result of the proposed algorithm and an iteration complexity result to produce a (near) stationary point of the reformulation. Moreover, a relation is established between the (near) stationarities of the reformulation and the original formulation. With this relation, we show that when the dual regularizer is smooth, our algorithm can have lower complexity results (with reduced dependence on a condition number) than existing ones to produce a near-stationary point of the original formulation. Numerical experiments are conducted on a distributionally robust logistic regression to demonstrate the performance of the proposed algorithm.
SIAM 优化期刊》,第 34 卷第 1 期,第 1006-1044 页,2024 年 3 月。 摘要最小问题最近引起了很多研究兴趣。然而,所有这些研究都集中在平滑问题上,而且最大化变量上最多只有一个约束。在本文中,我们首次尝试求解复合 NCSC minimax 问题,这些问题可能在最小化变量和最大化变量上都存在凸非光滑项。我们的算法是基于对分散最小问题的一种新的重新表述而设计的,它引入了一个乘数来吸收双重共识约束。去除对偶共识约束后,就能进行最激进的对偶更新(即局部最大化,而不是梯度上升步骤),从而获得更大的原始步长和更好的复杂度结果。此外,对偶变量的非光滑性和共识的解耦简化了分散算法的分析;因此,我们的重新表述为感兴趣的研究人员设计新的(可能更有效的)分散方法来解决 NCSC minimax 问题提供了新的途径。我们展示了所提算法的全局收敛结果和迭代复杂度结果,从而得出了重构算法的(近)静止点。此外,我们还建立了重新计算的(近)静止点与原始计算之间的关系。利用这种关系,我们证明了当对偶正则器是平滑的时,我们的算法可以比现有算法得到更低的复杂度结果(对条件数的依赖性降低),从而产生原始公式的近静止点。我们对分布稳健的逻辑回归进行了数值实验,以证明所提算法的性能。
{"title":"Decentralized Gradient Descent Maximization Method for Composite Nonconvex Strongly-Concave Minimax Problems","authors":"Yangyang Xu","doi":"10.1137/23m1558677","DOIUrl":"https://doi.org/10.1137/23m1558677","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1006-1044, March 2024. <br/> Abstract. Minimax problems have recently attracted a lot of research interests. A few efforts have been made to solve decentralized nonconvex strongly-concave (NCSC) minimax-structured optimization; however, all of them focus on smooth problems with at most a constraint on the maximization variable. In this paper, we make the first attempt on solving composite NCSC minimax problems that can have convex nonsmooth terms on both minimization and maximization variables. Our algorithm is designed based on a novel reformulation of the decentralized minimax problem that introduces a multiplier to absorb the dual consensus constraint. The removal of dual consensus constraint enables the most aggressive (i.e., local maximization instead of a gradient ascent step) dual update that leads to the benefit of taking a larger primal stepsize and better complexity results. In addition, the decoupling of the nonsmoothness and consensus on the dual variable eases the analysis of a decentralized algorithm; thus our reformulation creates a new way for interested researchers to design new (and possibly more efficient) decentralized methods on solving NCSC minimax problems. We show a global convergence result of the proposed algorithm and an iteration complexity result to produce a (near) stationary point of the reformulation. Moreover, a relation is established between the (near) stationarities of the reformulation and the original formulation. With this relation, we show that when the dual regularizer is smooth, our algorithm can have lower complexity results (with reduced dependence on a condition number) than existing ones to produce a near-stationary point of the original formulation. Numerical experiments are conducted on a distributionally robust logistic regression to demonstrate the performance of the proposed algorithm.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140117269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Do Exponential Size Solutions Arise in Semidefinite Programming? 半定式编程中如何产生指数大小的解决方案?
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-08 DOI: 10.1137/21m1434945
Gábor Pataki, Aleksandr Touzov
SIAM Journal on Optimization, Volume 34, Issue 1, Page 977-1005, March 2024.
Abstract. A striking pathology of semidefinite programs (SDPs) is illustrated by a classical example of Khachiyan: feasible solutions in SDPs may need exponential space even to write down. Such exponential size solutions are the main obstacle to solving a long standing, fundamental open problem: can we decide feasibility of SDPs in polynomial time? The consensus seems that SDPs with large size solutions are rare. However, here we prove that they are actually quite common: a linear change of variables transforms every strictly feasible SDP into a Khachiyan type SDP, in which the leading variables are large. As to “how large,” that depends on the singularity degree of a dual problem. Further, we present some SDPs coming from sum-of-squares proofs, in which large solutions appear naturally, without any change of variables. We also partially answer the question how do we represent such large solutions in polynomial space?
SIAM 优化期刊》第 34 卷第 1 期第 977-1005 页,2024 年 3 月。 摘要。哈奇扬的一个经典例子说明了半无限程序(SDP)的一个显著病理:SDP 中的可行解甚至需要指数级的空间才能写下来。这种指数大小的解是解决一个长期存在的基本开放问题的主要障碍:我们能否在多项式时间内决定 SDP 的可行性?人们似乎一致认为,具有大尺寸解的 SDPs 很少见。然而,我们在这里证明,它们其实很常见:变量的线性变化会将每一个严格可行的 SDP 转化为哈奇扬类型的 SDP,其中前导变量都很大。至于 "有多大",这取决于对偶问题的奇异度。此外,我们还介绍了一些来自平方和证明的 SDP,在这些 SDP 中,无需改变变量,大解就会自然出现。我们还部分回答了如何在多项式空间中表示这种大解的问题?
{"title":"How Do Exponential Size Solutions Arise in Semidefinite Programming?","authors":"Gábor Pataki, Aleksandr Touzov","doi":"10.1137/21m1434945","DOIUrl":"https://doi.org/10.1137/21m1434945","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 977-1005, March 2024. <br/> Abstract. A striking pathology of semidefinite programs (SDPs) is illustrated by a classical example of Khachiyan: feasible solutions in SDPs may need exponential space even to write down. Such exponential size solutions are the main obstacle to solving a long standing, fundamental open problem: can we decide feasibility of SDPs in polynomial time? The consensus seems that SDPs with large size solutions are rare. However, here we prove that they are actually quite common: a linear change of variables transforms every strictly feasible SDP into a Khachiyan type SDP, in which the leading variables are large. As to “how large,” that depends on the singularity degree of a dual problem. Further, we present some SDPs coming from sum-of-squares proofs, in which large solutions appear naturally, without any change of variables. We also partially answer the question how do we represent such large solutions in polynomial space?","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140072290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning 双时间尺度随机优化框架在控制和强化学习中的应用
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-03-08 DOI: 10.1137/22m150277x
Sihan Zeng, Thinh T. Doan, Justin Romberg
SIAM Journal on Optimization, Volume 34, Issue 1, Page 946-976, March 2024.
Abstract. We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying Markov random processes controlled by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the iterates. In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution. While these two iterates are implemented simultaneously, the former is updated “faster” (using bigger step sizes) than the latter (using smaller step sizes). Our first contribution is to characterize the finite-time complexity of the proposed two-time-scale stochastic gradient method. In particular, we provide explicit formulas for the convergence rates of this method under different structural assumptions, namely, strong convexity, the Polyak–Łojasiewicz condition, and general nonconvexity. We apply our framework to policy optimization problems in control and reinforcement learning. First, we look at the infinite-horizon average-reward Markov decision process with finite state and action spaces and derive a convergence rate of [math] for the online actor-critic algorithm under function approximation, which recovers the best known rate derived specifically for this problem. Second, we study the linear-quadratic regulator and show that an online actor-critic method converges with rate [math]. Third, we use the actor-critic algorithm to solve the policy optimization problem in an entropy regularized Markov decision process, where we also establish a convergence of [math]. The results we derive for both the second and third problems are novel and previously unknown in the literature. Finally, we briefly present the application of our framework to gradient-based policy evaluation algorithms in reinforcement learning.
SIAM 优化期刊》,第 34 卷第 1 期,第 946-976 页,2024 年 3 月。 摘要我们研究了一种求解优化问题的新的双时间尺度随机梯度法,在该方法中,梯度是在由基础优化变量控制的时变马尔可夫随机过程产生的样本下借助辅助变量计算的。这些时变样本会使我们更新的梯度方向产生偏差和依赖性,从而可能导致迭代发散。在我们的双时间尺度方法中,一个尺度是从这些样本中估计真实梯度,然后用于更新最优解的估计值。虽然这两个迭代是同时进行的,但前者的更新(使用较大的步长)比后者(使用较小的步长)"更快"。我们的第一个贡献是描述了所提出的双时间尺度随机梯度法的有限时间复杂性。特别是,我们提供了该方法在不同结构假设(即强凸性、Polyak-Łojasiewicz 条件和一般非凸性)下的收敛率的明确公式。我们将我们的框架应用于控制和强化学习中的策略优化问题。首先,我们研究了具有有限状态和行动空间的无限视距平均回报马尔可夫决策过程,并推导出了函数近似下在线行动者批判算法的收敛率[math],这恢复了专门针对该问题推导出的已知最佳收敛率。其次,我们研究了线性二次调节器,并证明在线行动者批判方法的收敛率为 [math]。第三,我们使用行为批判算法来解决熵正则化马尔可夫决策过程中的政策优化问题,在此我们也建立了[math]的收敛性。我们对第二和第三个问题得出的结果都是新颖的,在以前的文献中是未知的。最后,我们简要介绍了我们的框架在强化学习中基于梯度的策略评估算法中的应用。
{"title":"A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning","authors":"Sihan Zeng, Thinh T. Doan, Justin Romberg","doi":"10.1137/22m150277x","DOIUrl":"https://doi.org/10.1137/22m150277x","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 946-976, March 2024. <br/> Abstract. We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying Markov random processes controlled by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the iterates. In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution. While these two iterates are implemented simultaneously, the former is updated “faster” (using bigger step sizes) than the latter (using smaller step sizes). Our first contribution is to characterize the finite-time complexity of the proposed two-time-scale stochastic gradient method. In particular, we provide explicit formulas for the convergence rates of this method under different structural assumptions, namely, strong convexity, the Polyak–Łojasiewicz condition, and general nonconvexity. We apply our framework to policy optimization problems in control and reinforcement learning. First, we look at the infinite-horizon average-reward Markov decision process with finite state and action spaces and derive a convergence rate of [math] for the online actor-critic algorithm under function approximation, which recovers the best known rate derived specifically for this problem. Second, we study the linear-quadratic regulator and show that an online actor-critic method converges with rate [math]. Third, we use the actor-critic algorithm to solve the policy optimization problem in an entropy regularized Markov decision process, where we also establish a convergence of [math]. The results we derive for both the second and third problems are novel and previously unknown in the literature. Finally, we briefly present the application of our framework to gradient-based policy evaluation algorithms in reinforcement learning.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140072202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Chain Rule for Strict Twice Epi-Differentiability and Its Applications 严格两次外差的连锁规则及其应用
IF 3.1 1区 数学 Q1 Mathematics Pub Date : 2024-02-29 DOI: 10.1137/22m1520025
Nguyen T. V. Hang, M. Ebrahim Sarabi
SIAM Journal on Optimization, Volume 34, Issue 1, Page 918-945, March 2024.
Abstract. The presence of second-order smoothness for objective functions of optimization problems can provide valuable information about their stability properties and help us design efficient numerical algorithms for solving these problems. Such second-order information, however, cannot be expected in various constrained and composite optimization problems since we often have to express their objective functions in terms of extended-real-valued functions for which the classical second derivative may not exist. One powerful geometrical tool to use for dealing with such functions is the concept of twice epi-differentiability. In this paper, we study a stronger version of this concept, called strict twice epi-differentiability. We characterize this concept for certain composite functions and use it to establish the equivalence of metric regularity and strong metric regularity for a class of generalized equations at their nondegenerate solutions. Finally, we present a characterization of continuous differentiability of the proximal mapping of our composite functions.
SIAM 优化期刊》,第 34 卷第 1 期,第 918-945 页,2024 年 3 月。 摘要优化问题目标函数的二阶平滑性可以提供有关其稳定性的宝贵信息,并帮助我们设计求解这些问题的高效数值算法。然而,在各种约束和复合优化问题中,这种二阶信息是无法预期的,因为我们通常必须用扩展实值函数来表达其目标函数,而这些函数的经典二阶导数可能并不存在。处理这类函数的一个强有力的几何工具是两次外差概念。在本文中,我们将研究这一概念的更强版本,即严格的两次表微分性。我们为某些复合函数描述了这一概念的特征,并利用它为一类广义方程的非生成解建立了等价的度量正则性和强度量正则性。最后,我们提出了复合函数近似映射的连续可微分性特征。
{"title":"A Chain Rule for Strict Twice Epi-Differentiability and Its Applications","authors":"Nguyen T. V. Hang, M. Ebrahim Sarabi","doi":"10.1137/22m1520025","DOIUrl":"https://doi.org/10.1137/22m1520025","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 918-945, March 2024. <br/> Abstract. The presence of second-order smoothness for objective functions of optimization problems can provide valuable information about their stability properties and help us design efficient numerical algorithms for solving these problems. Such second-order information, however, cannot be expected in various constrained and composite optimization problems since we often have to express their objective functions in terms of extended-real-valued functions for which the classical second derivative may not exist. One powerful geometrical tool to use for dealing with such functions is the concept of twice epi-differentiability. In this paper, we study a stronger version of this concept, called strict twice epi-differentiability. We characterize this concept for certain composite functions and use it to establish the equivalence of metric regularity and strong metric regularity for a class of generalized equations at their nondegenerate solutions. Finally, we present a characterization of continuous differentiability of the proximal mapping of our composite functions.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SIAM Journal on Optimization
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1