首页 > 最新文献

SIAM Journal on Optimization最新文献

英文 中文
On Integrality in Semidefinite Programming for Discrete Optimization 论离散优化半定量编程中的积分性
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-03-15 DOI: 10.1137/23m1580905
Frank de Meijer, Renata Sotirov
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1071-1096, March 2024.
Abstract. It is well known that by adding integrality constraints to the semidefinite programming (SDP) relaxation of the max-cut problem, the resulting integer semidefinite program is an exact formulation of the problem. In this paper we show similar results for a wide variety of discrete optimization problems for which SDP relaxations have been derived. Based on a comprehensive study on discrete positive semidefinite matrices, we introduce a generic approach to derive mixed-integer SDP (MISDP) formulations of binary quadratically constrained quadratic programs and binary quadratic matrix programs. Applying a problem-specific approach, we derive more compact MISDP formulations of several problems, such as the quadratic assignment problem, the graph partition problem, and the integer matrix completion problem. We also show that several structured problems allow for novel compact MISDP formulations through the notion of association schemes. Complementary to the recent advances on algorithmic aspects related to MISDP, this work opens new perspectives on solution approaches for the here considered problems.
SIAM 优化期刊》,第 34 卷,第 1 期,第 1071-1096 页,2024 年 3 月。 摘要众所周知,通过在最大割问题的半有限编程(SDP)松弛中添加积分约束,得到的整数半有限编程是该问题的精确表述。在本文中,我们对已得到 SDP 松弛的各种离散优化问题展示了类似的结果。基于对离散正半有限矩阵的全面研究,我们介绍了一种通用方法,用于推导二元二次约束二次方程程序和二元二次矩阵程序的混合整数 SDP (MISDP) 公式。应用针对具体问题的方法,我们推导出了一些问题更紧凑的 MISDP 公式,如二次赋值问题、图分割问题和整数矩阵完成问题。我们还表明,通过关联方案的概念,一些结构化问题可以得到新颖紧凑的 MISDP 公式。作为与 MISDP 相关的算法方面最新进展的补充,这项工作为本文所考虑问题的解决方法开辟了新的视角。
{"title":"On Integrality in Semidefinite Programming for Discrete Optimization","authors":"Frank de Meijer, Renata Sotirov","doi":"10.1137/23m1580905","DOIUrl":"https://doi.org/10.1137/23m1580905","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1071-1096, March 2024. <br/> Abstract. It is well known that by adding integrality constraints to the semidefinite programming (SDP) relaxation of the max-cut problem, the resulting integer semidefinite program is an exact formulation of the problem. In this paper we show similar results for a wide variety of discrete optimization problems for which SDP relaxations have been derived. Based on a comprehensive study on discrete positive semidefinite matrices, we introduce a generic approach to derive mixed-integer SDP (MISDP) formulations of binary quadratically constrained quadratic programs and binary quadratic matrix programs. Applying a problem-specific approach, we derive more compact MISDP formulations of several problems, such as the quadratic assignment problem, the graph partition problem, and the integer matrix completion problem. We also show that several structured problems allow for novel compact MISDP formulations through the notion of association schemes. Complementary to the recent advances on algorithmic aspects related to MISDP, this work opens new perspectives on solution approaches for the here considered problems.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"47 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140153146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Randomized Douglas–Rachford Methods for Linear Systems: Improved Accuracy and Efficiency 线性系统的随机化道格拉斯-拉赫福德方法:提高精度和效率
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-03-15 DOI: 10.1137/23m1567503
Deren Han, Yansheng Su, Jiaxin Xie
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1045-1070, March 2024.
Abstract. The Douglas–Rachford (DR) method is a widely used method for finding a point in the intersection of two closed convex sets (feasibility problem). However, the method converges weakly, and the associated rate of convergence is hard to analyze in general. In addition, the direct extension of the DR method for solving more-than-two-sets feasibility problems, called the [math]-sets-DR method, is not necessarily convergent. To improve the efficiency of the optimization algorithms, the introduction of randomization and the momentum technique has attracted increasing attention. In this paper, we propose the randomized [math]-sets-DR (RrDR) method for solving the feasibility problem derived from linear systems, showing the benefit of the randomization as it brings linear convergence in expectation to the otherwise divergent [math]-sets-DR method. Furthermore, the convergence rate does not depend on the dimension of the coefficient matrix. We also study RrDR with heavy ball momentum and establish its accelerated rate. Numerical experiments are provided to confirm our results and demonstrate the notable improvements in accuracy and efficiency of the DR method brought by the randomization and the momentum technique.
SIAM 优化期刊》,第 34 卷,第 1 期,第 1045-1070 页,2024 年 3 月。 摘要道格拉斯-拉克福德(Douglas-Rachford,DR)方法是一种广泛应用于寻找两个闭合凸集交点(可行性问题)的方法。然而,该方法的收敛性较弱,相关的收敛速率一般难以分析。此外,DR 方法的直接扩展用于求解多于两个集合的可行性问题,即[math]-sets-DR 方法,也不一定收敛。为了提高优化算法的效率,随机化和动量技术的引入引起了越来越多的关注。本文提出了随机化[math]-sets-DR(RrDR)方法,用于求解线性系统衍生的可行性问题,显示了随机化的好处,因为它给原本发散的[math]-sets-DR方法带来了期望值上的线性收敛。此外,收敛速度并不取决于系数矩阵的维度。我们还研究了重球动量下的 RrDR,并确定了其加速率。我们提供了数值实验来证实我们的结果,并证明随机化和动量技术显著提高了 DR 方法的精度和效率。
{"title":"Randomized Douglas–Rachford Methods for Linear Systems: Improved Accuracy and Efficiency","authors":"Deren Han, Yansheng Su, Jiaxin Xie","doi":"10.1137/23m1567503","DOIUrl":"https://doi.org/10.1137/23m1567503","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1045-1070, March 2024. <br/> Abstract. The Douglas–Rachford (DR) method is a widely used method for finding a point in the intersection of two closed convex sets (feasibility problem). However, the method converges weakly, and the associated rate of convergence is hard to analyze in general. In addition, the direct extension of the DR method for solving more-than-two-sets feasibility problems, called the [math]-sets-DR method, is not necessarily convergent. To improve the efficiency of the optimization algorithms, the introduction of randomization and the momentum technique has attracted increasing attention. In this paper, we propose the randomized [math]-sets-DR (RrDR) method for solving the feasibility problem derived from linear systems, showing the benefit of the randomization as it brings linear convergence in expectation to the otherwise divergent [math]-sets-DR method. Furthermore, the convergence rate does not depend on the dimension of the coefficient matrix. We also study RrDR with heavy ball momentum and establish its accelerated rate. Numerical experiments are provided to confirm our results and demonstrate the notable improvements in accuracy and efficiency of the DR method brought by the randomization and the momentum technique.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"69 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140152967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized Gradient Descent Maximization Method for Composite Nonconvex Strongly-Concave Minimax Problems 复合非凸强凹最小问题的分散梯度下降最大化方法
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-03-12 DOI: 10.1137/23m1558677
Yangyang Xu
SIAM Journal on Optimization, Volume 34, Issue 1, Page 1006-1044, March 2024.
Abstract. Minimax problems have recently attracted a lot of research interests. A few efforts have been made to solve decentralized nonconvex strongly-concave (NCSC) minimax-structured optimization; however, all of them focus on smooth problems with at most a constraint on the maximization variable. In this paper, we make the first attempt on solving composite NCSC minimax problems that can have convex nonsmooth terms on both minimization and maximization variables. Our algorithm is designed based on a novel reformulation of the decentralized minimax problem that introduces a multiplier to absorb the dual consensus constraint. The removal of dual consensus constraint enables the most aggressive (i.e., local maximization instead of a gradient ascent step) dual update that leads to the benefit of taking a larger primal stepsize and better complexity results. In addition, the decoupling of the nonsmoothness and consensus on the dual variable eases the analysis of a decentralized algorithm; thus our reformulation creates a new way for interested researchers to design new (and possibly more efficient) decentralized methods on solving NCSC minimax problems. We show a global convergence result of the proposed algorithm and an iteration complexity result to produce a (near) stationary point of the reformulation. Moreover, a relation is established between the (near) stationarities of the reformulation and the original formulation. With this relation, we show that when the dual regularizer is smooth, our algorithm can have lower complexity results (with reduced dependence on a condition number) than existing ones to produce a near-stationary point of the original formulation. Numerical experiments are conducted on a distributionally robust logistic regression to demonstrate the performance of the proposed algorithm.
SIAM 优化期刊》,第 34 卷第 1 期,第 1006-1044 页,2024 年 3 月。 摘要最小问题最近引起了很多研究兴趣。然而,所有这些研究都集中在平滑问题上,而且最大化变量上最多只有一个约束。在本文中,我们首次尝试求解复合 NCSC minimax 问题,这些问题可能在最小化变量和最大化变量上都存在凸非光滑项。我们的算法是基于对分散最小问题的一种新的重新表述而设计的,它引入了一个乘数来吸收双重共识约束。去除对偶共识约束后,就能进行最激进的对偶更新(即局部最大化,而不是梯度上升步骤),从而获得更大的原始步长和更好的复杂度结果。此外,对偶变量的非光滑性和共识的解耦简化了分散算法的分析;因此,我们的重新表述为感兴趣的研究人员设计新的(可能更有效的)分散方法来解决 NCSC minimax 问题提供了新的途径。我们展示了所提算法的全局收敛结果和迭代复杂度结果,从而得出了重构算法的(近)静止点。此外,我们还建立了重新计算的(近)静止点与原始计算之间的关系。利用这种关系,我们证明了当对偶正则器是平滑的时,我们的算法可以比现有算法得到更低的复杂度结果(对条件数的依赖性降低),从而产生原始公式的近静止点。我们对分布稳健的逻辑回归进行了数值实验,以证明所提算法的性能。
{"title":"Decentralized Gradient Descent Maximization Method for Composite Nonconvex Strongly-Concave Minimax Problems","authors":"Yangyang Xu","doi":"10.1137/23m1558677","DOIUrl":"https://doi.org/10.1137/23m1558677","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 1006-1044, March 2024. <br/> Abstract. Minimax problems have recently attracted a lot of research interests. A few efforts have been made to solve decentralized nonconvex strongly-concave (NCSC) minimax-structured optimization; however, all of them focus on smooth problems with at most a constraint on the maximization variable. In this paper, we make the first attempt on solving composite NCSC minimax problems that can have convex nonsmooth terms on both minimization and maximization variables. Our algorithm is designed based on a novel reformulation of the decentralized minimax problem that introduces a multiplier to absorb the dual consensus constraint. The removal of dual consensus constraint enables the most aggressive (i.e., local maximization instead of a gradient ascent step) dual update that leads to the benefit of taking a larger primal stepsize and better complexity results. In addition, the decoupling of the nonsmoothness and consensus on the dual variable eases the analysis of a decentralized algorithm; thus our reformulation creates a new way for interested researchers to design new (and possibly more efficient) decentralized methods on solving NCSC minimax problems. We show a global convergence result of the proposed algorithm and an iteration complexity result to produce a (near) stationary point of the reformulation. Moreover, a relation is established between the (near) stationarities of the reformulation and the original formulation. With this relation, we show that when the dual regularizer is smooth, our algorithm can have lower complexity results (with reduced dependence on a condition number) than existing ones to produce a near-stationary point of the original formulation. Numerical experiments are conducted on a distributionally robust logistic regression to demonstrate the performance of the proposed algorithm.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"4 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140117269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Do Exponential Size Solutions Arise in Semidefinite Programming? 半定式编程中如何产生指数大小的解决方案?
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-03-08 DOI: 10.1137/21m1434945
Gábor Pataki, Aleksandr Touzov
SIAM Journal on Optimization, Volume 34, Issue 1, Page 977-1005, March 2024.
Abstract. A striking pathology of semidefinite programs (SDPs) is illustrated by a classical example of Khachiyan: feasible solutions in SDPs may need exponential space even to write down. Such exponential size solutions are the main obstacle to solving a long standing, fundamental open problem: can we decide feasibility of SDPs in polynomial time? The consensus seems that SDPs with large size solutions are rare. However, here we prove that they are actually quite common: a linear change of variables transforms every strictly feasible SDP into a Khachiyan type SDP, in which the leading variables are large. As to “how large,” that depends on the singularity degree of a dual problem. Further, we present some SDPs coming from sum-of-squares proofs, in which large solutions appear naturally, without any change of variables. We also partially answer the question how do we represent such large solutions in polynomial space?
SIAM 优化期刊》第 34 卷第 1 期第 977-1005 页,2024 年 3 月。 摘要。哈奇扬的一个经典例子说明了半无限程序(SDP)的一个显著病理:SDP 中的可行解甚至需要指数级的空间才能写下来。这种指数大小的解是解决一个长期存在的基本开放问题的主要障碍:我们能否在多项式时间内决定 SDP 的可行性?人们似乎一致认为,具有大尺寸解的 SDPs 很少见。然而,我们在这里证明,它们其实很常见:变量的线性变化会将每一个严格可行的 SDP 转化为哈奇扬类型的 SDP,其中前导变量都很大。至于 "有多大",这取决于对偶问题的奇异度。此外,我们还介绍了一些来自平方和证明的 SDP,在这些 SDP 中,无需改变变量,大解就会自然出现。我们还部分回答了如何在多项式空间中表示这种大解的问题?
{"title":"How Do Exponential Size Solutions Arise in Semidefinite Programming?","authors":"Gábor Pataki, Aleksandr Touzov","doi":"10.1137/21m1434945","DOIUrl":"https://doi.org/10.1137/21m1434945","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 977-1005, March 2024. <br/> Abstract. A striking pathology of semidefinite programs (SDPs) is illustrated by a classical example of Khachiyan: feasible solutions in SDPs may need exponential space even to write down. Such exponential size solutions are the main obstacle to solving a long standing, fundamental open problem: can we decide feasibility of SDPs in polynomial time? The consensus seems that SDPs with large size solutions are rare. However, here we prove that they are actually quite common: a linear change of variables transforms every strictly feasible SDP into a Khachiyan type SDP, in which the leading variables are large. As to “how large,” that depends on the singularity degree of a dual problem. Further, we present some SDPs coming from sum-of-squares proofs, in which large solutions appear naturally, without any change of variables. We also partially answer the question how do we represent such large solutions in polynomial space?","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"65 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140072290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning 双时间尺度随机优化框架在控制和强化学习中的应用
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-03-08 DOI: 10.1137/22m150277x
Sihan Zeng, Thinh T. Doan, Justin Romberg
SIAM Journal on Optimization, Volume 34, Issue 1, Page 946-976, March 2024.
Abstract. We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying Markov random processes controlled by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the iterates. In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution. While these two iterates are implemented simultaneously, the former is updated “faster” (using bigger step sizes) than the latter (using smaller step sizes). Our first contribution is to characterize the finite-time complexity of the proposed two-time-scale stochastic gradient method. In particular, we provide explicit formulas for the convergence rates of this method under different structural assumptions, namely, strong convexity, the Polyak–Łojasiewicz condition, and general nonconvexity. We apply our framework to policy optimization problems in control and reinforcement learning. First, we look at the infinite-horizon average-reward Markov decision process with finite state and action spaces and derive a convergence rate of [math] for the online actor-critic algorithm under function approximation, which recovers the best known rate derived specifically for this problem. Second, we study the linear-quadratic regulator and show that an online actor-critic method converges with rate [math]. Third, we use the actor-critic algorithm to solve the policy optimization problem in an entropy regularized Markov decision process, where we also establish a convergence of [math]. The results we derive for both the second and third problems are novel and previously unknown in the literature. Finally, we briefly present the application of our framework to gradient-based policy evaluation algorithms in reinforcement learning.
SIAM 优化期刊》,第 34 卷第 1 期,第 946-976 页,2024 年 3 月。 摘要我们研究了一种求解优化问题的新的双时间尺度随机梯度法,在该方法中,梯度是在由基础优化变量控制的时变马尔可夫随机过程产生的样本下借助辅助变量计算的。这些时变样本会使我们更新的梯度方向产生偏差和依赖性,从而可能导致迭代发散。在我们的双时间尺度方法中,一个尺度是从这些样本中估计真实梯度,然后用于更新最优解的估计值。虽然这两个迭代是同时进行的,但前者的更新(使用较大的步长)比后者(使用较小的步长)"更快"。我们的第一个贡献是描述了所提出的双时间尺度随机梯度法的有限时间复杂性。特别是,我们提供了该方法在不同结构假设(即强凸性、Polyak-Łojasiewicz 条件和一般非凸性)下的收敛率的明确公式。我们将我们的框架应用于控制和强化学习中的策略优化问题。首先,我们研究了具有有限状态和行动空间的无限视距平均回报马尔可夫决策过程,并推导出了函数近似下在线行动者批判算法的收敛率[math],这恢复了专门针对该问题推导出的已知最佳收敛率。其次,我们研究了线性二次调节器,并证明在线行动者批判方法的收敛率为 [math]。第三,我们使用行为批判算法来解决熵正则化马尔可夫决策过程中的政策优化问题,在此我们也建立了[math]的收敛性。我们对第二和第三个问题得出的结果都是新颖的,在以前的文献中是未知的。最后,我们简要介绍了我们的框架在强化学习中基于梯度的策略评估算法中的应用。
{"title":"A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning","authors":"Sihan Zeng, Thinh T. Doan, Justin Romberg","doi":"10.1137/22m150277x","DOIUrl":"https://doi.org/10.1137/22m150277x","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 946-976, March 2024. <br/> Abstract. We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying Markov random processes controlled by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the iterates. In our two-time-scale approach, one scale is to estimate the true gradient from these samples, which is then used to update the estimate of the optimal solution. While these two iterates are implemented simultaneously, the former is updated “faster” (using bigger step sizes) than the latter (using smaller step sizes). Our first contribution is to characterize the finite-time complexity of the proposed two-time-scale stochastic gradient method. In particular, we provide explicit formulas for the convergence rates of this method under different structural assumptions, namely, strong convexity, the Polyak–Łojasiewicz condition, and general nonconvexity. We apply our framework to policy optimization problems in control and reinforcement learning. First, we look at the infinite-horizon average-reward Markov decision process with finite state and action spaces and derive a convergence rate of [math] for the online actor-critic algorithm under function approximation, which recovers the best known rate derived specifically for this problem. Second, we study the linear-quadratic regulator and show that an online actor-critic method converges with rate [math]. Third, we use the actor-critic algorithm to solve the policy optimization problem in an entropy regularized Markov decision process, where we also establish a convergence of [math]. The results we derive for both the second and third problems are novel and previously unknown in the literature. Finally, we briefly present the application of our framework to gradient-based policy evaluation algorithms in reinforcement learning.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"64 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140072202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Chain Rule for Strict Twice Epi-Differentiability and Its Applications 严格两次外差的连锁规则及其应用
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-02-29 DOI: 10.1137/22m1520025
Nguyen T. V. Hang, M. Ebrahim Sarabi
SIAM Journal on Optimization, Volume 34, Issue 1, Page 918-945, March 2024.
Abstract. The presence of second-order smoothness for objective functions of optimization problems can provide valuable information about their stability properties and help us design efficient numerical algorithms for solving these problems. Such second-order information, however, cannot be expected in various constrained and composite optimization problems since we often have to express their objective functions in terms of extended-real-valued functions for which the classical second derivative may not exist. One powerful geometrical tool to use for dealing with such functions is the concept of twice epi-differentiability. In this paper, we study a stronger version of this concept, called strict twice epi-differentiability. We characterize this concept for certain composite functions and use it to establish the equivalence of metric regularity and strong metric regularity for a class of generalized equations at their nondegenerate solutions. Finally, we present a characterization of continuous differentiability of the proximal mapping of our composite functions.
SIAM 优化期刊》,第 34 卷第 1 期,第 918-945 页,2024 年 3 月。 摘要优化问题目标函数的二阶平滑性可以提供有关其稳定性的宝贵信息,并帮助我们设计求解这些问题的高效数值算法。然而,在各种约束和复合优化问题中,这种二阶信息是无法预期的,因为我们通常必须用扩展实值函数来表达其目标函数,而这些函数的经典二阶导数可能并不存在。处理这类函数的一个强有力的几何工具是两次外差概念。在本文中,我们将研究这一概念的更强版本,即严格的两次表微分性。我们为某些复合函数描述了这一概念的特征,并利用它为一类广义方程的非生成解建立了等价的度量正则性和强度量正则性。最后,我们提出了复合函数近似映射的连续可微分性特征。
{"title":"A Chain Rule for Strict Twice Epi-Differentiability and Its Applications","authors":"Nguyen T. V. Hang, M. Ebrahim Sarabi","doi":"10.1137/22m1520025","DOIUrl":"https://doi.org/10.1137/22m1520025","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 918-945, March 2024. <br/> Abstract. The presence of second-order smoothness for objective functions of optimization problems can provide valuable information about their stability properties and help us design efficient numerical algorithms for solving these problems. Such second-order information, however, cannot be expected in various constrained and composite optimization problems since we often have to express their objective functions in terms of extended-real-valued functions for which the classical second derivative may not exist. One powerful geometrical tool to use for dealing with such functions is the concept of twice epi-differentiability. In this paper, we study a stronger version of this concept, called strict twice epi-differentiability. We characterize this concept for certain composite functions and use it to establish the equivalence of metric regularity and strong metric regularity for a class of generalized equations at their nondegenerate solutions. Finally, we present a characterization of continuous differentiability of the proximal mapping of our composite functions.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"69 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximating Higher-Order Derivative Tensors Using Secant Updates 利用 Secant 更新逼近高阶微分张量
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-02-28 DOI: 10.1137/23m1549687
Karl Welzel, Raphael A. Hauser
SIAM Journal on Optimization, Volume 34, Issue 1, Page 893-917, March 2024.
Abstract. Quasi-Newton methods employ an update rule that gradually improves the Hessian approximation using the already available gradient evaluations. We propose higher-order secant updates which generalize this idea to higher-order derivatives, approximating, for example, third derivatives (which are tensors) from given Hessian evaluations. Our generalization is based on the observation that quasi-Newton updates are least-change updates satisfying the secant equation, with different methods using different norms to measure the size of the change. We present a full characterization for least-change updates in weighted Frobenius norms (satisfying an analogue of the secant equation) for derivatives of arbitrary order. Moreover, we establish convergence of the approximations to the true derivative under standard assumptions and explore the quality of the generated approximations in numerical experiments.
SIAM 优化期刊》,第 34 卷,第 1 期,第 893-917 页,2024 年 3 月。 摘要。准牛顿方法采用一种更新规则,利用已有的梯度评估逐步改进赫塞斯近似值。我们提出的高阶正割更新将这一思想推广到高阶导数,例如,从给定的 Hessian 评估中逼近三阶导数(三阶导数是张量)。我们的概括基于以下观察:准牛顿更新是满足secant方程的最小变化更新,不同的方法使用不同的规范来衡量变化的大小。对于任意阶的导数,我们提出了加权弗罗贝尼斯规范(满足secant方程的类似方法)中最小变化更新的完整特征。此外,我们还确定了在标准假设下近似值对真实导数的收敛性,并在数值实验中探索了生成的近似值的质量。
{"title":"Approximating Higher-Order Derivative Tensors Using Secant Updates","authors":"Karl Welzel, Raphael A. Hauser","doi":"10.1137/23m1549687","DOIUrl":"https://doi.org/10.1137/23m1549687","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 893-917, March 2024. <br/> Abstract. Quasi-Newton methods employ an update rule that gradually improves the Hessian approximation using the already available gradient evaluations. We propose higher-order secant updates which generalize this idea to higher-order derivatives, approximating, for example, third derivatives (which are tensors) from given Hessian evaluations. Our generalization is based on the observation that quasi-Newton updates are least-change updates satisfying the secant equation, with different methods using different norms to measure the size of the change. We present a full characterization for least-change updates in weighted Frobenius norms (satisfying an analogue of the secant equation) for derivatives of arbitrary order. Moreover, we establish convergence of the approximations to the true derivative under standard assumptions and explore the quality of the generated approximations in numerical experiments.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"77 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous Selections of Solutions to Parametric Variational Inequalities 参数变分不等式解的连续选择
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-02-28 DOI: 10.1137/22m1514982
Shaoning Han, Jong-Shi Pang
SIAM Journal on Optimization, Volume 34, Issue 1, Page 870-892, March 2024.
Abstract. This paper studies the existence of a (Lipschitz) continuous (single-valued) solution function of parametric variational inequalities under functional and constraint perturbations. At the most elementary level, this issue can be explained from classical parametric linear programming and its resolution by the parametric simplex method, which computes a solution trajectory of the problem when the objective coefficients and the right-hand sides of the constraints are parameterized by a single scalar parameter. The computed optimal solution vector (and not the optimal objective value) is a continuous piecewise affine function in the parameter when the objective coefficients are kept constant, whereas the computed solution vector can be discontinuous when the right-hand constraint coefficients are kept fixed and there is a basis change at a critical value of the parameter in the objective. We investigate this issue more broadly first in the context of an affine variational inequality (AVI) and obtain results that go beyond those pertaining to the lower semicontinuity of the solution map with joint vector perturbations; the latter property is closely tied to a stability theory of a parametric AVI and in particular to Robinson’s seminal concept of strong regularity. Extensions to nonlinear variational inequalities is also investigated without requiring solution uniqueness (and therefore applicable to nonstrongly regular problems). The role of solution uniqueness in this issue of continuous single-valued solution selection is further clarified.
SIAM 优化期刊》,第 34 卷,第 1 期,第 870-892 页,2024 年 3 月。 摘要本文研究了参数变分不等式在函数和约束扰动下的(Lipschitz)连续(单值)解函数的存在性。在最基本的层面上,这个问题可以从经典参数线性规划及其参数单纯形法的解决方法中得到解释,当目标系数和约束条件的右侧由单一标量参数参数化时,参数单纯形法计算问题的解轨迹。当目标系数保持不变时,计算出的最优解向量(而非最优目标值)是参数中连续的片断仿射函数;而当右侧约束系数保持不变,且目标中参数的临界值发生基础变化时,计算出的解向量可能是不连续的。我们首先在仿射变分不等式(AVI)的背景下对这一问题进行了更广泛的研究,得到的结果超越了与联合向量扰动解图的下半连续性有关的结果;后者的性质与参数变分不等式的稳定性理论,特别是与罗宾逊的强正则性开创性概念密切相关。此外,还研究了非线性变分不等式的扩展,而不要求解的唯一性(因此适用于非强正则性问题)。解唯一性在连续单值解选择问题中的作用得到了进一步澄清。
{"title":"Continuous Selections of Solutions to Parametric Variational Inequalities","authors":"Shaoning Han, Jong-Shi Pang","doi":"10.1137/22m1514982","DOIUrl":"https://doi.org/10.1137/22m1514982","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 870-892, March 2024. <br/> Abstract. This paper studies the existence of a (Lipschitz) continuous (single-valued) solution function of parametric variational inequalities under functional and constraint perturbations. At the most elementary level, this issue can be explained from classical parametric linear programming and its resolution by the parametric simplex method, which computes a solution trajectory of the problem when the objective coefficients and the right-hand sides of the constraints are parameterized by a single scalar parameter. The computed optimal solution vector (and not the optimal objective value) is a continuous piecewise affine function in the parameter when the objective coefficients are kept constant, whereas the computed solution vector can be discontinuous when the right-hand constraint coefficients are kept fixed and there is a basis change at a critical value of the parameter in the objective. We investigate this issue more broadly first in the context of an affine variational inequality (AVI) and obtain results that go beyond those pertaining to the lower semicontinuity of the solution map with joint vector perturbations; the latter property is closely tied to a stability theory of a parametric AVI and in particular to Robinson’s seminal concept of strong regularity. Extensions to nonlinear variational inequalities is also investigated without requiring solution uniqueness (and therefore applicable to nonstrongly regular problems). The role of solution uniqueness in this issue of continuous single-valued solution selection is further clarified.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"12 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140004166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample Size Estimates for Risk-Neutral Semilinear PDE-Constrained Optimization 风险中性半线性 PDE 受限优化的样本量估计
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-02-23 DOI: 10.1137/22m1512636
Johannes Milz, Michael Ulbrich
SIAM Journal on Optimization, Volume 34, Issue 1, Page 844-869, March 2024.
Abstract. The sample average approximation (SAA) approach is applied to risk-neutral optimization problems governed by semilinear elliptic partial differential equations with random inputs. After constructing a compact set that contains the SAA critical points, we derive nonasymptotic sample size estimates for SAA critical points using the covering number approach. Thereby, we derive upper bounds on the number of samples needed to obtain accurate critical points of the risk-neutral PDE-constrained optimization problem through SAA critical points. We quantify accuracy using expectation and exponential tail bounds. Numerical illustrations are presented.
SIAM 优化期刊》第 34 卷第 1 期第 844-869 页,2024 年 3 月。 摘要。样本平均近似(SAA)方法适用于由随机输入的半线性椭圆偏微分方程控制的风险中性优化问题。在构建了包含 SAA 临界点的紧凑集之后,我们利用覆盖数方法推导出了 SAA 临界点的非渐近样本大小估计值。因此,我们推导出了通过 SAA 临界点获得风险中性 PDE 受限优化问题准确临界点所需的样本数量上限。我们使用期望值和指数尾边界来量化精确度。并给出了数值说明。
{"title":"Sample Size Estimates for Risk-Neutral Semilinear PDE-Constrained Optimization","authors":"Johannes Milz, Michael Ulbrich","doi":"10.1137/22m1512636","DOIUrl":"https://doi.org/10.1137/22m1512636","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 844-869, March 2024. <br/> Abstract. The sample average approximation (SAA) approach is applied to risk-neutral optimization problems governed by semilinear elliptic partial differential equations with random inputs. After constructing a compact set that contains the SAA critical points, we derive nonasymptotic sample size estimates for SAA critical points using the covering number approach. Thereby, we derive upper bounds on the number of samples needed to obtain accurate critical points of the risk-neutral PDE-constrained optimization problem through SAA critical points. We quantify accuracy using expectation and exponential tail bounds. Numerical illustrations are presented.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"255 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139951949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subset Selection and the Cone of Factor-Width-k Matrices 子集选择和因子宽度-k 矩阵的锥形
IF 3.1 1区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-02-22 DOI: 10.1137/23m1549444
Walid Ben-Ameur
SIAM Journal on Optimization, Volume 34, Issue 1, Page 817-843, March 2024.
Abstract. We study the cone of factor-width-[math] matrices, where the factor width of a positive semidefinite matrix is defined as the smallest number [math] allowing it to be expressed as a sum of positive semidefinite matrices that are nonzero only on a single [math] principal submatrix. Two hierarchies of approximations are proposed for this cone. Some theoretical bounds to assess the quality of the new approximations are derived. We also use these approximations to build convex conic relaxations for the subset selection problem where one has to minimize [math] under the constraint that [math] has at most [math] nonzero components. Several numerical experiments are performed showing that some of these relaxations provide a good compromise between tightness and computational complexity and rank well compared to perspective-type relaxations.
SIAM 优化期刊》,第 34 卷,第 1 期,第 817-843 页,2024 年 3 月。 摘要。我们研究了因子宽度-[math] 矩阵的锥体,其中正半inite 矩阵的因子宽度被定义为最小的[math]数,允许将其表示为仅在单个[math]主子矩阵上不为零的正半inite 矩阵之和。针对这一锥体提出了两种近似等级。我们还推导出了一些评估新近似值质量的理论边界。我们还利用这些近似值建立了子集选择问题的凸圆锥松弛,在这个问题中,我们必须在[math]最多有[math]个非零分量的约束条件下最小化[math]。几个数值实验表明,其中一些松弛方法在严密性和计算复杂性之间取得了很好的折衷,与透视型松弛方法相比,它们的效果也很好。
{"title":"Subset Selection and the Cone of Factor-Width-k Matrices","authors":"Walid Ben-Ameur","doi":"10.1137/23m1549444","DOIUrl":"https://doi.org/10.1137/23m1549444","url":null,"abstract":"SIAM Journal on Optimization, Volume 34, Issue 1, Page 817-843, March 2024. <br/> Abstract. We study the cone of factor-width-[math] matrices, where the factor width of a positive semidefinite matrix is defined as the smallest number [math] allowing it to be expressed as a sum of positive semidefinite matrices that are nonzero only on a single [math] principal submatrix. Two hierarchies of approximations are proposed for this cone. Some theoretical bounds to assess the quality of the new approximations are derived. We also use these approximations to build convex conic relaxations for the subset selection problem where one has to minimize [math] under the constraint that [math] has at most [math] nonzero components. Several numerical experiments are performed showing that some of these relaxations provide a good compromise between tightness and computational complexity and rank well compared to perspective-type relaxations.","PeriodicalId":49529,"journal":{"name":"SIAM Journal on Optimization","volume":"48 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139952160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SIAM Journal on Optimization
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1