首页 > 最新文献

Mathematical Programming最新文献

英文 中文
Generalized Nash equilibrium problems with mixed-integer variables 具有混合整数变量的广义纳什均衡问题
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-13 DOI: 10.1007/s10107-024-02063-6

Abstract

We consider generalized Nash equilibrium problems (GNEPs) with non-convex strategy spaces and non-convex cost functions. This general class of games includes the important case of games with mixed-integer variables for which only a few results are known in the literature. We present a new approach to characterize equilibria via a convexification technique using the Nikaido–Isoda function. To any given instance of the GNEP, we construct a set of convexified instances and show that a feasible strategy profile is an equilibrium for the original instance if and only if it is an equilibrium for any convexified instance and the convexified cost functions coincide with the initial ones. We develop this convexification approach along three dimensions: We first show that for quasi-linear models, where a convexified instance exists in which for fixed strategies of the opponent players, the cost function of every player is linear and the respective strategy space is polyhedral, the convexification reduces the GNEP to a standard (non-linear) optimization problem. Secondly, we derive two complete characterizations of those GNEPs for which the convexification leads to a jointly constrained or a jointly convex GNEP, respectively. These characterizations require new concepts related to the interplay of the convex hull operator applied to restricted subsets of feasible strategies and may be interesting on their own. Note that this characterization is also computationally relevant as jointly convex GNEPs have been extensively studied in the literature. Finally, we demonstrate the applicability of our results by presenting a numerical study regarding the computation of equilibria for three classes of GNEPs related to integral network flows and discrete market equilibria.

摘要 我们考虑的是具有非凸策略空间和非凸成本函数的广义纳什均衡问题(GNEPs)。这一类博弈包括具有混合整数变量的重要博弈,文献中仅有少数几个结果。我们提出了一种新方法,通过使用 Nikaido-Isoda 函数的凸化技术来表征均衡。对于任何给定的 GNEP 实例,我们都会构建一组凸化实例,并证明当且仅当一个可行策略剖面是任何凸化实例的均衡且凸化成本函数与初始函数重合时,该策略剖面才是原始实例的均衡。我们从三个维度发展了这种凸化方法:我们首先证明,对于准线性模型,即存在一个凸化实例,其中对于对手棋手的固定策略,每个棋手的成本函数都是线性的,并且各自的策略空间都是多面体的,凸化将 GNEP 简化为一个标准(非线性)优化问题。其次,我们对凸化分别导致联合约束或联合凸GNEP的GNEP进行了两个完整的描述。这些特征需要与应用于可行策略受限子集的凸壳算子的相互作用有关的新概念,它们本身可能也很有趣。需要注意的是,这种表征在计算上也是相关的,因为文献中已经对共凸 GNEP 进行了广泛研究。最后,我们通过对与积分网络流和离散市场均衡相关的三类 GNEP 的均衡计算进行数值研究,证明了我们结果的适用性。
{"title":"Generalized Nash equilibrium problems with mixed-integer variables","authors":"","doi":"10.1007/s10107-024-02063-6","DOIUrl":"https://doi.org/10.1007/s10107-024-02063-6","url":null,"abstract":"<h3>Abstract</h3> <p>We consider generalized Nash equilibrium problems (GNEPs) with non-convex strategy spaces and non-convex cost functions. This general class of games includes the important case of games with mixed-integer variables for which only a few results are known in the literature. We present a new approach to characterize equilibria via a convexification technique using the Nikaido–Isoda function. To any given instance of the GNEP, we construct a set of convexified instances and show that a feasible strategy profile is an equilibrium for the original instance if and only if it is an equilibrium for any convexified instance and the convexified cost functions coincide with the initial ones. We develop this convexification approach along three dimensions: We first show that for <em>quasi-linear</em> models, where a convexified instance exists in which for fixed strategies of the opponent players, the cost function of every player is linear and the respective strategy space is polyhedral, the convexification reduces the GNEP to a standard (non-linear) optimization problem. Secondly, we derive two complete characterizations of those GNEPs for which the convexification leads to a jointly constrained or a jointly convex GNEP, respectively. These characterizations require new concepts related to the interplay of the convex hull operator applied to restricted subsets of feasible strategies and may be interesting on their own. Note that this characterization is also computationally relevant as jointly convex GNEPs have been extensively studied in the literature. Finally, we demonstrate the applicability of our results by presenting a numerical study regarding the computation of equilibria for three classes of GNEPs related to integral network flows and discrete market equilibria.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"109 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140128067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Chvátal–Gomory procedure for integer SDPs with applications in combinatorial optimization 整数 SDP 的 Chvátal-Gomory 程序及其在组合优化中的应用
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-13 DOI: 10.1007/s10107-024-02069-0

Abstract

In this paper we study the well-known Chvátal–Gomory (CG) procedure for the class of integer semidefinite programs (ISDPs). We prove several results regarding the hierarchy of relaxations obtained by iterating this procedure. We also study different formulations of the elementary closure of spectrahedra. A polyhedral description of the elementary closure for a specific type of spectrahedra is derived by exploiting total dual integrality for SDPs. Moreover, we show how to exploit (strengthened) CG cuts in a branch-and-cut framework for ISDPs. Different from existing algorithms in the literature, the separation routine in our approach exploits both the semidefinite and the integrality constraints. We provide separation routines for several common classes of binary SDPs resulting from combinatorial optimization problems. In the second part of the paper we present a comprehensive application of our approach to the quadratic traveling salesman problem (QTSP). Based on the algebraic connectivity of the directed Hamiltonian cycle, two ISDPs that model the QTSP are introduced. We show that the CG cuts resulting from these formulations contain several well-known families of cutting planes. Numerical results illustrate the practical strength of the CG cuts in our branch-and-cut algorithm, which outperforms alternative ISDP solvers and is able to solve large QTSP instances to optimality.

摘要 本文研究了著名的整数半定式程序(ISDP)的 Chvátal-Gomory (CG) 过程。我们证明了通过迭代该程序所得到的松弛层次的几个结果。我们还研究了谱的基本封闭的不同形式。通过利用 SDP 的总对偶积分性,我们得出了特定类型谱的基本封闭的多面体描述。此外,我们还展示了如何在分支切割框架中利用(加强的)CG 切割来处理 ISDP。与文献中的现有算法不同,我们方法中的分离例程同时利用了半有限性和积分性约束。我们为组合优化问题中常见的几类二元 SDP 提供了分离例程。在论文的第二部分,我们介绍了我们的方法在二次旅行推销员问题(QTSP)中的综合应用。基于有向哈密顿循环的代数连接性,我们引入了两个模拟 QTSP 的 ISDP。我们证明,由这些公式产生的 CG 切分包含几个著名的切分平面族。数值结果表明了 CG 切分在我们的分支-切分算法中的实用性,该算法的性能优于其他 ISDP 求解器,能够最优地求解大型 QTSP 实例。
{"title":"The Chvátal–Gomory procedure for integer SDPs with applications in combinatorial optimization","authors":"","doi":"10.1007/s10107-024-02069-0","DOIUrl":"https://doi.org/10.1007/s10107-024-02069-0","url":null,"abstract":"<h3>Abstract</h3> <p>In this paper we study the well-known Chvátal–Gomory (CG) procedure for the class of integer semidefinite programs (ISDPs). We prove several results regarding the hierarchy of relaxations obtained by iterating this procedure. We also study different formulations of the elementary closure of spectrahedra. A polyhedral description of the elementary closure for a specific type of spectrahedra is derived by exploiting total dual integrality for SDPs. Moreover, we show how to exploit (strengthened) CG cuts in a branch-and-cut framework for ISDPs. Different from existing algorithms in the literature, the separation routine in our approach exploits both the semidefinite and the integrality constraints. We provide separation routines for several common classes of binary SDPs resulting from combinatorial optimization problems. In the second part of the paper we present a comprehensive application of our approach to the quadratic traveling salesman problem (<span>QTSP</span>). Based on the algebraic connectivity of the directed Hamiltonian cycle, two ISDPs that model the <span>QTSP</span> are introduced. We show that the CG cuts resulting from these formulations contain several well-known families of cutting planes. Numerical results illustrate the practical strength of the CG cuts in our branch-and-cut algorithm, which outperforms alternative ISDP solvers and is able to solve large <span>QTSP</span> instances to optimality.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"2017 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140127998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized minimum 0-extension problem and discrete convexity 广义最小 0-扩展问题和离散凸性
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-07 DOI: 10.1007/s10107-024-02064-5
Martin Dvorak, Vladimir Kolmogorov

Given a fixed finite metric space ((V,mu )), the minimum 0-extension problem, denoted as (mathtt{0hbox {-}Ext}[{mu }]), is equivalent to the following optimization problem: minimize function of the form (min nolimits _{xin V^n} sum _i f_i(x_i) + sum _{ij} c_{ij}hspace{0.5pt}mu (x_i,x_j)) where (f_i:Vrightarrow mathbb {R}) are functions given by (f_i(x_i)=sum _{vin V} c_{vi}hspace{0.5pt}mu (x_i,v)) and (c_{ij},c_{vi}) are given nonnegative costs. The computational complexity of (mathtt{0hbox {-}Ext}[{mu }]) has been recently established by Karzanov and by Hirai: if metric (mu ) is orientable modular then (mathtt{0hbox {-}Ext}[{mu }]) can be solved in polynomial time, otherwise (mathtt{0hbox {-}Ext}[{mu }]) is NP-hard. To prove the tractability part, Hirai developed a theory of discrete convex functions on orientable modular graphs generalizing several known classes of functions in discrete convex analysis, such as (L^natural )-convex functions. We consider a more general version of the problem in which unary functions (f_i(x_i)) can additionally have terms of the form (c_{uv;i}hspace{0.5pt}mu (x_i,{u,v})) for ({u,!hspace{0.5pt}hspace{0.5pt}v}in F), where set (Fsubseteq left( {begin{array}{c}V 2end{array}}right) ) is fixed. We extend the complexity classification above by providing an explicit condition on ((mu ,F)) for the problem to be tractable. In order to prove the tractability part, we generalize Hirai’s theory and define a larger class of discrete convex functions. It covers, in particular, another well-known class of functions, namely submodular functions on an integer lattice. Finally, we improve the complexity of Hirai’s algorithm for solving (mathtt{0hbox {-}Ext}[{mu }]) on orientable modular graphs.

给定一个固定的有限度量空间((V,mu )),最小0-扩展问题,表示为((mathtt{0hbox {-}Ext}[{mu }]),等价于下面的优化问题:最小化函数的形式((min nolimits _{xin V^n}sum _i f_i(x_i) + sum _{ij} c_{ij}hspace{0.5pt}mu (x_i,x_j)) 其中 (f_i:Vrightarrow mathbb {R}) 是由(f_i(x_i)=sum _{vin V} c_{vi}hspace{0.5pt}mu (x_i,v)) 和 (c_{ij},c_{vi}) 都是非负成本。Karzanov 和 Hirai 最近确定了 (mathtt{0hbox {-}Ext}[{mu }])的计算复杂度:如果度量 (mu ) 是可定向的模态,那么 (mathtt{0hbox {-}Ext}[{mu }])可以在多项式时间内求解,否则 (mathtt{0hbox {-}Ext}[{mu }])就是 NP 难的。为了证明可操作性部分,Hirai 发展了可定向模块图上的离散凸函数理论,概括了离散凸分析中的几类已知函数,如 (L^natural )-凸函数。我们考虑了问题的一个更一般的版本,其中一元函数 (f_i(x_i))可以额外具有形式为 (c_{uv;i}hspace{0.5pt}mu (x_i,{u,v})) for ({u,!hspace{0.5pt}hspace{0.5pt}v}in F), 其中集合 (Fsubseteq left( {begin{array}{c}V 2end{array}right) ) 是固定的。我们扩展了上面的复杂性分类,为问题的可操作性提供了一个明确的条件((mu ,F))。为了证明可处理性,我们推广了平井的理论,定义了一类更大的离散凸函数。它尤其涵盖了另一类众所周知的函数,即整数网格上的子模函数。最后,我们改进了平井算法求解可定向模块图上的(mathtt{0hbox {-}Ext}[{mu }])的复杂性。
{"title":"Generalized minimum 0-extension problem and discrete convexity","authors":"Martin Dvorak, Vladimir Kolmogorov","doi":"10.1007/s10107-024-02064-5","DOIUrl":"https://doi.org/10.1007/s10107-024-02064-5","url":null,"abstract":"<p>Given a fixed finite metric space <span>((V,mu ))</span>, the <i>minimum 0-extension problem</i>, denoted as <span>(mathtt{0hbox {-}Ext}[{mu }])</span>, is equivalent to the following optimization problem: minimize function of the form <span>(min nolimits _{xin V^n} sum _i f_i(x_i) + sum _{ij} c_{ij}hspace{0.5pt}mu (x_i,x_j))</span> where <span>(f_i:Vrightarrow mathbb {R})</span> are functions given by <span>(f_i(x_i)=sum _{vin V} c_{vi}hspace{0.5pt}mu (x_i,v))</span> and <span>(c_{ij},c_{vi})</span> are given nonnegative costs. The computational complexity of <span>(mathtt{0hbox {-}Ext}[{mu }])</span> has been recently established by Karzanov and by Hirai: if metric <span>(mu )</span> is <i>orientable modular</i> then <span>(mathtt{0hbox {-}Ext}[{mu }])</span> can be solved in polynomial time, otherwise <span>(mathtt{0hbox {-}Ext}[{mu }])</span> is NP-hard. To prove the tractability part, Hirai developed a theory of discrete convex functions on orientable modular graphs generalizing several known classes of functions in discrete convex analysis, such as <span>(L^natural )</span>-convex functions. We consider a more general version of the problem in which unary functions <span>(f_i(x_i))</span> can additionally have terms of the form <span>(c_{uv;i}hspace{0.5pt}mu (x_i,{u,v}))</span> for <span>({u,!hspace{0.5pt}hspace{0.5pt}v}in F)</span>, where set <span>(Fsubseteq left( {begin{array}{c}V 2end{array}}right) )</span> is fixed. We extend the complexity classification above by providing an explicit condition on <span>((mu ,F))</span> for the problem to be tractable. In order to prove the tractability part, we generalize Hirai’s theory and define a larger class of discrete convex functions. It covers, in particular, another well-known class of functions, namely submodular functions on an integer lattice. Finally, we improve the complexity of Hirai’s algorithm for solving <span>(mathtt{0hbox {-}Ext}[{mu }])</span> on orientable modular graphs.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"87 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140074954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Level constrained first order methods for function constrained optimization 函数约束优化的水平约束一阶方法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-06 DOI: 10.1007/s10107-024-02057-4
Digvijay Boob, Qi Deng, Guanghui Lan

We present a new feasible proximal gradient method for constrained optimization where both the objective and constraint functions are given by summation of a smooth, possibly nonconvex function and a convex simple function. The algorithm converts the original problem into a sequence of convex subproblems. Formulating those subproblems requires the evaluation of at most one gradient-value of the original objective and constraint functions. Either exact or approximate subproblems solutions can be computed efficiently in many cases. An important feature of the algorithm is the constraint level parameter. By carefully increasing this level for each subproblem, we provide a simple solution to overcome the challenge of bounding the Lagrangian multipliers and show that the algorithm follows a strictly feasible solution path till convergence to the stationary point. We develop a simple, proximal gradient descent type analysis, showing that the complexity bound of this new algorithm is comparable to gradient descent for the unconstrained setting which is new in the literature. Exploiting this new design and analysis technique, we extend our algorithms to some more challenging constrained optimization problems where (1) the objective is a stochastic or finite-sum function, and (2) structured nonsmooth functions replace smooth components of both objective and constraint functions. Complexity results for these problems also seem to be new in the literature. Finally, our method can also be applied to convex function constrained problems where we show complexities similar to the proximal gradient method.

我们提出了一种新的可行近似梯度法,用于目标函数和约束函数都由一个平滑的、可能是非凸函数和一个凸简单函数求和给出的约束优化。该算法将原始问题转化为一系列凸子问题。提出这些子问题时,最多需要评估原始目标函数和约束函数的一个梯度值。在许多情况下,精确或近似的子问题解决方案都可以高效地计算出来。该算法的一个重要特点是约束水平参数。通过小心地增加每个子问题的约束水平参数,我们提供了一个简单的解决方案,以克服约束拉格朗日乘数的挑战,并证明该算法遵循严格可行的求解路径,直至收敛到静止点。我们开发了一种简单的近似梯度下降类型分析,表明这种新算法的复杂度约束与文献中新出现的无约束环境下的梯度下降算法相当。利用这一新的设计和分析技术,我们将算法扩展到了一些更具挑战性的约束优化问题,在这些问题中,(1) 目标是随机或有限和函数,(2) 结构化非光滑函数取代了目标和约束函数的光滑成分。这些问题的复杂性结果在文献中似乎也是全新的。最后,我们的方法还可以应用于凸函数约束问题,在这些问题中,我们展示了与近似梯度法类似的复杂性。
{"title":"Level constrained first order methods for function constrained optimization","authors":"Digvijay Boob, Qi Deng, Guanghui Lan","doi":"10.1007/s10107-024-02057-4","DOIUrl":"https://doi.org/10.1007/s10107-024-02057-4","url":null,"abstract":"<p>We present a new feasible proximal gradient method for constrained optimization where both the objective and constraint functions are given by summation of a smooth, possibly nonconvex function and a convex simple function. The algorithm converts the original problem into a sequence of convex subproblems. Formulating those subproblems requires the evaluation of at most one gradient-value of the original objective and constraint functions. Either exact or approximate subproblems solutions can be computed efficiently in many cases. An important feature of the algorithm is the constraint level parameter. By carefully increasing this level for each subproblem, we provide a simple solution to overcome the challenge of bounding the Lagrangian multipliers and show that the algorithm follows a strictly feasible solution path till convergence to the stationary point. We develop a simple, proximal gradient descent type analysis, showing that the complexity bound of this new algorithm is comparable to gradient descent for the unconstrained setting which is new in the literature. Exploiting this new design and analysis technique, we extend our algorithms to some more challenging constrained optimization problems where (1) the objective is a stochastic or finite-sum function, and (2) structured nonsmooth functions replace smooth components of both objective and constraint functions. Complexity results for these problems also seem to be new in the literature. Finally, our method can also be applied to convex function constrained problems where we show complexities similar to the proximal gradient method.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"43 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140054873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polyhedral properties of RLT relaxations of nonconvex quadratic programs and their implications on exact relaxations 非凸二次方程程 RLT 松弛的多面体特性及其对精确松弛的影响
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-06 DOI: 10.1007/s10107-024-02070-7
Yuzhou Qiu, E. Alper Yıldırım

We study linear programming relaxations of nonconvex quadratic programs given by the reformulation–linearization technique (RLT), referred to as RLT relaxations. We investigate the relations between the polyhedral properties of the feasible regions of a quadratic program and its RLT relaxation. We establish various connections between recession directions, boundedness, and vertices of the two feasible regions. Using these properties, we present a complete description of the set of instances that admit an exact RLT relaxation. We then give a thorough discussion of how our results can be converted into simple algorithmic procedures to construct instances of quadratic programs with exact, inexact, or unbounded RLT relaxations.

我们研究的是重整线性化技术(RLT)给出的非凸二次方程程序的线性规划松弛,简称为 RLT 松弛。我们研究了二次型程序可行区域的多面体特性与其 RLT 松弛之间的关系。我们在两个可行区域的衰退方向、有界性和顶点之间建立了各种联系。利用这些性质,我们完整地描述了允许精确 RLT 松弛的实例集。然后,我们深入讨论了如何将我们的结果转化为简单的算法程序,以构建具有精确、不精确或无界 RLT 松弛的二次方程程序实例。
{"title":"Polyhedral properties of RLT relaxations of nonconvex quadratic programs and their implications on exact relaxations","authors":"Yuzhou Qiu, E. Alper Yıldırım","doi":"10.1007/s10107-024-02070-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02070-7","url":null,"abstract":"<p>We study linear programming relaxations of nonconvex quadratic programs given by the reformulation–linearization technique (RLT), referred to as RLT relaxations. We investigate the relations between the polyhedral properties of the feasible regions of a quadratic program and its RLT relaxation. We establish various connections between recession directions, boundedness, and vertices of the two feasible regions. Using these properties, we present a complete description of the set of instances that admit an exact RLT relaxation. We then give a thorough discussion of how our results can be converted into simple algorithmic procedures to construct instances of quadratic programs with exact, inexact, or unbounded RLT relaxations.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"11 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140057690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On convergence of iterative thresholding algorithms to approximate sparse solution for composite nonconvex optimization 论复合非凸优化近似稀疏解的迭代阈值算法的收敛性
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-06 DOI: 10.1007/s10107-024-02068-1
Yaohua Hu, Xinlin Hu, Xiaoqi Yang

This paper aims to find an approximate true sparse solution of an underdetermined linear system. For this purpose, we propose two types of iterative thresholding algorithms with the continuation technique and the truncation technique respectively. We introduce a notion of limited shrinkage thresholding operator and apply it, together with the restricted isometry property, to show that the proposed algorithms converge to an approximate true sparse solution within a tolerance relevant to the noise level and the limited shrinkage magnitude. Applying the obtained results to nonconvex regularization problems with SCAD, MCP and (ell _p) penalty ((0le p le 1)) and utilizing the recovery bound theory, we establish the convergence of their proximal gradient algorithms to an approximate global solution of nonconvex regularization problems. The established results include the existing convergence theory for (ell _1) or (ell _0) regularization problems for finding a true sparse solution as special cases. Preliminary numerical results show that our proposed algorithms can find approximate true sparse solutions that are much better than stationary solutions that are found by using the standard proximal gradient algorithm.

本文旨在寻找未定线性系统的近似真稀疏解。为此,我们提出了两种迭代阈值算法,分别采用延续技术和截断技术。我们引入了有限收缩阈值算子的概念,并将其与受限等距特性一起应用,证明所提出的算法能在与噪声水平和有限收缩幅度相关的容差范围内收敛到近似真稀疏解。将得到的结果应用于具有 SCAD、MCP 和 (ell _p)惩罚((0le p le 1))的非凸正则化问题,并利用恢复约束理论,我们建立了它们的近似梯度算法对非凸正则化问题近似全局解的收敛性。所建立的结果包括现有的收敛理论,用于寻找真正稀疏解的(ell _1)或(ell _0)正则化问题。初步的数值结果表明,我们提出的算法可以找到近似的真稀疏解,比使用标准近似梯度算法找到的静态解要好得多。
{"title":"On convergence of iterative thresholding algorithms to approximate sparse solution for composite nonconvex optimization","authors":"Yaohua Hu, Xinlin Hu, Xiaoqi Yang","doi":"10.1007/s10107-024-02068-1","DOIUrl":"https://doi.org/10.1007/s10107-024-02068-1","url":null,"abstract":"<p>This paper aims to find an approximate true sparse solution of an underdetermined linear system. For this purpose, we propose two types of iterative thresholding algorithms with the continuation technique and the truncation technique respectively. We introduce a notion of limited shrinkage thresholding operator and apply it, together with the restricted isometry property, to show that the proposed algorithms converge to an approximate true sparse solution within a tolerance relevant to the noise level and the limited shrinkage magnitude. Applying the obtained results to nonconvex regularization problems with SCAD, MCP and <span>(ell _p)</span> penalty (<span>(0le p le 1)</span>) and utilizing the recovery bound theory, we establish the convergence of their proximal gradient algorithms to an approximate global solution of nonconvex regularization problems. The established results include the existing convergence theory for <span>(ell _1)</span> or <span>(ell _0)</span> regularization problems for finding a true sparse solution as special cases. Preliminary numerical results show that our proposed algorithms can find approximate true sparse solutions that are much better than stationary solutions that are found by using the standard proximal gradient algorithm.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"105 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140054988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Matroid-based TSP rounding for half-integral solutions 基于矩阵的 TSP 四舍五入半积分解法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-06 DOI: 10.1007/s10107-024-02065-4

Abstract

We show how to round any half-integral solution to the subtour-elimination relaxation for the TSP, while losing a less-than (-)  1.5 factor. Such a rounding algorithm was recently given by Karlin, Klein, and Oveis Gharan based on sampling from max-entropy distributions. We build on an approach of Haddadan and Newman to show how sampling from the matroid intersection polytope, combined with a novel use of max-entropy sampling, can give better guarantees.

摘要 我们展示了如何对 TSP 的子our-elimination 松弛的任何半积分解进行舍入,同时损失小于 1.5 倍。最近,Karlin、Klein 和 Oveis Gharan 基于最大熵分布的采样给出了这种舍入算法。我们以 Haddadan 和 Newman 的方法为基础,展示了如何从 matroid 交集多面体中采样,并结合最大熵采样的新用法,从而给出更好的保证。
{"title":"Matroid-based TSP rounding for half-integral solutions","authors":"","doi":"10.1007/s10107-024-02065-4","DOIUrl":"https://doi.org/10.1007/s10107-024-02065-4","url":null,"abstract":"<h3>Abstract</h3> <p>We show how to round any half-integral solution to the subtour-elimination relaxation for the TSP, while losing a less-than<span> <span>(-)</span> </span> 1.5 factor. Such a rounding algorithm was recently given by Karlin, Klein, and Oveis Gharan based on sampling from max-entropy distributions. We build on an approach of Haddadan and Newman to show how sampling from the matroid intersection polytope, combined with a novel use of max-entropy sampling, can give better guarantees.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"274 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140057732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiplicative auction algorithm for approximate maximum weight bipartite matching 近似最大权重双网匹配的乘法拍卖算法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-06 DOI: 10.1007/s10107-024-02066-3

Abstract

We present an auction algorithm using multiplicative instead of constant weight updates to compute a ((1-varepsilon )) -approximate maximum weight matching (MWM) in a bipartite graph with n vertices and m edges in time (O(mvarepsilon ^{-1})) , beating the running time of the fastest known approximation algorithm of Duan and Pettie [JACM ’14] that runs in (O(mvarepsilon ^{-1}log varepsilon ^{-1})) . Our algorithm is very simple and it can be extended to give a dynamic data structure that maintains a ((1-varepsilon )) -approximate maximum weight matching under (1) one-sided vertex deletions (with incident edges) and (2) one-sided vertex insertions (with incident edges sorted by weight) to the other side. The total time time used is (O(mvarepsilon ^{-1})) , where m is the sum of the number of initially existing and inserted edges.

摘要 我们提出了一种使用乘法而非恒定权重更新的拍卖算法,以计算在具有 n 个顶点和 m 条边的双向图中的((1-varepsilon ))近似最大权重匹配(MWM)。-(O(mvarepsilon ^{-1}))的时间内计算出具有 n 个顶点和 m 条边的双向图中的近似最大权重匹配(MWM)。打败了 Duan 和 Pettie [JACM '14] 的最快近似算法的运行时间(O(mvarepsilon ^{-1}log varepsilon ^{-1})) 。我们的算法非常简单,而且可以扩展到给出一个动态数据结构来维护一个((1-varepsilon ))-近似最大权重匹配。-近似最大权重匹配的动态数据结构。(1)单边顶点删除(带入射边)和(2)单边顶点插入(带按权重排序的入射边)到另一方。所用总时间为 (O(mvarepsilon ^{-1}))其中,m 是最初存在的和插入的边的数量之和。
{"title":"Multiplicative auction algorithm for approximate maximum weight bipartite matching","authors":"","doi":"10.1007/s10107-024-02066-3","DOIUrl":"https://doi.org/10.1007/s10107-024-02066-3","url":null,"abstract":"<h3>Abstract</h3> <p>We present an <em>auction algorithm</em> using multiplicative instead of constant weight updates to compute a <span> <span>((1-varepsilon ))</span> </span>-approximate maximum weight matching (MWM) in a bipartite graph with <em>n</em> vertices and <em>m</em> edges in time <span> <span>(O(mvarepsilon ^{-1}))</span> </span>, beating the running time of the fastest known approximation algorithm of Duan and Pettie [JACM ’14] that runs in <span> <span>(O(mvarepsilon ^{-1}log varepsilon ^{-1}))</span> </span>. Our algorithm is very simple and it can be extended to give a dynamic data structure that maintains a <span> <span>((1-varepsilon ))</span> </span>-approximate maximum weight matching under (1) one-sided vertex deletions (with incident edges) and (2) one-sided vertex insertions (with incident edges sorted by weight) to the other side. The total time time used is <span> <span>(O(mvarepsilon ^{-1}))</span> </span>, where <em>m</em> is the sum of the number of initially existing and inserted edges.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"55 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140054991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of smooth parametrizations on nonconvex optimization landscapes 平滑参数化对非凸优化景观的影响
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-04 DOI: 10.1007/s10107-024-02058-3
Eitan Levin, Joe Kileel, Nicolas Boumal

We develop new tools to study landscapes in nonconvex optimization. Given one optimization problem, we pair it with another by smoothly parametrizing the domain. This is either for practical purposes (e.g., to use smooth optimization algorithms with good guarantees) or for theoretical purposes (e.g., to reveal that the landscape satisfies a strict saddle property). In both cases, the central question is: how do the landscapes of the two problems relate? More precisely: how do desirable points such as local minima and critical points in one problem relate to those in the other problem? A key finding in this paper is that these relations are often determined by the parametrization itself, and are almost entirely independent of the cost function. Accordingly, we introduce a general framework to study parametrizations by their effect on landscapes. The framework enables us to obtain new guarantees for an array of problems, some of which were previously treated on a case-by-case basis in the literature. Applications include: optimizing low-rank matrices and tensors through factorizations; solving semidefinite programs via the Burer–Monteiro approach; training neural networks by optimizing their weights and biases; and quotienting out symmetries.

我们开发了研究非凸优化景观的新工具。给定一个优化问题,我们通过平滑参数化域将其与另一个问题配对。这要么是出于实用目的(例如,使用具有良好保证的平滑优化算法),要么是出于理论目的(例如,揭示景观满足严格的鞍属性)。在这两种情况下,核心问题都是:这两个问题的景观有何关联?更准确地说:一个问题中的理想点(如局部极小值和临界点)与另一个问题中的理想点有何关系?本文的一个重要发现是,这些关系通常由参数化本身决定,几乎完全独立于成本函数。因此,我们引入了一个通用框架,通过参数化对景观的影响来研究参数化。该框架使我们能够为一系列问题获得新的保证,其中一些问题以前在文献中是逐个处理的。其应用包括:通过因式分解优化低秩矩阵和张量;通过伯勒-蒙泰罗方法求解半定式程序;通过优化权重和偏置训练神经网络;以及对称性商化。
{"title":"The effect of smooth parametrizations on nonconvex optimization landscapes","authors":"Eitan Levin, Joe Kileel, Nicolas Boumal","doi":"10.1007/s10107-024-02058-3","DOIUrl":"https://doi.org/10.1007/s10107-024-02058-3","url":null,"abstract":"<p>We develop new tools to study landscapes in nonconvex optimization. Given one optimization problem, we pair it with another by smoothly parametrizing the domain. This is either for practical purposes (e.g., to use smooth optimization algorithms with good guarantees) or for theoretical purposes (e.g., to reveal that the landscape satisfies a strict saddle property). In both cases, the central question is: how do the landscapes of the two problems relate? More precisely: how do desirable points such as local minima and critical points in one problem relate to those in the other problem? A key finding in this paper is that these relations are often determined by the parametrization itself, and are almost entirely independent of the cost function. Accordingly, we introduce a general framework to study parametrizations by their effect on landscapes. The framework enables us to obtain new guarantees for an array of problems, some of which were previously treated on a case-by-case basis in the literature. Applications include: optimizing low-rank matrices and tensors through factorizations; solving semidefinite programs via the Burer–Monteiro approach; training neural networks by optimizing their weights and biases; and quotienting out symmetries.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"43 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140036042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hessian barrier algorithms for non-convex conic optimization 非凸圆锥优化的黑森障碍算法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-03-04 DOI: 10.1007/s10107-024-02062-7
Pavel Dvurechensky, Mathias Staudigl

A key problem in mathematical imaging, signal processing and computational statistics is the minimization of non-convex objective functions that may be non-differentiable at the relative boundary of the feasible set. This paper proposes a new family of first- and second-order interior-point methods for non-convex optimization problems with linear and conic constraints, combining logarithmically homogeneous barriers with quadratic and cubic regularization respectively. Our approach is based on a potential-reduction mechanism and, under the Lipschitz continuity of the corresponding derivative with respect to the local barrier-induced norm, attains a suitably defined class of approximate first- or second-order KKT points with worst-case iteration complexity (O(varepsilon ^{-2})) (first-order) and (O(varepsilon ^{-3/2})) (second-order), respectively. Based on these findings, we develop new path-following schemes attaining the same complexity, modulo adjusting constants. These complexity bounds are known to be optimal in the unconstrained case, and our work shows that they are upper bounds in the case with complicated constraints as well. To the best of our knowledge, this work is the first which achieves these worst-case complexity bounds under such weak conditions for general conic constrained non-convex optimization problems.

数学成像、信号处理和计算统计中的一个关键问题是非凸目标函数的最小化,这些目标函数在可行集的相对边界处可能是不可分的。本文针对具有线性和圆锥约束的非凸优化问题提出了一系列新的一阶和二阶内点法,分别结合了对数同质壁垒和二次方与三次方正则化。我们的方法基于势还原机制,在相对于局部障碍诱导规范的相应导数的 Lipschitz 连续性条件下,可以获得一类适当定义的近似一阶或二阶 KKT 点,其最坏情况迭代复杂度分别为 (O(varepsilon ^{-2}))(一阶)和 (O(varepsilon ^{-3/2}))(二阶)。在这些发现的基础上,我们开发了新的路径跟踪方案,在调整常数的基础上达到了相同的复杂度。众所周知,这些复杂度边界在无约束的情况下是最优的,而我们的研究表明,它们在有复杂约束的情况下也是上限。据我们所知,这项研究是第一个在如此弱的条件下,针对一般圆锥约束非凸优化问题实现这些最坏情况复杂度约束的研究。
{"title":"Hessian barrier algorithms for non-convex conic optimization","authors":"Pavel Dvurechensky, Mathias Staudigl","doi":"10.1007/s10107-024-02062-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02062-7","url":null,"abstract":"<p>A key problem in mathematical imaging, signal processing and computational statistics is the minimization of non-convex objective functions that may be non-differentiable at the relative boundary of the feasible set. This paper proposes a new family of first- and second-order interior-point methods for non-convex optimization problems with linear and conic constraints, combining logarithmically homogeneous barriers with quadratic and cubic regularization respectively. Our approach is based on a potential-reduction mechanism and, under the Lipschitz continuity of the corresponding derivative with respect to the local barrier-induced norm, attains a suitably defined class of approximate first- or second-order KKT points with worst-case iteration complexity <span>(O(varepsilon ^{-2}))</span> (first-order) and <span>(O(varepsilon ^{-3/2}))</span> (second-order), respectively. Based on these findings, we develop new path-following schemes attaining the same complexity, modulo adjusting constants. These complexity bounds are known to be optimal in the unconstrained case, and our work shows that they are upper bounds in the case with complicated constraints as well. To the best of our knowledge, this work is the first which achieves these worst-case complexity bounds under such weak conditions for general conic constrained non-convex optimization problems.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"4 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140036039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mathematical Programming
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1