首页 > 最新文献

Mathematical Programming最新文献

英文 中文
Automated tight Lyapunov analysis for first-order methods 一阶方法的自动紧 Lyapunov 分析
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-26 DOI: 10.1007/s10107-024-02061-8
Manu Upadhyaya, Sebastian Banert, Adrien B. Taylor, Pontus Giselsson

We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider (i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, (ii) first-order methods that can be written as a linear system on state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and (iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality gap convergence in the Chambolle–Pock method when the linear operator is the identity mapping.

我们提出了一种方法,用于确定用于解决凸优化问题的各种一阶方法的二次李雅普诺夫不等式的存在性。特别是,我们考虑了 (i) 具有(可能是强)凸和可能是平滑函数成分的有限总和形式的优化问题类别,(ii) 可以写成状态空间形式上的线性系统的一阶方法,该系统与目标函数的函数成分的子差分反馈互联,(iii) 可以用来得出收敛结论的二次李雅普诺夫不等式。我们提出了在预定义的 Lyapunov 不等式类别中存在二次 Lyapunov 不等式的必要条件和充分条件,这相当于求解一个小型半定式程序。我们在几个符合该框架的一阶方法上展示了我们的方法论。最值得注意的是,当线性算子是身份映射时,我们的方法允许我们大大扩展 Chambolle-Pock 方法中允许对偶差距收敛的参数选择区域。
{"title":"Automated tight Lyapunov analysis for first-order methods","authors":"Manu Upadhyaya, Sebastian Banert, Adrien B. Taylor, Pontus Giselsson","doi":"10.1007/s10107-024-02061-8","DOIUrl":"https://doi.org/10.1007/s10107-024-02061-8","url":null,"abstract":"<p>We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider (i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, (ii) first-order methods that can be written as a linear system on state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and (iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality gap convergence in the Chambolle–Pock method when the linear operator is the identity mapping.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"17 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Submodular maximization and its generalization through an intersection cut lens 次模态最大化及其通过交叉切分透镜的广义化
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-19 DOI: 10.1007/s10107-024-02059-2
Liding Xu, Leo Liberti

We study a mixed-integer set (mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t}) arising in the submodular maximization problem, where f is a submodular function defined over ({0,1}^n). We use intersection cuts to tighten a polyhedral outer approximation of (mathcal {S}). We construct a continuous extension (bar{textsf{F}}_f) of f, which is convex and defined over the entire space (mathbb {R}^n). We show that the epigraph ({{,textrm{epi},}}(bar{textsf{F}}_f)) of (bar{textsf{F}}_f) is an (mathcal {S})-free set, and characterize maximal (mathcal {S})-free sets containing ({{,textrm{epi},}}(bar{textsf{F}}_f)). We propose a hybrid discrete Newton algorithm to compute an intersection cut efficiently and exactly. Our results are generalized to the hypograph or the superlevel set of a submodular-supermodular function over the Boolean hypercube, which is a model for discrete nonconvexity. A consequence of these results is intersection cuts for Boolean multilinear constraints. We evaluate our techniques on max cut, pseudo Boolean maximization, and Bayesian D-optimal design problems within a MIP solver.

我们研究了子模最大化问题中出现的混合整数集合(mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t}) ,其中 f 是定义在 ({0,1}^n) 上的子模函数。我们使用交割来收紧 (mathcal {S}) 的多面体外近似。我们构建了 f 的连续扩展 (bar{textsf{F}}_f),它是凸的,并且定义在整个空间 (mathbb {R}^n) 上。我们证明了(bar{textsf{F}}_f)的外延({{textrm{epi},}}(bar{textsf{F}}_f)是一个(mathcal {S})-free集合、并描述包含 ({{textrm{epi},}}(bar{textsf{F}}_f))的最大 (mathcal {S})-无集合的特征。我们提出了一种混合离散牛顿算法来高效精确地计算交集切分。我们的结果被推广到布尔超立方体上的亚模态-超模态函数的超图或超级集,这是离散非凸模型。这些结果的一个结果就是布尔多线性约束的交割。我们在 MIP 求解器中对最大切割、伪布尔最大化和贝叶斯 D 优化设计问题评估了我们的技术。
{"title":"Submodular maximization and its generalization through an intersection cut lens","authors":"Liding Xu, Leo Liberti","doi":"10.1007/s10107-024-02059-2","DOIUrl":"https://doi.org/10.1007/s10107-024-02059-2","url":null,"abstract":"<p>We study a mixed-integer set <span>(mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t})</span> arising in the submodular maximization problem, where <i>f</i> is a submodular function defined over <span>({0,1}^n)</span>. We use intersection cuts to tighten a polyhedral outer approximation of <span>(mathcal {S})</span>. We construct a continuous extension <span>(bar{textsf{F}}_f)</span> of <i>f</i>, which is convex and defined over the entire space <span>(mathbb {R}^n)</span>. We show that the epigraph <span>({{,textrm{epi},}}(bar{textsf{F}}_f))</span> of <span>(bar{textsf{F}}_f)</span> is an <span>(mathcal {S})</span>-free set, and characterize maximal <span>(mathcal {S})</span>-free sets containing <span>({{,textrm{epi},}}(bar{textsf{F}}_f))</span>. We propose a hybrid discrete Newton algorithm to compute an intersection cut efficiently and exactly. Our results are generalized to the hypograph or the superlevel set of a submodular-supermodular function over the Boolean hypercube, which is a model for discrete nonconvexity. A consequence of these results is intersection cuts for Boolean multilinear constraints. We evaluate our techniques on max cut, pseudo Boolean maximization, and Bayesian D-optimal design problems within a MIP solver.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"111 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140886471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deciding whether a lattice has an orthonormal basis is in co-NP 决定一个网格是否有正交基础属于共 NP
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-19 DOI: 10.1007/s10107-023-02052-1
Christoph Hunkenschröder

We show that the problem of deciding whether a given Euclidean lattice L has an orthonormal basis is in NP and co-NP. Since this is equivalent to saying that L is isomorphic to the standard integer lattice, this problem is a special form of the lattice isomorphism problem, which is known to be in the complexity class SZK. We achieve this by deploying a result on characteristic vectors by Elkies that gained attention in the context of 4-manifolds and Seiberg-Witten equations, but seems rather unnoticed in the algorithmic lattice community. On the way, we also show that for a given Gram matrix (G in mathbb {Q}^{n times n}), we can efficiently find a rational lattice that is embedded in at most four times the initial dimension n, i.e. a rational matrix (B in mathbb {Q}^{4n times n}) such that (B^intercal B = G).

我们证明,判断给定欧几里得网格 L 是否具有正交基础的问题属于 NP 和 co-NP。由于这等同于说 L 与标准整数格同构,因此这个问题是格同构问题的一种特殊形式,而格同构问题已知属于复杂度类 SZK。我们利用埃尔基斯(Elkies)关于特征向量的一个结果来实现这个问题,这个结果在 4-芒形和塞伯格-维滕方程的背景下获得了关注,但在算法晶格领域却似乎没有引起注意。在此过程中,我们还证明了对于一个给定的克矩阵(G in mathbb {Q}^{ntimes n}),我们可以高效地找到一个嵌入在最多四倍初始维度 n 中的有理网格,即一个有理矩阵(B in mathbb {Q}^{4ntimes n}),使得 (B^intercal B = G).
{"title":"Deciding whether a lattice has an orthonormal basis is in co-NP","authors":"Christoph Hunkenschröder","doi":"10.1007/s10107-023-02052-1","DOIUrl":"https://doi.org/10.1007/s10107-023-02052-1","url":null,"abstract":"<p>We show that the problem of deciding whether a given Euclidean lattice <i>L</i> has an orthonormal basis is in NP and co-NP. Since this is equivalent to saying that <i>L</i> is isomorphic to the standard integer lattice, this problem is a special form of the lattice isomorphism problem, which is known to be in the complexity class SZK. We achieve this by deploying a result on <i>characteristic vectors</i> by Elkies that gained attention in the context of 4-manifolds and Seiberg-Witten equations, but seems rather unnoticed in the algorithmic lattice community. On the way, we also show that for a given Gram matrix <span>(G in mathbb {Q}^{n times n})</span>, we can efficiently find a rational lattice that is embedded in at most four times the initial dimension <i>n</i>, i.e. a rational matrix <span>(B in mathbb {Q}^{4n times n})</span> such that <span>(B^intercal B = G)</span>.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"2013 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convex hulls of monomial curves, and a sparse positivstellensatz 单项式曲线的凸壳和稀疏正定定理
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-16 DOI: 10.1007/s10107-024-02060-9
Gennadiy Averkov, Claus Scheiderer

Consider the closed convex hull K of a monomial curve given parametrically as ((t^{m_1},ldots ,t^{m_n})), with the parameter t varying in an interval I. We show, using constructive arguments, that K admits a lifted semidefinite description by (mathcal {O}(d)) linear matrix inequalities (LMIs), each of size (leftlfloor frac{n}{2} rightrfloor +1), where (d= max {m_1,ldots ,m_n}) is the degree of the curve. On the dual side, we show that if a univariate polynomial p(t) of degree d with at most (2k+1) monomials is non-negative on ({mathbb {R}}_+), then p admits a representation (p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k}), where the polynomials (sigma _0,ldots ,sigma _{d-k}) are sums of squares and (deg (sigma _i) le 2k). The latter is a univariate positivstellensatz for sparse polynomials, with non-negativity of p being certified by sos polynomials whose degree only depends on the sparsity of p. Our results fit into the general attempt of formulating polynomial optimization problems as semidefinite problems with LMIs of small size. Such small-size descriptions are much more tractable from a computational viewpoint.

考虑参数为 ((t^{m_1},ldots ,t^{m_n}))的单项式曲线的闭凸壳 K,参数 t 在区间 I 中变化。我们利用构造论证证明,K 可以通过线性矩阵不等式(LMIs)进行提升半定量描述,每个线性矩阵不等式的大小为 (leftlfloor frac{n}{2} rightrfloor +1) ,其中 (d= max {m_1,ldots ,m_n/}/)是曲线的阶数。在对偶方面,我们证明了如果阶数为 d 的单变量多项式 p(t) 在 ({mathbb {R}}_+) 上是非负的,且其单项式最多有(2k+1) 个、then p admits a representation (p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k}), where the polynomials (sigma _0,ldots ,sigma _{d-k}) are sums of squares and (deg (sigma _i) le 2k).后者是稀疏多项式的单变量正弦定理,p 的非负性由 sos 多项式证明,而 sos 多项式的度数只取决于 p 的稀疏性。我们的结果符合将多项式优化问题表述为具有小尺寸 LMI 的半有限问题的一般尝试。从计算的角度来看,这种小规模的描述要容易得多。
{"title":"Convex hulls of monomial curves, and a sparse positivstellensatz","authors":"Gennadiy Averkov, Claus Scheiderer","doi":"10.1007/s10107-024-02060-9","DOIUrl":"https://doi.org/10.1007/s10107-024-02060-9","url":null,"abstract":"<p>Consider the closed convex hull <i>K</i> of a monomial curve given parametrically as <span>((t^{m_1},ldots ,t^{m_n}))</span>, with the parameter <i>t</i> varying in an interval <i>I</i>. We show, using constructive arguments, that <i>K</i> admits a lifted semidefinite description by <span>(mathcal {O}(d))</span> linear matrix inequalities (LMIs), each of size <span>(leftlfloor frac{n}{2} rightrfloor +1)</span>, where <span>(d= max {m_1,ldots ,m_n})</span> is the degree of the curve. On the dual side, we show that if a univariate polynomial <i>p</i>(<i>t</i>) of degree <i>d</i> with at most <span>(2k+1)</span> monomials is non-negative on <span>({mathbb {R}}_+)</span>, then <i>p</i> admits a representation <span>(p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k})</span>, where the polynomials <span>(sigma _0,ldots ,sigma _{d-k})</span> are sums of squares and <span>(deg (sigma _i) le 2k)</span>. The latter is a univariate positivstellensatz for sparse polynomials, with non-negativity of <i>p</i> being certified by sos polynomials whose degree only depends on the sparsity of <i>p</i>. Our results fit into the general attempt of formulating polynomial optimization problems as semidefinite problems with LMIs of small size. Such small-size descriptions are much more tractable from a computational viewpoint.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"10 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140886077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiglobal exponential stability of the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm for constrained optimization 约束优化离散时 Arrow-Hurwicz-Uzawa primal-dual 算法的半全局指数稳定性
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-12 DOI: 10.1007/s10107-023-02051-2
Michelangelo Bin, Ivano Notarnicola, Thomas Parisini

We consider the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm, also known as the first-order Lagrangian method, for constrained optimization problems involving a smooth strongly convex cost and smooth convex constraints. We prove that, for every given compact set of initial conditions, there always exists a sufficiently small stepsize guaranteeing exponential stability of the optimal primal-dual solution of the problem with a domain of attraction including the initialization set. Inspired by the analysis of nonlinear oscillators, the stability proof is based on a non-quadratic Lyapunov function including a nonlinear cross term.

我们考虑了离散时间 Arrow-Hurwicz-Uzawa 原始二元算法(也称为一阶拉格朗日法),该算法适用于涉及光滑强凸成本和光滑凸约束的约束优化问题。我们证明,对于每个给定的紧凑初始条件集,总是存在一个足够小的步长,以保证问题的最优初等二元解的指数稳定性,其吸引域包括初始化集。受非线性振荡器分析的启发,稳定性证明基于包含非线性交叉项的非二次方 Lyapunov 函数。
{"title":"Semiglobal exponential stability of the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm for constrained optimization","authors":"Michelangelo Bin, Ivano Notarnicola, Thomas Parisini","doi":"10.1007/s10107-023-02051-2","DOIUrl":"https://doi.org/10.1007/s10107-023-02051-2","url":null,"abstract":"<p>We consider the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm, also known as the first-order Lagrangian method, for constrained optimization problems involving a smooth strongly convex cost and smooth convex constraints. We prove that, for every given compact set of initial conditions, there always exists a sufficiently small stepsize guaranteeing exponential stability of the optimal primal-dual solution of the problem with a domain of attraction including the initialization set. Inspired by the analysis of nonlinear oscillators, the stability proof is based on a non-quadratic Lyapunov function including a nonlinear cross term.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"12 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139760876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general framework for multi-marginal optimal transport 多边际优化运输的总体框架
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-05 DOI: 10.1007/s10107-023-02032-5
Brendan Pass, Adolfo Vargas-Jiménez

We establish a general condition on the cost function to obtain uniqueness and Monge solutions in the multi-marginal optimal transport problem, under the assumption that a given collection of the marginals are absolutely continuous with respect to local coordinates. When only the first marginal is assumed to be absolutely continuous, our condition is equivalent to the twist on splitting sets condition found in Kim and Pass (SIAM J Math Anal 46:1538–1550, 2014; SIAM J Math Anal 46:1538–1550, 2014). In addition, it is satisfied by the special cost functions in our earlier work (Pass and Vargas-Jiménez in SIAM J Math Anal 53:4386–4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory. arXiv:2104.09488, 2021), when absolute continuity is imposed on certain other collections of marginals. We also present several new examples of cost functions which violate the twist on splitting sets condition but satisfy the new condition introduced here, including a class of examples arising in robust risk management problems; we therefore obtain Monge solution and uniqueness results for these cost functions, under regularity conditions on an appropriate subset of the marginals.

我们建立了一个关于成本函数的一般条件,以便在多边际最优运输问题中获得唯一性和蒙日解,前提是给定的边际集合相对于局部坐标是绝对连续的。当只假设第一个边际绝对连续时,我们的条件等同于 Kim 和 Pass(SIAM J Math Anal 46:1538-1550, 2014;SIAM J Math Anal 46:1538-1550, 2014)中发现的关于分裂集的扭曲条件。此外,当对某些其他边际集合施加绝对连续性时,我们早期工作(Pass 和 Vargas-Jiménez in SIAM J Math Anal 53:4386-4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory.我们还提出了几个成本函数的新例子,它们违反了分裂集上的扭曲条件,但满足了这里引入的新条件,其中包括稳健风险管理问题中出现的一类例子;因此,在边际的适当子集上的正则性条件下,我们得到了这些成本函数的 Monge 解和唯一性结果。
{"title":"A general framework for multi-marginal optimal transport","authors":"Brendan Pass, Adolfo Vargas-Jiménez","doi":"10.1007/s10107-023-02032-5","DOIUrl":"https://doi.org/10.1007/s10107-023-02032-5","url":null,"abstract":"<p>We establish a general condition on the cost function to obtain uniqueness and Monge solutions in the multi-marginal optimal transport problem, under the assumption that a given collection of the marginals are absolutely continuous with respect to local coordinates. When only the first marginal is assumed to be absolutely continuous, our condition is equivalent to the twist on splitting sets condition found in Kim and Pass (SIAM J Math Anal 46:1538–1550, 2014; SIAM J Math Anal 46:1538–1550, 2014). In addition, it is satisfied by the special cost functions in our earlier work (Pass and Vargas-Jiménez in SIAM J Math Anal 53:4386–4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory. arXiv:2104.09488, 2021), when absolute continuity is imposed on certain other collections of marginals. We also present several new examples of cost functions which violate the twist on splitting sets condition but satisfy the new condition introduced here, including a class of examples arising in robust risk management problems; we therefore obtain Monge solution and uniqueness results for these cost functions, under regularity conditions on an appropriate subset of the marginals.\u0000</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"9 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139760932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frank–Wolfe-type methods for a class of nonconvex inequality-constrained problems 一类非凸不等式约束问题的 Frank-Wolfe 型方法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-03 DOI: 10.1007/s10107-023-02055-y

Abstract

The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a fixed compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single difference-of-convex function, based on new generalized linear-optimization oracles (LO). We show that these LOs can be computed efficiently with closed-form solutions in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is completely different from those existing works in the literature on FW-type methods that deal with fixed feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an away-step oracle to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant.

摘要 弗兰克-沃尔夫(Frank-Wolfe,FW)方法实现了在固定紧凑凸集上最小化目标函数线性近似值的高效线性指标,最近在优化和机器学习文献中受到广泛关注。在本文中,我们基于新的广义线性优化神谕(LO),提出了一种新的 FW 型方法,用于在定义为单个凸函数差的水平集的紧凑集上最小化平滑函数。我们证明,在压缩传感和机器学习中出现的一些重要优化模型中,可以通过闭式解高效计算这些 LO。此外,在温和严格的可行性条件下,我们建立了非凸 FW 型方法的后续收敛性。由于我们的广义 LO 的可行区域通常会随着迭代而变化,因此我们的收敛性分析与现有文献中处理子问题间固定可行区域的 FW 型方法完全不同。最后,受用于加速凸问题 FW 型方法的远离步骤的启发,我们进一步设计了一个远离步骤神谕来补充我们的非凸 FW 型方法,并建立了这一变体的后续收敛性。我们给出了使用标准数据集的矩阵完成问题的数值结果,以证明所提出的 FW 型方法及其远离步骤变体的效率。
{"title":"Frank–Wolfe-type methods for a class of nonconvex inequality-constrained problems","authors":"","doi":"10.1007/s10107-023-02055-y","DOIUrl":"https://doi.org/10.1007/s10107-023-02055-y","url":null,"abstract":"<h3>Abstract</h3> <p>The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a <em>fixed</em> compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single <em>difference-of-convex</em> function, based on new <em>generalized</em> linear-optimization oracles (LO). We show that these LOs can be computed efficiently with <em>closed-form solutions</em> in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is <em>completely different</em> from those existing works in the literature on FW-type methods that deal with <em>fixed</em> feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an <em>away-step oracle</em> to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant. </p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"10 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing tractable piecewise affine policies for multi-stage adjustable robust optimization 为多阶段可调鲁棒优化设计可操作的片断仿射策略
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-02-03 DOI: 10.1007/s10107-023-02053-0
Simon Thomä, Grit Walther, Maximilian Schiffer

We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.

我们研究了具有非负右侧不确定性的多阶段可调鲁棒优化(ARO)问题的片断仿射策略。首先,我们构建了新的支配性不确定性集,并展示了当不确定性被这些新的不确定性集取代时,如何用线性程序高效地解决多阶段 ARO 问题。然后,我们演示了如何将这一替代问题的解决方案转化为原始问题的解决方案。通过仔细选择支配集,我们证明了我们的策略具有很强的近似边界,并将两阶段问题变体的许多已知边界扩展到了多阶段问题变体。此外,据我们所知,新的界限是首次针对一般多阶段 ARO 问题给出的界限。我们将我们的策略与文献中的其他策略进行了广泛比较,并证明了相对性能保证。在两个数值实验中,我们确定了不同策略的优势和劣势,并提出了有效的调整措施,以解决我们策略中最关键的劣势。总之,实验表明,我们的片断仿射策略的计算速度比仿射策略快几个数量级,同时通常能获得相当甚至更好的结果。
{"title":"Designing tractable piecewise affine policies for multi-stage adjustable robust optimization","authors":"Simon Thomä, Grit Walther, Maximilian Schiffer","doi":"10.1007/s10107-023-02053-0","DOIUrl":"https://doi.org/10.1007/s10107-023-02053-0","url":null,"abstract":"<p>We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"12 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A constant-factor approximation for generalized malleable scheduling under $$M ^{natural }$$ -concave processing speeds 在 $$M ^{natural }$$ -凹处理速度条件下的广义可延展调度的常系数近似值
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-01-29 DOI: 10.1007/s10107-023-02054-z
Dimitris Fotakis, Jannik Matuschke, Orestis Papadigenopoulos

In generalized malleable scheduling, jobs can be allocated and processed simultaneously on multiple machines so as to reduce the overall makespan of the schedule. The required processing time for each job is determined by the joint processing speed of the allocated machines. We study the case that processing speeds are job-dependent (M ^{natural })-concave functions and provide a constant-factor approximation for this setting, significantly expanding the realm of functions for which such an approximation is possible. Further, we explore the connection between malleable scheduling and the problem of fairly allocating items to a set of agents with distinct utility functions, devising a black-box reduction that allows to obtain resource-augmented approximation algorithms for the latter.

在广义延展性调度中,作业可在多台机器上同时分配和处理,以减少调度的总体时间跨度。每个作业所需的处理时间由所分配机器的联合处理速度决定。我们研究了处理速度是与作业相关的 (M ^{natural })-concave 函数的情况,并为这种情况提供了一个常系数近似值,从而大大扩展了这种近似值可能适用的函数领域。此外,我们还探讨了可变调度与向一组具有不同效用函数的代理公平分配物品问题之间的联系,并设计了一种黑箱还原方法,从而为后者获得了资源增量近似算法。
{"title":"A constant-factor approximation for generalized malleable scheduling under $$M ^{natural }$$ -concave processing speeds","authors":"Dimitris Fotakis, Jannik Matuschke, Orestis Papadigenopoulos","doi":"10.1007/s10107-023-02054-z","DOIUrl":"https://doi.org/10.1007/s10107-023-02054-z","url":null,"abstract":"<p>In generalized malleable scheduling, jobs can be allocated and processed simultaneously on multiple machines so as to reduce the overall makespan of the schedule. The required processing time for each job is determined by the joint processing speed of the allocated machines. We study the case that processing speeds are job-dependent <span>(M ^{natural })</span>-concave functions and provide a constant-factor approximation for this setting, significantly expanding the realm of functions for which such an approximation is possible. Further, we explore the connection between malleable scheduling and the problem of fairly allocating items to a set of agents with distinct utility functions, devising a black-box reduction that allows to obtain resource-augmented approximation algorithms for the latter.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"8 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139644974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adjustability in robust linear optimization 稳健线性优化中的可调整性
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-01-27 DOI: 10.1007/s10107-023-02049-w

Abstract

We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.

摘要 我们研究了可调整性的概念--两类动态稳健优化公式之间目标值的差异:一类是在不确定性实现之前做出(静态)决策,另一类是在做出(可调整的)决策之前解决不确定性。这种差异反映了信息和决策时机在不确定条件下优化的价值,并与其他几个概念相关,如稳健优化中决策规则的最优性。我们基于线性目标、线性约束和固定追索权的稳健优化问题的输入数据,建立了一个量化可调整性的理论框架。我们只做了很少的额外假设。特别是,我们没有假设约束分离性或参数非负性,而这些在研究可调整性的文献中是常见的。这使得我们可以研究一些重要但以前未充分研究的问题,如带有相等约束条件的公式,以及同时带有上限和下限约束条件的问题。在发现静态问题和完全可调整问题的重构之间存在有趣联系的基础上,我们的分析给出了当不确定集合为多面体时,可调整性为零的必要条件和充分条件--以替代定理的形式。基于这一尖锐的表征,我们提供了两个高效的混合整数优化公式来验证零可调性。然后,我们开发了一种构造性方法来量化不确定性集为一般时的可调整性,并由此产生了一种高效、严密的多时间算法来约束可调整性。我们通过理论和数值分析证明了这种算法的高效性和严密性。
{"title":"Adjustability in robust linear optimization","authors":"","doi":"10.1007/s10107-023-02049-w","DOIUrl":"https://doi.org/10.1007/s10107-023-02049-w","url":null,"abstract":"<h3>Abstract</h3> <p>We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"32 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139583987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mathematical Programming
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1