Pub Date : 2024-02-26DOI: 10.1007/s10107-024-02061-8
Manu Upadhyaya, Sebastian Banert, Adrien B. Taylor, Pontus Giselsson
We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider (i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, (ii) first-order methods that can be written as a linear system on state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and (iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality gap convergence in the Chambolle–Pock method when the linear operator is the identity mapping.
{"title":"Automated tight Lyapunov analysis for first-order methods","authors":"Manu Upadhyaya, Sebastian Banert, Adrien B. Taylor, Pontus Giselsson","doi":"10.1007/s10107-024-02061-8","DOIUrl":"https://doi.org/10.1007/s10107-024-02061-8","url":null,"abstract":"<p>We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider (i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, (ii) first-order methods that can be written as a linear system on state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and (iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality gap convergence in the Chambolle–Pock method when the linear operator is the identity mapping.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"17 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-19DOI: 10.1007/s10107-024-02059-2
Liding Xu, Leo Liberti
We study a mixed-integer set (mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t}) arising in the submodular maximization problem, where f is a submodular function defined over ({0,1}^n). We use intersection cuts to tighten a polyhedral outer approximation of (mathcal {S}). We construct a continuous extension (bar{textsf{F}}_f) of f, which is convex and defined over the entire space (mathbb {R}^n). We show that the epigraph ({{,textrm{epi},}}(bar{textsf{F}}_f)) of (bar{textsf{F}}_f) is an (mathcal {S})-free set, and characterize maximal (mathcal {S})-free sets containing ({{,textrm{epi},}}(bar{textsf{F}}_f)). We propose a hybrid discrete Newton algorithm to compute an intersection cut efficiently and exactly. Our results are generalized to the hypograph or the superlevel set of a submodular-supermodular function over the Boolean hypercube, which is a model for discrete nonconvexity. A consequence of these results is intersection cuts for Boolean multilinear constraints. We evaluate our techniques on max cut, pseudo Boolean maximization, and Bayesian D-optimal design problems within a MIP solver.
我们研究了子模最大化问题中出现的混合整数集合(mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t}) ,其中 f 是定义在 ({0,1}^n) 上的子模函数。我们使用交割来收紧 (mathcal {S}) 的多面体外近似。我们构建了 f 的连续扩展 (bar{textsf{F}}_f),它是凸的,并且定义在整个空间 (mathbb {R}^n) 上。我们证明了(bar{textsf{F}}_f)的外延({{textrm{epi},}}(bar{textsf{F}}_f)是一个(mathcal {S})-free集合、并描述包含 ({{textrm{epi},}}(bar{textsf{F}}_f))的最大 (mathcal {S})-无集合的特征。我们提出了一种混合离散牛顿算法来高效精确地计算交集切分。我们的结果被推广到布尔超立方体上的亚模态-超模态函数的超图或超级集,这是离散非凸模型。这些结果的一个结果就是布尔多线性约束的交割。我们在 MIP 求解器中对最大切割、伪布尔最大化和贝叶斯 D 优化设计问题评估了我们的技术。
{"title":"Submodular maximization and its generalization through an intersection cut lens","authors":"Liding Xu, Leo Liberti","doi":"10.1007/s10107-024-02059-2","DOIUrl":"https://doi.org/10.1007/s10107-024-02059-2","url":null,"abstract":"<p>We study a mixed-integer set <span>(mathcal {S}:={(x,t) in {0,1}^n times mathbb {R}: f(x) ge t})</span> arising in the submodular maximization problem, where <i>f</i> is a submodular function defined over <span>({0,1}^n)</span>. We use intersection cuts to tighten a polyhedral outer approximation of <span>(mathcal {S})</span>. We construct a continuous extension <span>(bar{textsf{F}}_f)</span> of <i>f</i>, which is convex and defined over the entire space <span>(mathbb {R}^n)</span>. We show that the epigraph <span>({{,textrm{epi},}}(bar{textsf{F}}_f))</span> of <span>(bar{textsf{F}}_f)</span> is an <span>(mathcal {S})</span>-free set, and characterize maximal <span>(mathcal {S})</span>-free sets containing <span>({{,textrm{epi},}}(bar{textsf{F}}_f))</span>. We propose a hybrid discrete Newton algorithm to compute an intersection cut efficiently and exactly. Our results are generalized to the hypograph or the superlevel set of a submodular-supermodular function over the Boolean hypercube, which is a model for discrete nonconvexity. A consequence of these results is intersection cuts for Boolean multilinear constraints. We evaluate our techniques on max cut, pseudo Boolean maximization, and Bayesian D-optimal design problems within a MIP solver.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"111 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140886471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-19DOI: 10.1007/s10107-023-02052-1
Christoph Hunkenschröder
We show that the problem of deciding whether a given Euclidean lattice L has an orthonormal basis is in NP and co-NP. Since this is equivalent to saying that L is isomorphic to the standard integer lattice, this problem is a special form of the lattice isomorphism problem, which is known to be in the complexity class SZK. We achieve this by deploying a result on characteristic vectors by Elkies that gained attention in the context of 4-manifolds and Seiberg-Witten equations, but seems rather unnoticed in the algorithmic lattice community. On the way, we also show that for a given Gram matrix (G in mathbb {Q}^{n times n}), we can efficiently find a rational lattice that is embedded in at most four times the initial dimension n, i.e. a rational matrix (B in mathbb {Q}^{4n times n}) such that (B^intercal B = G).
我们证明,判断给定欧几里得网格 L 是否具有正交基础的问题属于 NP 和 co-NP。由于这等同于说 L 与标准整数格同构,因此这个问题是格同构问题的一种特殊形式,而格同构问题已知属于复杂度类 SZK。我们利用埃尔基斯(Elkies)关于特征向量的一个结果来实现这个问题,这个结果在 4-芒形和塞伯格-维滕方程的背景下获得了关注,但在算法晶格领域却似乎没有引起注意。在此过程中,我们还证明了对于一个给定的克矩阵(G in mathbb {Q}^{ntimes n}),我们可以高效地找到一个嵌入在最多四倍初始维度 n 中的有理网格,即一个有理矩阵(B in mathbb {Q}^{4ntimes n}),使得 (B^intercal B = G).
{"title":"Deciding whether a lattice has an orthonormal basis is in co-NP","authors":"Christoph Hunkenschröder","doi":"10.1007/s10107-023-02052-1","DOIUrl":"https://doi.org/10.1007/s10107-023-02052-1","url":null,"abstract":"<p>We show that the problem of deciding whether a given Euclidean lattice <i>L</i> has an orthonormal basis is in NP and co-NP. Since this is equivalent to saying that <i>L</i> is isomorphic to the standard integer lattice, this problem is a special form of the lattice isomorphism problem, which is known to be in the complexity class SZK. We achieve this by deploying a result on <i>characteristic vectors</i> by Elkies that gained attention in the context of 4-manifolds and Seiberg-Witten equations, but seems rather unnoticed in the algorithmic lattice community. On the way, we also show that for a given Gram matrix <span>(G in mathbb {Q}^{n times n})</span>, we can efficiently find a rational lattice that is embedded in at most four times the initial dimension <i>n</i>, i.e. a rational matrix <span>(B in mathbb {Q}^{4n times n})</span> such that <span>(B^intercal B = G)</span>.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"2013 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140885737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-16DOI: 10.1007/s10107-024-02060-9
Gennadiy Averkov, Claus Scheiderer
Consider the closed convex hull K of a monomial curve given parametrically as ((t^{m_1},ldots ,t^{m_n})), with the parameter t varying in an interval I. We show, using constructive arguments, that K admits a lifted semidefinite description by (mathcal {O}(d)) linear matrix inequalities (LMIs), each of size (leftlfloor frac{n}{2} rightrfloor +1), where (d= max {m_1,ldots ,m_n}) is the degree of the curve. On the dual side, we show that if a univariate polynomial p(t) of degree d with at most (2k+1) monomials is non-negative on ({mathbb {R}}_+), then p admits a representation (p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k}), where the polynomials (sigma _0,ldots ,sigma _{d-k}) are sums of squares and (deg (sigma _i) le 2k). The latter is a univariate positivstellensatz for sparse polynomials, with non-negativity of p being certified by sos polynomials whose degree only depends on the sparsity of p. Our results fit into the general attempt of formulating polynomial optimization problems as semidefinite problems with LMIs of small size. Such small-size descriptions are much more tractable from a computational viewpoint.
考虑参数为 ((t^{m_1},ldots ,t^{m_n}))的单项式曲线的闭凸壳 K,参数 t 在区间 I 中变化。我们利用构造论证证明,K 可以通过线性矩阵不等式(LMIs)进行提升半定量描述,每个线性矩阵不等式的大小为 (leftlfloor frac{n}{2} rightrfloor +1) ,其中 (d= max {m_1,ldots ,m_n/}/)是曲线的阶数。在对偶方面,我们证明了如果阶数为 d 的单变量多项式 p(t) 在 ({mathbb {R}}_+) 上是非负的,且其单项式最多有(2k+1) 个、then p admits a representation (p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k}), where the polynomials (sigma _0,ldots ,sigma _{d-k}) are sums of squares and (deg (sigma _i) le 2k).后者是稀疏多项式的单变量正弦定理,p 的非负性由 sos 多项式证明,而 sos 多项式的度数只取决于 p 的稀疏性。我们的结果符合将多项式优化问题表述为具有小尺寸 LMI 的半有限问题的一般尝试。从计算的角度来看,这种小规模的描述要容易得多。
{"title":"Convex hulls of monomial curves, and a sparse positivstellensatz","authors":"Gennadiy Averkov, Claus Scheiderer","doi":"10.1007/s10107-024-02060-9","DOIUrl":"https://doi.org/10.1007/s10107-024-02060-9","url":null,"abstract":"<p>Consider the closed convex hull <i>K</i> of a monomial curve given parametrically as <span>((t^{m_1},ldots ,t^{m_n}))</span>, with the parameter <i>t</i> varying in an interval <i>I</i>. We show, using constructive arguments, that <i>K</i> admits a lifted semidefinite description by <span>(mathcal {O}(d))</span> linear matrix inequalities (LMIs), each of size <span>(leftlfloor frac{n}{2} rightrfloor +1)</span>, where <span>(d= max {m_1,ldots ,m_n})</span> is the degree of the curve. On the dual side, we show that if a univariate polynomial <i>p</i>(<i>t</i>) of degree <i>d</i> with at most <span>(2k+1)</span> monomials is non-negative on <span>({mathbb {R}}_+)</span>, then <i>p</i> admits a representation <span>(p = t^0 sigma _0 + cdots + t^{d-k} sigma _{d-k})</span>, where the polynomials <span>(sigma _0,ldots ,sigma _{d-k})</span> are sums of squares and <span>(deg (sigma _i) le 2k)</span>. The latter is a univariate positivstellensatz for sparse polynomials, with non-negativity of <i>p</i> being certified by sos polynomials whose degree only depends on the sparsity of <i>p</i>. Our results fit into the general attempt of formulating polynomial optimization problems as semidefinite problems with LMIs of small size. Such small-size descriptions are much more tractable from a computational viewpoint.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"10 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140886077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-12DOI: 10.1007/s10107-023-02051-2
Michelangelo Bin, Ivano Notarnicola, Thomas Parisini
We consider the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm, also known as the first-order Lagrangian method, for constrained optimization problems involving a smooth strongly convex cost and smooth convex constraints. We prove that, for every given compact set of initial conditions, there always exists a sufficiently small stepsize guaranteeing exponential stability of the optimal primal-dual solution of the problem with a domain of attraction including the initialization set. Inspired by the analysis of nonlinear oscillators, the stability proof is based on a non-quadratic Lyapunov function including a nonlinear cross term.
{"title":"Semiglobal exponential stability of the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm for constrained optimization","authors":"Michelangelo Bin, Ivano Notarnicola, Thomas Parisini","doi":"10.1007/s10107-023-02051-2","DOIUrl":"https://doi.org/10.1007/s10107-023-02051-2","url":null,"abstract":"<p>We consider the discrete-time Arrow-Hurwicz-Uzawa primal-dual algorithm, also known as the first-order Lagrangian method, for constrained optimization problems involving a smooth strongly convex cost and smooth convex constraints. We prove that, for every given compact set of initial conditions, there always exists a sufficiently small stepsize guaranteeing exponential stability of the optimal primal-dual solution of the problem with a domain of attraction including the initialization set. Inspired by the analysis of nonlinear oscillators, the stability proof is based on a non-quadratic Lyapunov function including a nonlinear cross term.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"12 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139760876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-05DOI: 10.1007/s10107-023-02032-5
Brendan Pass, Adolfo Vargas-Jiménez
We establish a general condition on the cost function to obtain uniqueness and Monge solutions in the multi-marginal optimal transport problem, under the assumption that a given collection of the marginals are absolutely continuous with respect to local coordinates. When only the first marginal is assumed to be absolutely continuous, our condition is equivalent to the twist on splitting sets condition found in Kim and Pass (SIAM J Math Anal 46:1538–1550, 2014; SIAM J Math Anal 46:1538–1550, 2014). In addition, it is satisfied by the special cost functions in our earlier work (Pass and Vargas-Jiménez in SIAM J Math Anal 53:4386–4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory. arXiv:2104.09488, 2021), when absolute continuity is imposed on certain other collections of marginals. We also present several new examples of cost functions which violate the twist on splitting sets condition but satisfy the new condition introduced here, including a class of examples arising in robust risk management problems; we therefore obtain Monge solution and uniqueness results for these cost functions, under regularity conditions on an appropriate subset of the marginals.
我们建立了一个关于成本函数的一般条件,以便在多边际最优运输问题中获得唯一性和蒙日解,前提是给定的边际集合相对于局部坐标是绝对连续的。当只假设第一个边际绝对连续时,我们的条件等同于 Kim 和 Pass(SIAM J Math Anal 46:1538-1550, 2014;SIAM J Math Anal 46:1538-1550, 2014)中发现的关于分裂集的扭曲条件。此外,当对某些其他边际集合施加绝对连续性时,我们早期工作(Pass 和 Vargas-Jiménez in SIAM J Math Anal 53:4386-4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory.我们还提出了几个成本函数的新例子,它们违反了分裂集上的扭曲条件,但满足了这里引入的新条件,其中包括稳健风险管理问题中出现的一类例子;因此,在边际的适当子集上的正则性条件下,我们得到了这些成本函数的 Monge 解和唯一性结果。
{"title":"A general framework for multi-marginal optimal transport","authors":"Brendan Pass, Adolfo Vargas-Jiménez","doi":"10.1007/s10107-023-02032-5","DOIUrl":"https://doi.org/10.1007/s10107-023-02032-5","url":null,"abstract":"<p>We establish a general condition on the cost function to obtain uniqueness and Monge solutions in the multi-marginal optimal transport problem, under the assumption that a given collection of the marginals are absolutely continuous with respect to local coordinates. When only the first marginal is assumed to be absolutely continuous, our condition is equivalent to the twist on splitting sets condition found in Kim and Pass (SIAM J Math Anal 46:1538–1550, 2014; SIAM J Math Anal 46:1538–1550, 2014). In addition, it is satisfied by the special cost functions in our earlier work (Pass and Vargas-Jiménez in SIAM J Math Anal 53:4386–4400, 2021; Monge solutions and uniqueness in multi-marginal optimal transport via graph theory. arXiv:2104.09488, 2021), when absolute continuity is imposed on certain other collections of marginals. We also present several new examples of cost functions which violate the twist on splitting sets condition but satisfy the new condition introduced here, including a class of examples arising in robust risk management problems; we therefore obtain Monge solution and uniqueness results for these cost functions, under regularity conditions on an appropriate subset of the marginals.\u0000</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"9 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139760932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-03DOI: 10.1007/s10107-023-02055-y
Abstract
The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a fixed compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single difference-of-convex function, based on new generalized linear-optimization oracles (LO). We show that these LOs can be computed efficiently with closed-form solutions in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is completely different from those existing works in the literature on FW-type methods that deal with fixed feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an away-step oracle to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant.
{"title":"Frank–Wolfe-type methods for a class of nonconvex inequality-constrained problems","authors":"","doi":"10.1007/s10107-023-02055-y","DOIUrl":"https://doi.org/10.1007/s10107-023-02055-y","url":null,"abstract":"<h3>Abstract</h3> <p>The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a <em>fixed</em> compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single <em>difference-of-convex</em> function, based on new <em>generalized</em> linear-optimization oracles (LO). We show that these LOs can be computed efficiently with <em>closed-form solutions</em> in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is <em>completely different</em> from those existing works in the literature on FW-type methods that deal with <em>fixed</em> feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an <em>away-step oracle</em> to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant. </p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"10 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-03DOI: 10.1007/s10107-023-02053-0
Simon Thomä, Grit Walther, Maximilian Schiffer
We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.
我们研究了具有非负右侧不确定性的多阶段可调鲁棒优化(ARO)问题的片断仿射策略。首先,我们构建了新的支配性不确定性集,并展示了当不确定性被这些新的不确定性集取代时,如何用线性程序高效地解决多阶段 ARO 问题。然后,我们演示了如何将这一替代问题的解决方案转化为原始问题的解决方案。通过仔细选择支配集,我们证明了我们的策略具有很强的近似边界,并将两阶段问题变体的许多已知边界扩展到了多阶段问题变体。此外,据我们所知,新的界限是首次针对一般多阶段 ARO 问题给出的界限。我们将我们的策略与文献中的其他策略进行了广泛比较,并证明了相对性能保证。在两个数值实验中,我们确定了不同策略的优势和劣势,并提出了有效的调整措施,以解决我们策略中最关键的劣势。总之,实验表明,我们的片断仿射策略的计算速度比仿射策略快几个数量级,同时通常能获得相当甚至更好的结果。
{"title":"Designing tractable piecewise affine policies for multi-stage adjustable robust optimization","authors":"Simon Thomä, Grit Walther, Maximilian Schiffer","doi":"10.1007/s10107-023-02053-0","DOIUrl":"https://doi.org/10.1007/s10107-023-02053-0","url":null,"abstract":"<p>We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"12 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In generalized malleable scheduling, jobs can be allocated and processed simultaneously on multiple machines so as to reduce the overall makespan of the schedule. The required processing time for each job is determined by the joint processing speed of the allocated machines. We study the case that processing speeds are job-dependent (M ^{natural })-concave functions and provide a constant-factor approximation for this setting, significantly expanding the realm of functions for which such an approximation is possible. Further, we explore the connection between malleable scheduling and the problem of fairly allocating items to a set of agents with distinct utility functions, devising a black-box reduction that allows to obtain resource-augmented approximation algorithms for the latter.
{"title":"A constant-factor approximation for generalized malleable scheduling under $$M ^{natural }$$ -concave processing speeds","authors":"Dimitris Fotakis, Jannik Matuschke, Orestis Papadigenopoulos","doi":"10.1007/s10107-023-02054-z","DOIUrl":"https://doi.org/10.1007/s10107-023-02054-z","url":null,"abstract":"<p>In generalized malleable scheduling, jobs can be allocated and processed simultaneously on multiple machines so as to reduce the overall makespan of the schedule. The required processing time for each job is determined by the joint processing speed of the allocated machines. We study the case that processing speeds are job-dependent <span>(M ^{natural })</span>-concave functions and provide a constant-factor approximation for this setting, significantly expanding the realm of functions for which such an approximation is possible. Further, we explore the connection between malleable scheduling and the problem of fairly allocating items to a set of agents with distinct utility functions, devising a black-box reduction that allows to obtain resource-augmented approximation algorithms for the latter.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"8 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139644974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-27DOI: 10.1007/s10107-023-02049-w
Abstract
We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.
{"title":"Adjustability in robust linear optimization","authors":"","doi":"10.1007/s10107-023-02049-w","DOIUrl":"https://doi.org/10.1007/s10107-023-02049-w","url":null,"abstract":"<h3>Abstract</h3> <p>We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"32 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139583987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}