Mathematical Programming最新文献

Advances on strictly Δ -modular IPs.

IF 2.2 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2025-01-01 Epub Date: 2024-10-30 DOI: 10.1007/s10107-024-02148-2

Martin Nägele, Christian Nöbel, Richard Santiago, Rico Zenklusen

There has been significant work recently on integer programs (IPs) $min {c^{⊤} x : A x \leq b, x \in Z^{n}}$ with a constraint marix A with bounded subdeterminants. This is motivated by a well-known conjecture claiming that, for any constant $Δ \in Z_{> 0}$ , $Δ$ -modular IPs are efficiently solvable, which are IPs where the constraint matrix $A \in Z^{m \times n}$ has full column rank and all $n \times n$ minors of A are within ${- Δ, \dots, Δ}$ . Previous progress on this question, in particular for $Δ = 2$ , relies on algorithms that solve an important special case, namely strictly $Δ$ -modular IPs, which further restrict the $n \times n$ minors of A to be within ${- Δ, 0, Δ}$ . Even for $Δ = 2$ , such problems include well-known combinatorial optimization problems like the minimum odd/even cut problem. The conjecture remains open even for strictly $Δ$ -modular IPs. Prior advances were restricted to prime $Δ$ , which allows for employing strong number-theoretic results. In this work, we make first progress beyond the prime case by presenting techniques not relying on such strong number-theoretic prime results. In particular, our approach implies that there is a randomized algorithm to check feasibility of strictly $Δ$ -modular IPs in strongly polynomial time if $Δ \leq 4$ .

{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">Advances on strictly <ns0:math><ns0:mi>Δ</ns0:mi></ns0:math> -modular IPs.","authors":"Martin Nägele, Christian Nöbel, Richard Santiago, Rico Zenklusen","doi":"10.1007/s10107-024-02148-2","DOIUrl":"https://doi.org/10.1007/s10107-024-02148-2","url":null,"abstract":"There has been significant work recently on integer programs (IPs) <math><mrow><mo>min</mo> <mo>{</mo> <msup><mi>c</mi> <mi>⊤</mi></msup> <mi>x</mi> <mo>:</mo> <mi>A</mi> <mi>x</mi> <mo>≤</mo> <mi>b</mi> <mo>,</mo> <mspace></mspace> <mi>x</mi> <mo>∈</mo> <msup><mrow><mi>Z</mi></mrow> <mi>n</mi></msup> <mo>}</mo></mrow> </math> with a constraint marix A with bounded subdeterminants. This is motivated by a well-known conjecture claiming that, for any constant <math><mrow><mi>Δ</mi> <mo>∈</mo> <msub><mi>Z</mi> <mrow><mo>></mo> <mn>0</mn></mrow> </msub> </mrow> </math> , <math><mi>Δ</mi></math> -modular IPs are efficiently solvable, which are IPs where the constraint matrix <math><mrow><mi>A</mi> <mo>∈</mo> <msup><mrow><mi>Z</mi></mrow> <mrow><mi>m</mi> <mo>×</mo> <mi>n</mi></mrow> </msup> </mrow> </math> has full column rank and all <math><mrow><mi>n</mi> <mo>×</mo> <mi>n</mi></mrow> </math> minors of A are within <math><mrow><mo>{</mo> <mo>-</mo> <mi>Δ</mi> <mo>,</mo> <mo>⋯</mo> <mo>,</mo> <mi>Δ</mi> <mo>}</mo></mrow> </math> . Previous progress on this question, in particular for <math><mrow><mi>Δ</mi> <mo>=</mo> <mn>2</mn></mrow> </math> , relies on algorithms that solve an important special case, namely strictly <math><mi>Δ</mi></math> -modular IPs, which further restrict the <math><mrow><mi>n</mi> <mo>×</mo> <mi>n</mi></mrow> </math> minors of A to be within <math><mrow><mo>{</mo> <mo>-</mo> <mi>Δ</mi> <mo>,</mo> <mn>0</mn> <mo>,</mo> <mi>Δ</mi> <mo>}</mo></mrow> </math> . Even for <math><mrow><mi>Δ</mi> <mo>=</mo> <mn>2</mn></mrow> </math> , such problems include well-known combinatorial optimization problems like the minimum odd/even cut problem. The conjecture remains open even for strictly <math><mi>Δ</mi></math> -modular IPs. Prior advances were restricted to prime <math><mi>Δ</mi></math> , which allows for employing strong number-theoretic results. In this work, we make first progress beyond the prime case by presenting techniques not relying on such strong number-theoretic prime results. In particular, our approach implies that there is a randomized algorithm to check feasibility of strictly <math><mi>Δ</mi></math> -modular IPs in strongly polynomial time if <math><mrow><mi>Δ</mi> <mo>≤</mo> <mn>4</mn></mrow> </math> .","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"210 1-2","pages":"731-760"},"PeriodicalIF":2.2,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11870991/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143542512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Constant-competitiveness for random assignment Matroid secretary without knowing the Matroid.

IF 2.2 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2025-01-01 Epub Date: 2025-01-15 DOI: 10.1007/s10107-024-02177-x

Richard Santiago, Ivan Sergeev, Rico Zenklusen

The Matroid Secretary Conjecture is a notorious open problem in online optimization. It claims the existence of an O(1)-competitive algorithm for the Matroid Secretary Problem (MSP). Here, the elements of a weighted matroid appear one-by-one, revealing their weight at appearance, and the task is to select elements online with the goal to get an independent set of largest possible weight. O(1)-competitive MSP algorithms have so far only been obtained for restricted matroid classes and for MSP variations, including Random-Assignment MSP (RA-MSP), where an adversary fixes a number of weights equal to the ground set size of the matroid, which then get assigned randomly to the elements of the ground set. Unfortunately, these approaches heavily rely on knowing the full matroid upfront. This is an arguably undesirable requirement, and there are good reasons to believe that an approach towards resolving the MSP Conjecture should not rely on it. Thus, both Soto (SIAM Journal on Computing 42(1): 178-211, 2013.) and Oveis Gharan and Vondrák (Algorithmica 67(4): 472-497, 2013.) raised as an open question whether RA-MSP admits an O(1)-competitive algorithm even without knowing the matroid upfront. In this work, we answer this question affirmatively. Our result makes RA-MSP the first well-known MSP variant with an O(1)-competitive algorithm that does not need to know the underlying matroid upfront and without any restriction on the underlying matroid. Our approach is based on first approximately learning the rank-density curve of the matroid, which we then exploit algorithmically.

矩阵秘书猜想（Matroid Secretary Conjecture）是在线优化领域一个臭名昭著的开放性问题。该猜想声称存在针对矩阵秘书问题（MSP）的 O(1)-competitive 算法。在这里，一个加权矩阵的元素会一个接一个地出现，在出现时显示它们的权重，任务是在线选择元素，目标是得到一个权重最大的独立集合。O(1)-competitive MSP 算法迄今只适用于受限制的 matroid 类和 MSP 变体，包括随机分配 MSP (RA-MSP)，其中对手固定了与 matroid 地面集大小相等的权重数，然后将其随机分配给地面集的元素。遗憾的是，这些方法在很大程度上依赖于预先知道完整的 matroid。可以说，这是一个不可取的要求，我们有充分的理由相信，解决 MSP 猜想的方法不应依赖于此。因此，Soto (SIAM Journal on Computing 42(1)：178-211, 2013.) 以及 Oveis Gharan 和 Vondrák (Algorithmica 67(4)：472-497, 2013. ）提出了一个开放性问题：即使不预先知道矩阵，RA-MSP 是否也能实现 O(1)-competitive 算法。在这项工作中，我们肯定地回答了这个问题。我们的结果使 RA-MSP 成为第一个具有 O(1)-competitive 算法的著名 MSP 变体，它不需要预先知道底层 matroid，而且对底层 matroid 没有任何限制。我们的方法基于首先近似学习 Matroid 的秩密度曲线，然后在算法上加以利用。

{"title":"Constant-competitiveness for random assignment Matroid secretary without knowing the Matroid.","authors":"Richard Santiago, Ivan Sergeev, Rico Zenklusen","doi":"10.1007/s10107-024-02177-x","DOIUrl":"https://doi.org/10.1007/s10107-024-02177-x","url":null,"abstract":"The Matroid Secretary Conjecture is a notorious open problem in online optimization. It claims the existence of an O(1)-competitive algorithm for the Matroid Secretary Problem (MSP). Here, the elements of a weighted matroid appear one-by-one, revealing their weight at appearance, and the task is to select elements online with the goal to get an independent set of largest possible weight. O(1)-competitive MSP algorithms have so far only been obtained for restricted matroid classes and for MSP variations, including Random-Assignment MSP (RA-MSP), where an adversary fixes a number of weights equal to the ground set size of the matroid, which then get assigned randomly to the elements of the ground set. Unfortunately, these approaches heavily rely on knowing the full matroid upfront. This is an arguably undesirable requirement, and there are good reasons to believe that an approach towards resolving the MSP Conjecture should not rely on it. Thus, both Soto (SIAM Journal on Computing 42(1): 178-211, 2013.) and Oveis Gharan and Vondrák (Algorithmica 67(4): 472-497, 2013.) raised as an open question whether RA-MSP admits an O(1)-competitive algorithm even without knowing the matroid upfront. In this work, we answer this question affirmatively. Our result makes RA-MSP the first well-known MSP variant with an O(1)-competitive algorithm that does not need to know the underlying matroid upfront and without any restriction on the underlying matroid. Our approach is based on first approximately learning the rank-density curve of the matroid, which we then exploit algorithmically.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"210 1-2","pages":"815-846"},"PeriodicalIF":2.2,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11870907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143542520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A nearly optimal randomized algorithm for explorable heap selection.

IF 2.2 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2025-01-01 Epub Date: 2024-11-05 DOI: 10.1007/s10107-024-02145-5

Sander Borst, Daniel Dadush, Sophie Huiberts, Danish Kashaev

Explorable heap selection is the problem of selecting the nth smallest value in a binary heap. The key values can only be accessed by traversing through the underlying infinite binary tree, and the complexity of the algorithm is measured by the total distance traveled in the tree (each edge has unit cost). This problem was originally proposed as a model to study search strategies for the branch-and-bound algorithm with storage restrictions by Karp, Saks and Widgerson (FOCS '86), who gave deterministic and randomized $n \cdot exp (O (\sqrt{log n}))$ time algorithms using $O (log {(n)}^{2.5})$ and $O (\sqrt{log n})$ space respectively. We present a new randomized algorithm with running time $O (n log {(n)}^{3})$ against an oblivious adversary using $O (log n)$ space, substantially improving the previous best randomized running time at the expense of slightly increased space usage. We also show an $Ω (log (n) n / log (log (n)))$ lower bound for any algorithm that solves the problem in the same amount of space, indicating that our algorithm is nearly optimal.

可探索堆选择是在二进制堆中选择第 n 个最小值的问题。关键值只能通过遍历底层的无限二叉树来获取，算法的复杂度由在树中所走的总距离来衡量（每条边都有单位成本）。这个问题最初是由 Karp、Saks 和 Widgerson（FOCS '86）作为研究有存储限制的分支与边界算法搜索策略的模型提出的，他们分别给出了使用 O ( log ( n ) 2.5 ) 和 O ( log n ) 空间的确定性和随机 n - exp ( O ( log n ) ) 时间算法。我们提出了一种新的随机算法，其针对遗忘对手的运行时间为 O ( n log ( n ) 3 )，使用空间为 O ( log n )，大大改进了之前的最佳随机运行时间，但使用空间略有增加。我们还展示了 Ω ( log ( n ) n / log ( log ( n ) )) 的下限，表明我们的算法接近最优。

{"title":"A nearly optimal randomized algorithm for explorable heap selection.","authors":"Sander Borst, Daniel Dadush, Sophie Huiberts, Danish Kashaev","doi":"10.1007/s10107-024-02145-5","DOIUrl":"https://doi.org/10.1007/s10107-024-02145-5","url":null,"abstract":"Explorable heap selection is the problem of selecting the nth smallest value in a binary heap. The key values can only be accessed by traversing through the underlying infinite binary tree, and the complexity of the algorithm is measured by the total distance traveled in the tree (each edge has unit cost). This problem was originally proposed as a model to study search strategies for the branch-and-bound algorithm with storage restrictions by Karp, Saks and Widgerson (FOCS '86), who gave deterministic and randomized <math><mrow><mi>n</mi> <mo>·</mo> <mo>exp</mo> <mo>(</mo> <mi>O</mi> <mrow><mo>(</mo> <msqrt><mrow><mo>log</mo> <mi>n</mi></mrow> </msqrt> <mo>)</mo></mrow> <mo>)</mo></mrow> </math> time algorithms using <math><mrow><mi>O</mi> <mo>(</mo> <mo>log</mo> <msup><mrow><mo>(</mo> <mi>n</mi> <mo>)</mo></mrow> <mrow><mn>2.5</mn></mrow> </msup> <mo>)</mo></mrow> </math> and <math><mrow><mi>O</mi> <mo>(</mo> <msqrt><mrow><mo>log</mo> <mi>n</mi></mrow> </msqrt> <mo>)</mo></mrow> </math> space respectively. We present a new randomized algorithm with running time <math><mrow><mi>O</mi> <mo>(</mo> <mi>n</mi> <mo>log</mo> <msup><mrow><mo>(</mo> <mi>n</mi> <mo>)</mo></mrow> <mn>3</mn></msup> <mo>)</mo></mrow> </math> against an oblivious adversary using <math><mrow><mi>O</mi> <mo>(</mo> <mo>log</mo> <mi>n</mi> <mo>)</mo></mrow> </math> space, substantially improving the previous best randomized running time at the expense of slightly increased space usage. We also show an <math><mrow><mi>Ω</mi> <mo>(</mo> <mo>log</mo> <mo>(</mo> <mi>n</mi> <mo>)</mo> <mi>n</mi> <mo>/</mo> <mo>log</mo> <mo>(</mo> <mo>log</mo> <mo>(</mo> <mi>n</mi> <mo>)</mo> <mo>)</mo> <mo>)</mo></mrow> </math> lower bound for any algorithm that solves the problem in the same amount of space, indicating that our algorithm is nearly optimal.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"210 1-2","pages":"75-96"},"PeriodicalIF":2.2,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11870923/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143542518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Did smallpox cause stillbirths? Maternal smallpox infection, vaccination, and stillbirths in Sweden, 1780-1839. 天花会导致死胎吗？1780-1839年瑞典产妇天花感染、疫苗接种和死胎。

IF 1.5 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-11-01 Epub Date: 2023-02-22 DOI: 10.1080/00324728.2023.2174266

Eric B Schneider, Sören Edvinsson, Kota Ogasawara

While there is strong evidence that maternal smallpox infection can cause foetal loss, it is not clear whether smallpox infections were a demographically important cause of stillbirths historically. In this paper, we use parish-level data from the Swedish Tabellverket data set for 1780-1839 to test the effect of smallpox on stillbirths quantitatively, analysing periods before and after the introduction of vaccination in 1802. We find that smallpox infection was not a major cause of stillbirths before 1820, because most women contracted smallpox as children and were therefore not susceptible during pregnancy. We do find a small, statistically significant effect of smallpox on stillbirths from 1820 to 1839, when waning immunity from vaccination put a greater share of pregnant women at risk of contracting smallpox. However, the reduced prevalence of smallpox in this period limited its impact on stillbirths. Thus, smallpox was not an important driver of historical stillbirth trends.

虽然有确凿证据表明母体感染天花会导致胎儿死亡，但尚不清楚历史上天花感染是否是死胎的重要原因。在本文中，我们利用瑞典 Tabellverket 数据集中 1780-1839 年的教区级数据，定量检验了天花对死胎的影响，分析了 1802 年引入疫苗接种之前和之后的时期。我们发现，在 1820 年之前，感染天花并不是死产的主要原因，因为大多数妇女在孩童时期就感染了天花，因此在怀孕期间不易感染。我们确实发现，在 1820 年至 1839 年期间，天花对死胎的影响很小，但在统计意义上却很显著，因为当时接种疫苗后免疫力下降，导致更多孕妇面临感染天花的风险。然而，这一时期天花流行率的降低限制了其对死胎的影响。因此，天花并不是历史上死胎趋势的重要驱动因素。

引用次数: 0

Fast convergence to non-isolated minima: four equivalent conditions for $${textrm{C}^{2}}$$ functions 非孤立极小值的快速收敛：$${textrm{C}^{2}}$函数的四个等价条件

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-09-19 DOI: 10.1007/s10107-024-02136-6

Quentin Rebjock, Nicolas Boumal

Optimization algorithms can see their local convergence rates deteriorate when the Hessian at the optimum is singular. These singularities are inescapable when the optima are non-isolated. Yet, under the right circumstances, several algorithms preserve their favorable rates even when optima form a continuum (e.g., due to over-parameterization). This has been explained under various structural assumptions, including the Polyak–Łojasiewicz condition, Quadratic Growth and the Error Bound. We show that, for cost functions which are twice continuously differentiable ((textrm{C}^2)), those three (local) properties are equivalent. Moreover, we show they are equivalent to the Morse–Bott property, that is, local minima form differentiable submanifolds, and the Hessian of the cost function is positive definite along its normal directions. We leverage this insight to improve local convergence guarantees for safe-guarded Newton-type methods under any (hence all) of the above assumptions. First, for adaptive cubic regularization, we secure quadratic convergence even with inexact subproblem solvers. Second, for trust-region methods, we argue capture can fail with an exact subproblem solver, then proceed to show linear convergence with an inexact one (Cauchy steps).

当最优点的 Hessian 出现奇异值时，优化算法的局部收敛速度就会下降。当最优点不孤立时，这些奇异性是不可避免的。然而，在适当的情况下，有几种算法即使在最优点形成连续体时也能保持良好的收敛率（例如，由于过度参数化）。这在各种结构假设下都有解释，包括 Polyak-Łojasiewicz 条件、二次增长和误差约束。我们证明，对于两次连续可微（(textrm{C}^2)）的成本函数，这三个（局部）属性是等价的。此外，我们还证明了它们等同于莫尔斯-波特（Morse-Bott）性质，即局部极小值形成可微分的子曲面，成本函数的赫塞斯沿其法线方向是正定的。我们利用这一洞察力，改进了在上述任何（也包括所有）假设条件下的安全牛顿型方法的局部收敛保证。首先，对于自适应立方正则化，即使使用不精确的子问题求解器，我们也能确保二次收敛。其次，对于信任区域方法，我们认为使用精确子问题求解器可以捕获失败，然后继续证明使用非精确求解器（考奇步）可以线性收敛。

{"title":"Fast convergence to non-isolated minima: four equivalent conditions for $${textrm{C}^{2}}$$ functions","authors":"Quentin Rebjock, Nicolas Boumal","doi":"10.1007/s10107-024-02136-6","DOIUrl":"https://doi.org/10.1007/s10107-024-02136-6","url":null,"abstract":"Optimization algorithms can see their local convergence rates deteriorate when the Hessian at the optimum is singular. These singularities are inescapable when the optima are non-isolated. Yet, under the right circumstances, several algorithms preserve their favorable rates even when optima form a continuum (e.g., due to over-parameterization). This has been explained under various structural assumptions, including the Polyak–Łojasiewicz condition, Quadratic Growth and the Error Bound. We show that, for cost functions which are twice continuously differentiable ((textrm{C}^2)), those three (local) properties are equivalent. Moreover, we show they are equivalent to the Morse–Bott property, that is, local minima form differentiable submanifolds, and the Hessian of the cost function is positive definite along its normal directions. We leverage this insight to improve local convergence guarantees for safe-guarded Newton-type methods under any (hence all) of the above assumptions. First, for adaptive cubic regularization, we secure quadratic convergence even with inexact subproblem solvers. Second, for trust-region methods, we argue capture can fail with an exact subproblem solver, then proceed to show linear convergence with an inexact one (Cauchy steps).","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"29 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142258237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Complexity of chordal conversion for sparse semidefinite programs with small treewidth 小树宽稀疏半inite程序的和弦转换复杂性

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-09-17 DOI: 10.1007/s10107-024-02137-5

Richard Y. Zhang

If a sparse semidefinite program (SDP), specified over (ntimes n) matrices and subject to m linear constraints, has an aggregate sparsity graph G with small treewidth, then chordal conversion will sometimes allow an interior-point method to solve the SDP in just (O(m+n)) time per-iteration, which is a significant speedup over the (varOmega (n^{3})) time per-iteration for a direct application of the interior-point method. Unfortunately, the speedup is not guaranteed by an O(1) treewidth in G that is independent of m and n, as a diagonal SDP would have treewidth zero but can still necessitate up to (varOmega (n^{3})) time per-iteration. Instead, we construct an extended aggregate sparsity graph (overline{G}supseteq G) by forcing each constraint matrix (A_{i}) to be its own clique in G. We prove that a small treewidth in (overline{G}) does indeed guarantee that chordal conversion will solve the SDP in (O(m+n)) time per-iteration, to (epsilon )-accuracy in at most (O(sqrt{m+n}log (1/epsilon ))) iterations. This sufficient condition covers many successful applications of chordal conversion, including the MAX-k-CUT relaxation, the Lovász theta problem, sensor network localization, polynomial optimization, and the AC optimal power flow relaxation, thus allowing theory to match practical experience.

如果一个在 (ntimes n) 矩阵上指定并受制于 m 个线性约束的稀疏半定式程序（SDP）有一个具有小树宽的总稀疏性图 G、那么和弦转换有时会让内点法在每次迭代时只需要 (O(m+n)) 时间就能求解 SDP，这比直接应用内点法每次迭代所需的(varOmega (n^{3})) 时间大大加快了速度。不幸的是，这种加速并不能通过 G 中与 m 和 n 无关的 O(1) 树状宽度来保证，因为对角 SDP 的树状宽度为零，但每次迭代仍然需要多达 (varOmega (n^{3})) 的时间。相反，我们通过强迫每个约束矩阵 (A_{i}) 成为 G 中自己的小块，来构建一个扩展的集合稀疏性图 (overline{G}supseteq G) 。我们证明，(overline{G})中的小树宽确实可以保证弦变换在每次迭代中以(O(m+n))时间求解 SDP，最多以(O(sqrt{m+n}log (1/epsilon ))) 次迭代达到(epsilon)精度。这个充分条件涵盖了和弦转换的许多成功应用，包括 MAX-k-CUT 松弛、Lovász theta 问题、传感器网络定位、多项式优化和交流最优功率流松弛，从而使理论与实践经验相匹配。

{"title":"Complexity of chordal conversion for sparse semidefinite programs with small treewidth","authors":"Richard Y. Zhang","doi":"10.1007/s10107-024-02137-5","DOIUrl":"https://doi.org/10.1007/s10107-024-02137-5","url":null,"abstract":"If a sparse semidefinite program (SDP), specified over (ntimes n) matrices and subject to m linear constraints, has an aggregate sparsity graph G with small treewidth, then chordal conversion will sometimes allow an interior-point method to solve the SDP in just (O(m+n)) time per-iteration, which is a significant speedup over the (varOmega (n^{3})) time per-iteration for a direct application of the interior-point method. Unfortunately, the speedup is not guaranteed by an O(1) treewidth in G that is independent of m and n, as a diagonal SDP would have treewidth zero but can still necessitate up to (varOmega (n^{3})) time per-iteration. Instead, we construct an extended aggregate sparsity graph (overline{G}supseteq G) by forcing each constraint matrix (A_{i}) to be its own clique in G. We prove that a small treewidth in (overline{G}) does indeed guarantee that chordal conversion will solve the SDP in (O(m+n)) time per-iteration, to (epsilon )-accuracy in at most (O(sqrt{m+n}log (1/epsilon ))) iterations. This sufficient condition covers many successful applications of chordal conversion, including the MAX-k-CUT relaxation, the Lovász theta problem, sensor network localization, polynomial optimization, and the AC optimal power flow relaxation, thus allowing theory to match practical experience.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"15 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142258110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Recycling valid inequalities for robust combinatorial optimization with budgeted uncertainty 具有预算不确定性的稳健组合优化的循环有效不等式

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-08-29 DOI: 10.1007/s10107-024-02135-7

Christina Büsing, Timo Gersing, Arie M. C. A. Koster

Robust combinatorial optimization with budgeted uncertainty is one of the most popular approaches for integrating uncertainty into optimization problems. The existence of a compact reformulation for (mixed-integer) linear programs and positive complexity results give the impression that these problems are relatively easy to solve. However, the practical performance of the reformulation is quite poor when solving robust integer problems, in particular due to its weak linear relaxation. To overcome this issue, we propose procedures to derive new classes of valid inequalities for robust combinatorial optimization problems. For this, we recycle valid inequalities of the underlying deterministic problem such that the additional variables from the robust formulation are incorporated. The valid inequalities to be recycled may either be readily available model constraints or actual cutting planes, where we can benefit from decades of research on valid inequalities for classical optimization problems. We first demonstrate the strength of the inequalities theoretically, by proving that recycling yields a facet-defining inequality in many cases, even if the original valid inequality was not facet-defining. Afterwards, we show in an extensive computational study that using recycled inequalities can lead to a significant improvement of the computation time when solving robust optimization problems.

预算不确定性的稳健组合优化是将不确定性纳入优化问题的最流行方法之一。线性（混合整数）程序的紧凑重构和正复杂性结果给人的印象是，这些问题相对容易解决。然而，在求解鲁棒整数问题时，重整计算的实际性能却很差，特别是由于其线性松弛较弱。为了克服这个问题，我们提出了为鲁棒组合优化问题推导新的有效不等式类别的程序。为此，我们回收了基础确定性问题的有效不等式，以便将稳健公式中的额外变量纳入其中。需要回收的有效不等式可以是现成的模型约束，也可以是实际的切割平面，我们可以从几十年来对经典优化问题有效不等式的研究中获益。我们首先从理论上证明了不等式的优势，证明在很多情况下，即使原始有效不等式不是面定义的，循环也能得到面定义不等式。随后，我们通过大量的计算研究表明，在求解鲁棒优化问题时，使用循环不等式可以显著缩短计算时间。

{"title":"Recycling valid inequalities for robust combinatorial optimization with budgeted uncertainty","authors":"Christina Büsing, Timo Gersing, Arie M. C. A. Koster","doi":"10.1007/s10107-024-02135-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02135-7","url":null,"abstract":"Robust combinatorial optimization with budgeted uncertainty is one of the most popular approaches for integrating uncertainty into optimization problems. The existence of a compact reformulation for (mixed-integer) linear programs and positive complexity results give the impression that these problems are relatively easy to solve. However, the practical performance of the reformulation is quite poor when solving robust integer problems, in particular due to its weak linear relaxation. To overcome this issue, we propose procedures to derive new classes of valid inequalities for robust combinatorial optimization problems. For this, we recycle valid inequalities of the underlying deterministic problem such that the additional variables from the robust formulation are incorporated. The valid inequalities to be recycled may either be readily available model constraints or actual cutting planes, where we can benefit from decades of research on valid inequalities for classical optimization problems. We first demonstrate the strength of the inequalities theoretically, by proving that recycling yields a facet-defining inequality in many cases, even if the original valid inequality was not facet-defining. Afterwards, we show in an extensive computational study that using recycled inequalities can lead to a significant improvement of the computation time when solving robust optimization problems.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"152 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accelerated stochastic approximation with state-dependent noise 带有状态相关噪声的加速随机逼近

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-08-27 DOI: 10.1007/s10107-024-02138-4

Sasila Ilandarideva, Anatoli Juditsky, Guanghui Lan, Tianjiao Li

We consider a class of stochastic smooth convex optimization problems under rather general assumptions on the noise in the stochastic gradient observation. As opposed to the classical problem setting in which the variance of noise is assumed to be uniformly bounded, herein we assume that the variance of stochastic gradients is related to the “sub-optimality” of the approximate solutions delivered by the algorithm. Such problems naturally arise in a variety of applications, in particular, in the well-known generalized linear regression problem in statistics. However, to the best of our knowledge, none of the existing stochastic approximation algorithms for solving this class of problems attain optimality in terms of the dependence on accuracy, problem parameters, and mini-batch size. We discuss two non-Euclidean accelerated stochastic approximation routines—stochastic accelerated gradient descent (SAGD) and stochastic gradient extrapolation (SGE)—which carry a particular duality relationship. We show that both SAGD and SGE, under appropriate conditions, achieve the optimal convergence rate, attaining the optimal iteration and sample complexities simultaneously. However, corresponding assumptions for the SGE algorithm are more general; they allow, for instance, for efficient application of the SGE to statistical estimation problems under heavy tail noises and discontinuous score functions. We also discuss the application of the SGE to problems satisfying quadratic growth conditions, and show how it can be used to recover sparse solutions. Finally, we report on some simulation experiments to illustrate numerical performance of our proposed algorithms in high-dimensional settings.

我们考虑了一类随机平滑凸优化问题，并对随机梯度观测中的噪声作了相当宽泛的假设。在经典问题中，噪声方差被假定为均匀有界，而在这里，我们假定随机梯度的方差与算法提供的近似解的 "次优性 "有关。这种问题自然会在各种应用中出现，特别是在统计学中著名的广义线性回归问题中。然而，据我们所知，现有的解决这类问题的随机近似算法中，没有一种能在精度、问题参数和小批量规模的依赖性方面达到最优。我们讨论了两种非欧几里得加速随机逼近例程--随机加速梯度下降算法（SAGD）和随机梯度外推法（SGE），这两种算法具有特殊的对偶关系。我们的研究表明，在适当条件下，SAGD 和 SGE 都能达到最佳收敛速度，同时获得最佳迭代和样本复杂度。然而，SGE 算法的相应假设更为宽泛；例如，它们允许 SGE 有效地应用于重尾噪声和不连续得分函数下的统计估计问题。我们还讨论了 SGE 在满足二次增长条件的问题中的应用，并展示了它如何用于恢复稀疏解。最后，我们报告了一些模拟实验，以说明我们提出的算法在高维环境下的数值性能。

{"title":"Accelerated stochastic approximation with state-dependent noise","authors":"Sasila Ilandarideva, Anatoli Juditsky, Guanghui Lan, Tianjiao Li","doi":"10.1007/s10107-024-02138-4","DOIUrl":"https://doi.org/10.1007/s10107-024-02138-4","url":null,"abstract":"We consider a class of stochastic smooth convex optimization problems under rather general assumptions on the noise in the stochastic gradient observation. As opposed to the classical problem setting in which the variance of noise is assumed to be uniformly bounded, herein we assume that the variance of stochastic gradients is related to the “sub-optimality” of the approximate solutions delivered by the algorithm. Such problems naturally arise in a variety of applications, in particular, in the well-known generalized linear regression problem in statistics. However, to the best of our knowledge, none of the existing stochastic approximation algorithms for solving this class of problems attain optimality in terms of the dependence on accuracy, problem parameters, and mini-batch size. We discuss two non-Euclidean accelerated stochastic approximation routines—stochastic accelerated gradient descent (SAGD) and stochastic gradient extrapolation (SGE)—which carry a particular duality relationship. We show that both SAGD and SGE, under appropriate conditions, achieve the optimal convergence rate, attaining the optimal iteration and sample complexities simultaneously. However, corresponding assumptions for the SGE algorithm are more general; they allow, for instance, for efficient application of the SGE to statistical estimation problems under heavy tail noises and discontinuous score functions. We also discuss the application of the SGE to problems satisfying quadratic growth conditions, and show how it can be used to recover sparse solutions. Finally, we report on some simulation experiments to illustrate numerical performance of our proposed algorithms in high-dimensional settings.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"62 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A fast combinatorial algorithm for the bilevel knapsack problem with interdiction constraints 带拦截约束的双层knapsack问题的快速组合算法

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-08-22 DOI: 10.1007/s10107-024-02133-9

Noah Weninger, Ricardo Fukasawa

We consider the bilevel knapsack problem with interdiction constraints, a fundamental bilevel integer programming problem which generalizes the 0–1 knapsack problem. In this problem, there are two knapsacks and n items. The objective is to select some items to pack into the first knapsack such that the maximum profit attainable from packing some of the remaining items into the second knapsack is minimized. We present a combinatorial branch-and-bound algorithm which outperforms the current state-of-the-art solution method in computational experiments for 99% of the instances reported in the literature. On many of the harder instances, our algorithm is orders of magnitude faster, which enabled it to solve 53 of the 72 previously unsolved instances. Our result relies fundamentally on a new dynamic programming algorithm which computes very strong lower bounds. This dynamic program solves a relaxation of the problem from bilevel to 2n-level where the items are processed in an online fashion. The relaxation is easier to solve but approximates the original problem surprisingly well in practice. We believe that this same technique may be useful for other interdiction problems.

我们考虑的是带拦截约束的双层背包问题，这是一个基本的双层整数编程问题，是 0-1 背包问题的一般化。在这个问题中，有两个背包和 n 个物品。问题的目标是选择一些物品装入第一个背包，使得将剩余物品装入第二个背包所获得的最大利润最小。我们提出了一种分支与边界组合算法，在文献报道的 99% 的实例中，该算法在计算实验中的表现优于目前最先进的求解方法。在许多较难的实例中，我们的算法要快上几个数量级，这使得它能够解决 72 个以前未解决的实例中的 53 个。我们的成果从根本上依赖于一种新的动态编程算法，它能计算出非常强的下限。该动态程序将问题从双级放宽到 2n 级，在 2n 级中，项目以在线方式处理。这种松弛更容易求解，但在实践中却能出人意料地逼近原始问题。我们相信，同样的技术对其他拦截问题也很有用。

{"title":"A fast combinatorial algorithm for the bilevel knapsack problem with interdiction constraints","authors":"Noah Weninger, Ricardo Fukasawa","doi":"10.1007/s10107-024-02133-9","DOIUrl":"https://doi.org/10.1007/s10107-024-02133-9","url":null,"abstract":"We consider the bilevel knapsack problem with interdiction constraints, a fundamental bilevel integer programming problem which generalizes the 0–1 knapsack problem. In this problem, there are two knapsacks and n items. The objective is to select some items to pack into the first knapsack such that the maximum profit attainable from packing some of the remaining items into the second knapsack is minimized. We present a combinatorial branch-and-bound algorithm which outperforms the current state-of-the-art solution method in computational experiments for 99% of the instances reported in the literature. On many of the harder instances, our algorithm is orders of magnitude faster, which enabled it to solve 53 of the 72 previously unsolved instances. Our result relies fundamentally on a new dynamic programming algorithm which computes very strong lower bounds. This dynamic program solves a relaxation of the problem from bilevel to 2n-level where the items are processed in an online fashion. The relaxation is easier to solve but approximates the original problem surprisingly well in practice. We believe that this same technique may be useful for other interdiction problems.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"2 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses 非线性共轭梯度方法：通过计算机辅助分析实现最坏情况收敛率

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-08-22 DOI: 10.1007/s10107-024-02127-7

Shuvomoy Das Gupta, Robert M. Freund, Xu Andy Sun, Adrien Taylor

We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization, while having relatively incomplete analyses. Using our computer-assisted approach, we establish novel complexity bounds for the Polak-Ribière-Polyak (PRP) and the Fletcher-Reeves (FR) NCGMs for smooth strongly convex minimization. In particular, we construct mathematical proofs that establish the first non-asymptotic convergence bound for FR (which is historically the first developed NCGM), and a much improved non-asymptotic convergence bound for PRP. Additionally, we provide simple adversarial examples on which these methods do not perform better than gradient descent with exact line search, leaving very little room for improvements on the same class of problems.

我们提出了一种计算机辅助方法，用于分析非线性共轭梯度方法（NCGMs）的最坏收敛情况。众所周知，这些方法在大规模优化方面具有普遍良好的经验性能，但分析却相对不完整。利用我们的计算机辅助方法，我们为用于平滑强凸最小化的 Polak-Ribière-Polyak (PRP) 和 Fletcher-Reeves (FR) NCGMs 建立了新的复杂度边界。特别是，我们构建了数学证明，为 FR（历史上第一个开发的 NCGM）建立了第一个非渐近收敛约束，并为 PRP 建立了一个大大改进的非渐近收敛约束。此外，我们还提供了一些简单的对抗性示例，在这些示例中，这些方法的性能并不比使用精确线性搜索的梯度下降法更好，因此在同一类问题上的改进空间很小。

引用次数: 0