首页 > 最新文献

Mathematical Programming最新文献

英文 中文
Did smallpox cause stillbirths? Maternal smallpox infection, vaccination, and stillbirths in Sweden, 1780-1839. 天花会导致死胎吗?1780-1839年瑞典产妇天花感染、疫苗接种和死胎。
IF 1.5 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-11-01 Epub Date: 2023-02-22 DOI: 10.1080/00324728.2023.2174266
Eric B Schneider, Sören Edvinsson, Kota Ogasawara

While there is strong evidence that maternal smallpox infection can cause foetal loss, it is not clear whether smallpox infections were a demographically important cause of stillbirths historically. In this paper, we use parish-level data from the Swedish Tabellverket data set for 1780-1839 to test the effect of smallpox on stillbirths quantitatively, analysing periods before and after the introduction of vaccination in 1802. We find that smallpox infection was not a major cause of stillbirths before 1820, because most women contracted smallpox as children and were therefore not susceptible during pregnancy. We do find a small, statistically significant effect of smallpox on stillbirths from 1820 to 1839, when waning immunity from vaccination put a greater share of pregnant women at risk of contracting smallpox. However, the reduced prevalence of smallpox in this period limited its impact on stillbirths. Thus, smallpox was not an important driver of historical stillbirth trends.

虽然有确凿证据表明母体感染天花会导致胎儿死亡,但尚不清楚历史上天花感染是否是死胎的重要原因。在本文中,我们利用瑞典 Tabellverket 数据集中 1780-1839 年的教区级数据,定量检验了天花对死胎的影响,分析了 1802 年引入疫苗接种之前和之后的时期。我们发现,在 1820 年之前,感染天花并不是死产的主要原因,因为大多数妇女在孩童时期就感染了天花,因此在怀孕期间不易感染。我们确实发现,在 1820 年至 1839 年期间,天花对死胎的影响很小,但在统计意义上却很显著,因为当时接种疫苗后免疫力下降,导致更多孕妇面临感染天花的风险。然而,这一时期天花流行率的降低限制了其对死胎的影响。因此,天花并不是历史上死胎趋势的重要驱动因素。
{"title":"Did smallpox cause stillbirths? Maternal smallpox infection, vaccination, and stillbirths in Sweden, 1780-1839.","authors":"Eric B Schneider, Sören Edvinsson, Kota Ogasawara","doi":"10.1080/00324728.2023.2174266","DOIUrl":"10.1080/00324728.2023.2174266","url":null,"abstract":"<p><p>While there is strong evidence that maternal smallpox infection can cause foetal loss, it is not clear whether smallpox infections were a demographically important cause of stillbirths historically. In this paper, we use parish-level data from the Swedish Tabellverket data set for 1780-1839 to test the effect of smallpox on stillbirths quantitatively, analysing periods before and after the introduction of vaccination in 1802. We find that smallpox infection was not a major cause of stillbirths before 1820, because most women contracted smallpox as children and were therefore not susceptible during pregnancy. We do find a small, statistically significant effect of smallpox on stillbirths from 1820 to 1839, when waning immunity from vaccination put a greater share of pregnant women at risk of contracting smallpox. However, the reduced prevalence of smallpox in this period limited its impact on stillbirths. Thus, smallpox was not an important driver of historical stillbirth trends.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"15 1","pages":"467-482"},"PeriodicalIF":1.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73348221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast convergence to non-isolated minima: four equivalent conditions for $${textrm{C}^{2}}$$ functions 非孤立极小值的快速收敛:$${textrm{C}^{2}}$函数的四个等价条件
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-09-19 DOI: 10.1007/s10107-024-02136-6
Quentin Rebjock, Nicolas Boumal

Optimization algorithms can see their local convergence rates deteriorate when the Hessian at the optimum is singular. These singularities are inescapable when the optima are non-isolated. Yet, under the right circumstances, several algorithms preserve their favorable rates even when optima form a continuum (e.g., due to over-parameterization). This has been explained under various structural assumptions, including the Polyak–Łojasiewicz condition, Quadratic Growth and the Error Bound. We show that, for cost functions which are twice continuously differentiable ((textrm{C}^2)), those three (local) properties are equivalent. Moreover, we show they are equivalent to the Morse–Bott property, that is, local minima form differentiable submanifolds, and the Hessian of the cost function is positive definite along its normal directions. We leverage this insight to improve local convergence guarantees for safe-guarded Newton-type methods under any (hence all) of the above assumptions. First, for adaptive cubic regularization, we secure quadratic convergence even with inexact subproblem solvers. Second, for trust-region methods, we argue capture can fail with an exact subproblem solver, then proceed to show linear convergence with an inexact one (Cauchy steps).

当最优点的 Hessian 出现奇异值时,优化算法的局部收敛速度就会下降。当最优点不孤立时,这些奇异性是不可避免的。然而,在适当的情况下,有几种算法即使在最优点形成连续体时也能保持良好的收敛率(例如,由于过度参数化)。这在各种结构假设下都有解释,包括 Polyak-Łojasiewicz 条件、二次增长和误差约束。我们证明,对于两次连续可微((textrm{C}^2))的成本函数,这三个(局部)属性是等价的。此外,我们还证明了它们等同于莫尔斯-波特(Morse-Bott)性质,即局部极小值形成可微分的子曲面,成本函数的赫塞斯沿其法线方向是正定的。我们利用这一洞察力,改进了在上述任何(也包括所有)假设条件下的安全牛顿型方法的局部收敛保证。首先,对于自适应立方正则化,即使使用不精确的子问题求解器,我们也能确保二次收敛。其次,对于信任区域方法,我们认为使用精确子问题求解器可以捕获失败,然后继续证明使用非精确求解器(考奇步)可以线性收敛。
{"title":"Fast convergence to non-isolated minima: four equivalent conditions for $${textrm{C}^{2}}$$ functions","authors":"Quentin Rebjock, Nicolas Boumal","doi":"10.1007/s10107-024-02136-6","DOIUrl":"https://doi.org/10.1007/s10107-024-02136-6","url":null,"abstract":"<p>Optimization algorithms can see their local convergence rates deteriorate when the Hessian at the optimum is singular. These singularities are inescapable when the optima are non-isolated. Yet, under the right circumstances, several algorithms preserve their favorable rates even when optima form a continuum (e.g., due to over-parameterization). This has been explained under various structural assumptions, including the Polyak–Łojasiewicz condition, Quadratic Growth and the Error Bound. We show that, for cost functions which are twice continuously differentiable (<span>(textrm{C}^2)</span>), those three (local) properties are equivalent. Moreover, we show they are equivalent to the Morse–Bott property, that is, local minima form differentiable submanifolds, and the Hessian of the cost function is positive definite along its normal directions. We leverage this insight to improve local convergence guarantees for safe-guarded Newton-type methods under any (hence all) of the above assumptions. First, for adaptive cubic regularization, we secure quadratic convergence even with inexact subproblem solvers. Second, for trust-region methods, we argue capture can fail with an exact subproblem solver, then proceed to show linear convergence with an inexact one (Cauchy steps).</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"29 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142258237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complexity of chordal conversion for sparse semidefinite programs with small treewidth 小树宽稀疏半inite程序的和弦转换复杂性
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-09-17 DOI: 10.1007/s10107-024-02137-5
Richard Y. Zhang

If a sparse semidefinite program (SDP), specified over (ntimes n) matrices and subject to m linear constraints, has an aggregate sparsity graph G with small treewidth, then chordal conversion will sometimes allow an interior-point method to solve the SDP in just (O(m+n)) time per-iteration, which is a significant speedup over the (varOmega (n^{3})) time per-iteration for a direct application of the interior-point method. Unfortunately, the speedup is not guaranteed by an O(1) treewidth in G that is independent of m and n, as a diagonal SDP would have treewidth zero but can still necessitate up to (varOmega (n^{3})) time per-iteration. Instead, we construct an extended aggregate sparsity graph (overline{G}supseteq G) by forcing each constraint matrix (A_{i}) to be its own clique in G. We prove that a small treewidth in (overline{G}) does indeed guarantee that chordal conversion will solve the SDP in (O(m+n)) time per-iteration, to (epsilon )-accuracy in at most (O(sqrt{m+n}log (1/epsilon ))) iterations. This sufficient condition covers many successful applications of chordal conversion, including the MAX-k-CUT relaxation, the Lovász theta problem, sensor network localization, polynomial optimization, and the AC optimal power flow relaxation, thus allowing theory to match practical experience.

如果一个在 (ntimes n) 矩阵上指定并受制于 m 个线性约束的稀疏半定式程序(SDP)有一个具有小树宽的总稀疏性图 G、那么和弦转换有时会让内点法在每次迭代时只需要 (O(m+n)) 时间就能求解 SDP,这比直接应用内点法每次迭代所需的(varOmega (n^{3})) 时间大大加快了速度。不幸的是,这种加速并不能通过 G 中与 m 和 n 无关的 O(1) 树状宽度来保证,因为对角 SDP 的树状宽度为零,但每次迭代仍然需要多达 (varOmega (n^{3})) 的时间。相反,我们通过强迫每个约束矩阵 (A_{i}) 成为 G 中自己的小块,来构建一个扩展的集合稀疏性图 (overline{G}supseteq G) 。我们证明,(overline{G})中的小树宽确实可以保证弦变换在每次迭代中以(O(m+n))时间求解 SDP,最多以(O(sqrt{m+n}log (1/epsilon ))) 次迭代达到(epsilon)精度。这个充分条件涵盖了和弦转换的许多成功应用,包括 MAX-k-CUT 松弛、Lovász theta 问题、传感器网络定位、多项式优化和交流最优功率流松弛,从而使理论与实践经验相匹配。
{"title":"Complexity of chordal conversion for sparse semidefinite programs with small treewidth","authors":"Richard Y. Zhang","doi":"10.1007/s10107-024-02137-5","DOIUrl":"https://doi.org/10.1007/s10107-024-02137-5","url":null,"abstract":"<p>If a sparse semidefinite program (SDP), specified over <span>(ntimes n)</span> matrices and subject to <i>m</i> linear constraints, has an aggregate sparsity graph <i>G</i> with small treewidth, then chordal conversion will sometimes allow an interior-point method to solve the SDP in just <span>(O(m+n))</span> time per-iteration, which is a significant speedup over the <span>(varOmega (n^{3}))</span> time per-iteration for a direct application of the interior-point method. Unfortunately, the speedup is not guaranteed by an <i>O</i>(1) treewidth in <i>G</i> that is independent of <i>m</i> and <i>n</i>, as a diagonal SDP would have treewidth zero but can still necessitate up to <span>(varOmega (n^{3}))</span> time per-iteration. Instead, we construct an extended aggregate sparsity graph <span>(overline{G}supseteq G)</span> by forcing each constraint matrix <span>(A_{i})</span> to be its own clique in <i>G</i>. We prove that a small treewidth in <span>(overline{G})</span> does indeed guarantee that chordal conversion will solve the SDP in <span>(O(m+n))</span> time per-iteration, to <span>(epsilon )</span>-accuracy in at most <span>(O(sqrt{m+n}log (1/epsilon )))</span> iterations. This sufficient condition covers many successful applications of chordal conversion, including the MAX-<i>k</i>-CUT relaxation, the Lovász theta problem, sensor network localization, polynomial optimization, and the AC optimal power flow relaxation, thus allowing theory to match practical experience.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"15 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142258110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recycling valid inequalities for robust combinatorial optimization with budgeted uncertainty 具有预算不确定性的稳健组合优化的循环有效不等式
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-29 DOI: 10.1007/s10107-024-02135-7
Christina Büsing, Timo Gersing, Arie M. C. A. Koster

Robust combinatorial optimization with budgeted uncertainty is one of the most popular approaches for integrating uncertainty into optimization problems. The existence of a compact reformulation for (mixed-integer) linear programs and positive complexity results give the impression that these problems are relatively easy to solve. However, the practical performance of the reformulation is quite poor when solving robust integer problems, in particular due to its weak linear relaxation. To overcome this issue, we propose procedures to derive new classes of valid inequalities for robust combinatorial optimization problems. For this, we recycle valid inequalities of the underlying deterministic problem such that the additional variables from the robust formulation are incorporated. The valid inequalities to be recycled may either be readily available model constraints or actual cutting planes, where we can benefit from decades of research on valid inequalities for classical optimization problems. We first demonstrate the strength of the inequalities theoretically, by proving that recycling yields a facet-defining inequality in many cases, even if the original valid inequality was not facet-defining. Afterwards, we show in an extensive computational study that using recycled inequalities can lead to a significant improvement of the computation time when solving robust optimization problems.

预算不确定性的稳健组合优化是将不确定性纳入优化问题的最流行方法之一。线性(混合整数)程序的紧凑重构和正复杂性结果给人的印象是,这些问题相对容易解决。然而,在求解鲁棒整数问题时,重整计算的实际性能却很差,特别是由于其线性松弛较弱。为了克服这个问题,我们提出了为鲁棒组合优化问题推导新的有效不等式类别的程序。为此,我们回收了基础确定性问题的有效不等式,以便将稳健公式中的额外变量纳入其中。需要回收的有效不等式可以是现成的模型约束,也可以是实际的切割平面,我们可以从几十年来对经典优化问题有效不等式的研究中获益。我们首先从理论上证明了不等式的优势,证明在很多情况下,即使原始有效不等式不是面定义的,循环也能得到面定义不等式。随后,我们通过大量的计算研究表明,在求解鲁棒优化问题时,使用循环不等式可以显著缩短计算时间。
{"title":"Recycling valid inequalities for robust combinatorial optimization with budgeted uncertainty","authors":"Christina Büsing, Timo Gersing, Arie M. C. A. Koster","doi":"10.1007/s10107-024-02135-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02135-7","url":null,"abstract":"<p>Robust combinatorial optimization with budgeted uncertainty is one of the most popular approaches for integrating uncertainty into optimization problems. The existence of a compact reformulation for (mixed-integer) linear programs and positive complexity results give the impression that these problems are relatively easy to solve. However, the practical performance of the reformulation is quite poor when solving robust integer problems, in particular due to its weak linear relaxation. To overcome this issue, we propose procedures to derive new classes of valid inequalities for robust combinatorial optimization problems. For this, we recycle valid inequalities of the underlying deterministic problem such that the additional variables from the robust formulation are incorporated. The valid inequalities to be recycled may either be readily available model constraints or actual cutting planes, where we can benefit from decades of research on valid inequalities for classical optimization problems. We first demonstrate the strength of the inequalities theoretically, by proving that recycling yields a facet-defining inequality in many cases, even if the original valid inequality was not facet-defining. Afterwards, we show in an extensive computational study that using recycled inequalities can lead to a significant improvement of the computation time when solving robust optimization problems.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"152 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerated stochastic approximation with state-dependent noise 带有状态相关噪声的加速随机逼近
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-27 DOI: 10.1007/s10107-024-02138-4
Sasila Ilandarideva, Anatoli Juditsky, Guanghui Lan, Tianjiao Li

We consider a class of stochastic smooth convex optimization problems under rather general assumptions on the noise in the stochastic gradient observation. As opposed to the classical problem setting in which the variance of noise is assumed to be uniformly bounded, herein we assume that the variance of stochastic gradients is related to the “sub-optimality” of the approximate solutions delivered by the algorithm. Such problems naturally arise in a variety of applications, in particular, in the well-known generalized linear regression problem in statistics. However, to the best of our knowledge, none of the existing stochastic approximation algorithms for solving this class of problems attain optimality in terms of the dependence on accuracy, problem parameters, and mini-batch size. We discuss two non-Euclidean accelerated stochastic approximation routines—stochastic accelerated gradient descent (SAGD) and stochastic gradient extrapolation (SGE)—which carry a particular duality relationship. We show that both SAGD and SGE, under appropriate conditions, achieve the optimal convergence rate, attaining the optimal iteration and sample complexities simultaneously. However, corresponding assumptions for the SGE algorithm are more general; they allow, for instance, for efficient application of the SGE to statistical estimation problems under heavy tail noises and discontinuous score functions. We also discuss the application of the SGE to problems satisfying quadratic growth conditions, and show how it can be used to recover sparse solutions. Finally, we report on some simulation experiments to illustrate numerical performance of our proposed algorithms in high-dimensional settings.

我们考虑了一类随机平滑凸优化问题,并对随机梯度观测中的噪声作了相当宽泛的假设。在经典问题中,噪声方差被假定为均匀有界,而在这里,我们假定随机梯度的方差与算法提供的近似解的 "次优性 "有关。这种问题自然会在各种应用中出现,特别是在统计学中著名的广义线性回归问题中。然而,据我们所知,现有的解决这类问题的随机近似算法中,没有一种能在精度、问题参数和小批量规模的依赖性方面达到最优。我们讨论了两种非欧几里得加速随机逼近例程--随机加速梯度下降算法(SAGD)和随机梯度外推法(SGE),这两种算法具有特殊的对偶关系。我们的研究表明,在适当条件下,SAGD 和 SGE 都能达到最佳收敛速度,同时获得最佳迭代和样本复杂度。然而,SGE 算法的相应假设更为宽泛;例如,它们允许 SGE 有效地应用于重尾噪声和不连续得分函数下的统计估计问题。我们还讨论了 SGE 在满足二次增长条件的问题中的应用,并展示了它如何用于恢复稀疏解。最后,我们报告了一些模拟实验,以说明我们提出的算法在高维环境下的数值性能。
{"title":"Accelerated stochastic approximation with state-dependent noise","authors":"Sasila Ilandarideva, Anatoli Juditsky, Guanghui Lan, Tianjiao Li","doi":"10.1007/s10107-024-02138-4","DOIUrl":"https://doi.org/10.1007/s10107-024-02138-4","url":null,"abstract":"<p>We consider a class of stochastic smooth convex optimization problems under rather general assumptions on the noise in the stochastic gradient observation. As opposed to the classical problem setting in which the variance of noise is assumed to be uniformly bounded, herein we assume that the variance of stochastic gradients is related to the “sub-optimality” of the approximate solutions delivered by the algorithm. Such problems naturally arise in a variety of applications, in particular, in the well-known generalized linear regression problem in statistics. However, to the best of our knowledge, none of the existing stochastic approximation algorithms for solving this class of problems attain optimality in terms of the dependence on accuracy, problem parameters, and mini-batch size. We discuss two non-Euclidean accelerated stochastic approximation routines—stochastic accelerated gradient descent (SAGD) and stochastic gradient extrapolation (SGE)—which carry a particular duality relationship. We show that both SAGD and SGE, under appropriate conditions, achieve the optimal convergence rate, attaining the optimal iteration and sample complexities simultaneously. However, corresponding assumptions for the SGE algorithm are more general; they allow, for instance, for efficient application of the SGE to statistical estimation problems under heavy tail noises and discontinuous score functions. We also discuss the application of the SGE to problems satisfying quadratic growth conditions, and show how it can be used to recover sparse solutions. Finally, we report on some simulation experiments to illustrate numerical performance of our proposed algorithms in high-dimensional settings.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"62 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A fast combinatorial algorithm for the bilevel knapsack problem with interdiction constraints 带拦截约束的双层knapsack问题的快速组合算法
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-22 DOI: 10.1007/s10107-024-02133-9
Noah Weninger, Ricardo Fukasawa

We consider the bilevel knapsack problem with interdiction constraints, a fundamental bilevel integer programming problem which generalizes the 0–1 knapsack problem. In this problem, there are two knapsacks and n items. The objective is to select some items to pack into the first knapsack such that the maximum profit attainable from packing some of the remaining items into the second knapsack is minimized. We present a combinatorial branch-and-bound algorithm which outperforms the current state-of-the-art solution method in computational experiments for 99% of the instances reported in the literature. On many of the harder instances, our algorithm is orders of magnitude faster, which enabled it to solve 53 of the 72 previously unsolved instances. Our result relies fundamentally on a new dynamic programming algorithm which computes very strong lower bounds. This dynamic program solves a relaxation of the problem from bilevel to 2n-level where the items are processed in an online fashion. The relaxation is easier to solve but approximates the original problem surprisingly well in practice. We believe that this same technique may be useful for other interdiction problems.

我们考虑的是带拦截约束的双层背包问题,这是一个基本的双层整数编程问题,是 0-1 背包问题的一般化。在这个问题中,有两个背包和 n 个物品。问题的目标是选择一些物品装入第一个背包,使得将剩余物品装入第二个背包所获得的最大利润最小。我们提出了一种分支与边界组合算法,在文献报道的 99% 的实例中,该算法在计算实验中的表现优于目前最先进的求解方法。在许多较难的实例中,我们的算法要快上几个数量级,这使得它能够解决 72 个以前未解决的实例中的 53 个。我们的成果从根本上依赖于一种新的动态编程算法,它能计算出非常强的下限。该动态程序将问题从双级放宽到 2n 级,在 2n 级中,项目以在线方式处理。这种松弛更容易求解,但在实践中却能出人意料地逼近原始问题。我们相信,同样的技术对其他拦截问题也很有用。
{"title":"A fast combinatorial algorithm for the bilevel knapsack problem with interdiction constraints","authors":"Noah Weninger, Ricardo Fukasawa","doi":"10.1007/s10107-024-02133-9","DOIUrl":"https://doi.org/10.1007/s10107-024-02133-9","url":null,"abstract":"<p>We consider the bilevel knapsack problem with interdiction constraints, a fundamental bilevel integer programming problem which generalizes the 0–1 knapsack problem. In this problem, there are two knapsacks and <i>n</i> items. The objective is to select some items to pack into the first knapsack such that the maximum profit attainable from packing some of the remaining items into the second knapsack is minimized. We present a combinatorial branch-and-bound algorithm which outperforms the current state-of-the-art solution method in computational experiments for 99% of the instances reported in the literature. On many of the harder instances, our algorithm is orders of magnitude faster, which enabled it to solve 53 of the 72 previously unsolved instances. Our result relies fundamentally on a new dynamic programming algorithm which computes very strong lower bounds. This dynamic program solves a relaxation of the problem from bilevel to 2<i>n</i>-level where the items are processed in an online fashion. The relaxation is easier to solve but approximates the original problem surprisingly well in practice. We believe that this same technique may be useful for other interdiction problems.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"2 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses 非线性共轭梯度方法:通过计算机辅助分析实现最坏情况收敛率
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-22 DOI: 10.1007/s10107-024-02127-7
Shuvomoy Das Gupta, Robert M. Freund, Xu Andy Sun, Adrien Taylor

We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization, while having relatively incomplete analyses. Using our computer-assisted approach, we establish novel complexity bounds for the Polak-Ribière-Polyak (PRP) and the Fletcher-Reeves (FR) NCGMs for smooth strongly convex minimization. In particular, we construct mathematical proofs that establish the first non-asymptotic convergence bound for FR (which is historically the first developed NCGM), and a much improved non-asymptotic convergence bound for PRP. Additionally, we provide simple adversarial examples on which these methods do not perform better than gradient descent with exact line search, leaving very little room for improvements on the same class of problems.

我们提出了一种计算机辅助方法,用于分析非线性共轭梯度方法(NCGMs)的最坏收敛情况。众所周知,这些方法在大规模优化方面具有普遍良好的经验性能,但分析却相对不完整。利用我们的计算机辅助方法,我们为用于平滑强凸最小化的 Polak-Ribière-Polyak (PRP) 和 Fletcher-Reeves (FR) NCGMs 建立了新的复杂度边界。特别是,我们构建了数学证明,为 FR(历史上第一个开发的 NCGM)建立了第一个非渐近收敛约束,并为 PRP 建立了一个大大改进的非渐近收敛约束。此外,我们还提供了一些简单的对抗性示例,在这些示例中,这些方法的性能并不比使用精确线性搜索的梯度下降法更好,因此在同一类问题上的改进空间很小。
{"title":"Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses","authors":"Shuvomoy Das Gupta, Robert M. Freund, Xu Andy Sun, Adrien Taylor","doi":"10.1007/s10107-024-02127-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02127-7","url":null,"abstract":"<p>We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization, while having relatively incomplete analyses. Using our computer-assisted approach, we establish novel complexity bounds for the Polak-Ribière-Polyak (PRP) and the Fletcher-Reeves (FR) NCGMs for smooth strongly convex minimization. In particular, we construct mathematical proofs that establish the first non-asymptotic convergence bound for FR (which is historically the first developed NCGM), and a much improved non-asymptotic convergence bound for PRP. Additionally, we provide simple adversarial examples on which these methods do not perform better than gradient descent with exact line search, leaving very little room for improvements on the same class of problems.\u0000</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"32 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning augmented branch and bound for mixed integer linear programming 混合整数线性规划的机器学习增强分支与约束
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-22 DOI: 10.1007/s10107-024-02130-y
Lara Scavuzzo, Karen Aardal, Andrea Lodi, Neil Yorke-Smith

Mixed Integer Linear Programming (MILP) is a pillar of mathematical optimization that offers a powerful modeling language for a wide range of applications. The main engine for solving MILPs is the branch-and-bound algorithm. Adding to the enormous algorithmic progress in MILP solving of the past decades, in more recent years there has been an explosive development in the use of machine learning for enhancing all main tasks involved in the branch-and-bound algorithm. These include primal heuristics, branching, cutting planes, node selection and solver configuration decisions. This article presents a survey of such approaches, addressing the vision of integration of machine learning and mathematical optimization as complementary technologies, and how this integration can benefit MILP solving. In particular, we give detailed attention to machine learning algorithms that automatically optimize some metric of branch-and-bound efficiency. We also address appropriate MILP representations, benchmarks and software tools used in the context of applying learning algorithms.

混合整数线性规划(MILP)是数学优化的支柱,为广泛的应用提供了强大的建模语言。求解 MILP 的主要引擎是分支与边界算法。在过去几十年 MILP 求解算法取得巨大进步的基础上,近年来,机器学习在增强分支与边界算法所涉及的所有主要任务方面取得了爆炸性的发展。这些任务包括原始启发式、分支、切割平面、节点选择和求解器配置决策。本文概述了这些方法,探讨了机器学习与数学优化作为互补技术进行整合的前景,以及这种整合如何有利于 MILP 求解。特别是,我们详细介绍了自动优化分支边界效率指标的机器学习算法。我们还讨论了在应用学习算法时使用的适当 MILP 表示法、基准和软件工具。
{"title":"Machine learning augmented branch and bound for mixed integer linear programming","authors":"Lara Scavuzzo, Karen Aardal, Andrea Lodi, Neil Yorke-Smith","doi":"10.1007/s10107-024-02130-y","DOIUrl":"https://doi.org/10.1007/s10107-024-02130-y","url":null,"abstract":"<p>Mixed Integer Linear Programming (MILP) is a pillar of mathematical optimization that offers a powerful modeling language for a wide range of applications. The main engine for solving MILPs is the branch-and-bound algorithm. Adding to the enormous algorithmic progress in MILP solving of the past decades, in more recent years there has been an explosive development in the use of machine learning for enhancing all main tasks involved in the branch-and-bound algorithm. These include primal heuristics, branching, cutting planes, node selection and solver configuration decisions. This article presents a survey of such approaches, addressing the vision of integration of machine learning and mathematical optimization as complementary technologies, and how this integration can benefit MILP solving. In particular, we give detailed attention to machine learning algorithms that automatically optimize some metric of branch-and-bound efficiency. We also address appropriate MILP representations, benchmarks and software tools used in the context of applying learning algorithms.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"6 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the strength of Lagrangian duality in multiobjective integer programming 论多目标整数编程中拉格朗日对偶性的强度
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-20 DOI: 10.1007/s10107-024-02121-z
Matthew Brun, Tyler Perini, Saumya Sinha, Andrew J. Schaefer

This paper investigates the potential of Lagrangian relaxations to generate quality bounds on non-dominated images of multiobjective integer programs (MOIPs). Under some conditions on the relaxed constraints, we show that a set of Lagrangian relaxations can provide bounds that coincide with every bound generated by the convex hull relaxation. We also provide a guarantee of the relative quality of the Lagrangian bound at unsupported solutions. These results imply that, if the relaxed feasible region is bounded, some Lagrangian bounds will be strictly better than some convex hull bounds. We demonstrate that there exist Lagrangian multipliers which are sparse, satisfy a complementary slackness property, and generate tight relaxations at supported solutions. However, if all constraints are dualized, a relaxation can never be tight at an unsupported solution. These results characterize the strength of the Lagrangian dual at efficient solutions of an MOIP.

本文研究了拉格朗日松弛在生成多目标整数程序(MOIP)非主图像质量约束方面的潜力。在松弛约束的某些条件下,我们证明了一组拉格朗日松弛可以提供与凸壳松弛产生的每个约束重合的约束。我们还为无支撑解的拉格朗日约束的相对质量提供了保证。这些结果意味着,如果松弛的可行区域是有界的,那么某些拉格朗日约束将严格优于某些凸壳约束。我们证明,存在稀疏的、满足互补松弛特性的拉格朗日乘子,并能在有支撑解处产生紧松弛。然而,如果所有约束条件都是二元化的,那么在无支撑解处的松弛就永远不会紧密。这些结果说明了在 MOIP 的有效解中拉格朗日对偶的强度。
{"title":"On the strength of Lagrangian duality in multiobjective integer programming","authors":"Matthew Brun, Tyler Perini, Saumya Sinha, Andrew J. Schaefer","doi":"10.1007/s10107-024-02121-z","DOIUrl":"https://doi.org/10.1007/s10107-024-02121-z","url":null,"abstract":"<p>This paper investigates the potential of Lagrangian relaxations to generate quality bounds on non-dominated images of multiobjective integer programs (MOIPs). Under some conditions on the relaxed constraints, we show that a set of Lagrangian relaxations can provide bounds that coincide with every bound generated by the convex hull relaxation. We also provide a guarantee of the relative quality of the Lagrangian bound at unsupported solutions. These results imply that, if the relaxed feasible region is bounded, some Lagrangian bounds will be strictly better than some convex hull bounds. We demonstrate that there exist Lagrangian multipliers which are sparse, satisfy a complementary slackness property, and generate tight relaxations at supported solutions. However, if all constraints are dualized, a relaxation can never be tight at an unsupported solution. These results characterize the strength of the Lagrangian dual at efficient solutions of an MOIP.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"23 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convexification techniques for fractional programs 分数程序的凸化技术
IF 2.7 2区 数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-08-16 DOI: 10.1007/s10107-024-02131-x
Taotao He, Siyue Liu, Mohit Tawarmalani

This paper develops a correspondence relating convex hulls of fractional functions with those of polynomial functions over the same domain. Using this result, we develop a number of new reformulations and relaxations for fractional programming problems. First, we relate (0mathord {-}1) problems involving a ratio of affine functions with the boolean quadric polytope, and use inequalities for the latter to develop tighter formulations for the former. Second, we derive a new formulation to optimize a ratio of quadratic functions over a polytope using copositive programming. Third, we show that univariate fractional functions can be convexified using moment hulls. Fourth, we develop a new hierarchy of relaxations that converges finitely to the simultaneous convex hull of a collection of ratios of affine functions of (0mathord {-}1) variables. Finally, we demonstrate theoretically and computationally that our techniques close a significant gap relative to state-of-the-art relaxations, require much less computational effort, and can solve larger problem instances.

本文提出了分式函数的凸壳与同一域上多项式函数的凸壳之间的对应关系。利用这一结果,我们为分式编程问题开发了许多新的重构和松弛方法。首先,我们将涉及仿射函数之比的(0mathord {-}1)问题与布尔二次多面体联系起来,并利用后者的不等式为前者建立了更严密的公式。其次,我们推导出一种新的公式,利用共正编程优化多面体上的二次函数之比。第三,我们证明了单变量分式函数可以利用矩壳进行凸化。第四,我们开发了一种新的松弛层次,它可以有限地收敛到 (0mathord {-}1) 变量的仿射函数比率集合的同时凸壳。最后,我们从理论和计算上证明,我们的技术与最先进的松弛技术相比缩小了很大差距,所需的计算量也小得多,而且可以解决更大的问题实例。
{"title":"Convexification techniques for fractional programs","authors":"Taotao He, Siyue Liu, Mohit Tawarmalani","doi":"10.1007/s10107-024-02131-x","DOIUrl":"https://doi.org/10.1007/s10107-024-02131-x","url":null,"abstract":"<p>This paper develops a correspondence relating convex hulls of fractional functions with those of polynomial functions over the same domain. Using this result, we develop a number of new reformulations and relaxations for fractional programming problems. First, we relate <span>(0mathord {-}1)</span> problems involving a ratio of affine functions with the boolean quadric polytope, and use inequalities for the latter to develop tighter formulations for the former. Second, we derive a new formulation to optimize a ratio of quadratic functions over a polytope using copositive programming. Third, we show that univariate fractional functions can be convexified using moment hulls. Fourth, we develop a new hierarchy of relaxations that converges finitely to the simultaneous convex hull of a collection of ratios of affine functions of <span>(0mathord {-}1)</span> variables. Finally, we demonstrate theoretically and computationally that our techniques close a significant gap relative to state-of-the-art relaxations, require much less computational effort, and can solve larger problem instances.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"33 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142183167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mathematical Programming
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1