Mathematical Programming最新文献_第3页

A trust region-type normal map-based semismooth Newton method for nonsmooth nonconvex composite optimization 基于信任区域型法线图的非平滑非凸复合优化半平滑牛顿法

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-22 DOI: 10.1007/s10107-024-02110-2

Wenqing Ouyang, Andre Milzarek

We propose a novel trust region method for solving a class of nonsmooth, nonconvex composite-type optimization problems. The approach embeds inexact semismooth Newton steps for finding zeros of a normal map-based stationarity measure for the problem in a trust region framework. Based on a new merit function and acceptance mechanism, global convergence and transition to fast local q-superlinear convergence are established under standard conditions. In addition, we verify that the proposed trust region globalization is compatible with the Kurdyka–Łojasiewicz inequality yielding finer convergence results. Experiments on sparse logistic regression, image compression, and a constrained log-determinant problem illustrate the efficiency of the proposed algorithm.

我们提出了一种新颖的信任区域方法，用于解决一类非光滑、非凸的复合型优化问题。该方法在信任区域框架中嵌入了不精确的半光滑牛顿步骤，用于为问题寻找基于正态图的静止度量的零点。基于新的优点函数和接受机制，在标准条件下建立了全局收敛和向快速局部 q 超线性收敛的过渡。此外，我们还验证了所提出的信任区域全局化与 Kurdyka-Łojasiewicz 不等式兼容，从而获得了更精细的收敛结果。稀疏对数回归、图像压缩和受约束对数确定性问题的实验说明了所提算法的效率。

引用次数: 0

Competitive kill-and-restart and preemptive strategies for non-clairvoyant scheduling 非千里眼调度的竞争性杀死-重启和抢先策略

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-22 DOI: 10.1007/s10107-024-02118-8

Sven Jäger, Guillaume Sagnol, Daniel Schmidt genannt Waldschmidt, Philipp Warode

We study kill-and-restart and preemptive strategies for the fundamental scheduling problem of minimizing the sum of weighted completion times on a single machine in the non-clairvoyant setting. First, we show a lower bound of 3 for any deterministic non-clairvoyant kill-and-restart strategy. Then, we give for any (b > 1) a tight analysis for the natural b-scaling kill-and-restart strategy as well as for a randomized variant of it. In particular, we show a competitive ratio of ((1+3sqrt{3})approx 6.197) for the deterministic and of (approx 3.032) for the randomized strategy, by making use of the largest eigenvalue of a Toeplitz matrix. In addition, we show that the preemptive Weighted Shortest Elapsed Time First (WSETF) rule is 2-competitive when jobs are released online, matching the lower bound for the unit weight case with trivial release dates for any non-clairvoyant algorithm. Using this result as well as the competitiveness of round-robin for multiple machines, we prove performance guarantees smaller than 10 for adaptions of the b-scaling strategy to online release dates and unweighted jobs on identical parallel machines.

我们针对基本调度问题--在非千里眼环境下最小化单台机器上的加权完成时间之和--研究了 "杀死-重启 "和 "抢占 "策略。首先，我们展示了任何确定性非千里眼杀机重启策略的下限为 3。然后，我们给出了对于任意 (b > 1) 自然 b 缩放杀毒-重启策略及其随机变体的严密分析。特别是，通过使用托普利兹矩阵的最大特征值，我们展示了确定性策略的竞争比为（（1+3sqrt{3}）约6.197），随机策略的竞争比为（约3.032）。此外，我们还证明，当作业在线发布时，抢占式加权最短耗时优先（WSETF）规则具有 2 重竞争性，这与任何非千里眼算法在发布日期琐碎的单位权重情况下的下限相匹配。利用这一结果以及多机轮循的竞争性，我们证明了在相同并行机器上，b-scaling 策略适应在线发布日期和非加权作业的性能保证小于 10。

{"title":"Competitive kill-and-restart and preemptive strategies for non-clairvoyant scheduling","authors":"Sven Jäger, Guillaume Sagnol, Daniel Schmidt genannt Waldschmidt, Philipp Warode","doi":"10.1007/s10107-024-02118-8","DOIUrl":"https://doi.org/10.1007/s10107-024-02118-8","url":null,"abstract":"We study kill-and-restart and preemptive strategies for the fundamental scheduling problem of minimizing the sum of weighted completion times on a single machine in the non-clairvoyant setting. First, we show a lower bound of 3 for any deterministic non-clairvoyant kill-and-restart strategy. Then, we give for any (b > 1) a tight analysis for the natural b-scaling kill-and-restart strategy as well as for a randomized variant of it. In particular, we show a competitive ratio of ((1+3sqrt{3})approx 6.197) for the deterministic and of (approx 3.032) for the randomized strategy, by making use of the largest eigenvalue of a Toeplitz matrix. In addition, we show that the preemptive Weighted Shortest Elapsed Time First (WSETF) rule is 2-competitive when jobs are released online, matching the lower bound for the unit weight case with trivial release dates for any non-clairvoyant algorithm. Using this result as well as the competitiveness of round-robin for multiple machines, we prove performance guarantees smaller than 10 for adaptions of the b-scaling strategy to online release dates and unweighted jobs on identical parallel machines.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"26 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141746019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On the geometry and refined rate of primal–dual hybrid gradient for linear programming 论线性规划的初等-双重混合梯度的几何形状和精炼率

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-17 DOI: 10.1007/s10107-024-02109-9

Haihao Lu, Jinwen Yang

We study the convergence behaviors of primal–dual hybrid gradient (PDHG) for solving linear programming (LP). PDHG is the base algorithm of a new general-purpose first-order method LP solver, PDLP, which aims to scale up LP by taking advantage of modern computing architectures. Despite its numerical success, the theoretical understanding of PDHG for LP is still very limited; the previous complexity result relies on the global Hoffman constant of the KKT system, which is known to be very loose and uninformative. In this work, we aim to develop a fundamental understanding of the convergence behaviors of PDHG for LP and to develop a refined complexity rate that does not rely on the global Hoffman constant. We show that there are two major stages of PDHG for LP: in Stage I, PDHG identifies active variables and the length of the first stage is driven by a certain quantity which measures how close the non-degeneracy part of the LP instance is to degeneracy; in Stage II, PDHG effectively solves a homogeneous linear inequality system, and the complexity of the second stage is driven by a well-behaved local sharpness constant of the system. This finding is closely related to the concept of partial smoothness in non-smooth optimization, and it is the first complexity result of finite time identification without the non-degeneracy assumption. An interesting implication of our results is that degeneracy itself does not slow down the convergence of PDHG for LP, but near-degeneracy does.

我们研究了求解线性规划（LP）的原始双混合梯度（PDHG）的收敛行为。PDHG 是一种新型通用一阶法 LP 求解器 PDLP 的基础算法，其目的是利用现代计算架构的优势来扩展 LP。尽管在数值上取得了成功，但人们对 PDHG 用于 LP 的理论理解仍然非常有限；之前的复杂度结果依赖于 KKT 系统的全局霍夫曼常数，而众所周知，该常数非常松散，且信息量不大。在这项工作中，我们旨在从根本上理解 LP 的 PDHG 收敛行为，并开发出一种不依赖全局霍夫曼常数的精细复杂度率。我们证明了 LP 的 PDHG 有两个主要阶段：在第一阶段，PDHG 识别活动变量，第一阶段的长度受某个量的驱动，该量衡量 LP 实例的非退化部分与退化的接近程度；在第二阶段，PDHG 有效地求解了一个同质线性不等式系统，第二阶段的复杂度受该系统的一个良好的局部锐度常数的驱动。这一发现与非光滑优化中的局部光滑性概念密切相关，也是第一个没有非退化假设的有限时间辨识复杂性结果。我们的结果还有一个有趣的含义，即退化本身并不会减慢 LP 的 PDHG 收敛速度，但接近退化却会。

{"title":"On the geometry and refined rate of primal–dual hybrid gradient for linear programming","authors":"Haihao Lu, Jinwen Yang","doi":"10.1007/s10107-024-02109-9","DOIUrl":"https://doi.org/10.1007/s10107-024-02109-9","url":null,"abstract":"We study the convergence behaviors of primal–dual hybrid gradient (PDHG) for solving linear programming (LP). PDHG is the base algorithm of a new general-purpose first-order method LP solver, PDLP, which aims to scale up LP by taking advantage of modern computing architectures. Despite its numerical success, the theoretical understanding of PDHG for LP is still very limited; the previous complexity result relies on the global Hoffman constant of the KKT system, which is known to be very loose and uninformative. In this work, we aim to develop a fundamental understanding of the convergence behaviors of PDHG for LP and to develop a refined complexity rate that does not rely on the global Hoffman constant. We show that there are two major stages of PDHG for LP: in Stage I, PDHG identifies active variables and the length of the first stage is driven by a certain quantity which measures how close the non-degeneracy part of the LP instance is to degeneracy; in Stage II, PDHG effectively solves a homogeneous linear inequality system, and the complexity of the second stage is driven by a well-behaved local sharpness constant of the system. This finding is closely related to the concept of partial smoothness in non-smooth optimization, and it is the first complexity result of finite time identification without the non-degeneracy assumption. An interesting implication of our results is that degeneracy itself does not slow down the convergence of PDHG for LP, but near-degeneracy does.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"30 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Distributional utility preference robust optimization models in multi-attribute decision making 多属性决策中的分配效用偏好稳健优化模型

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-17 DOI: 10.1007/s10107-024-02114-y

Jian Hu, Dali Zhang, Huifu Xu, Sainan Zhang

Utility preference robust optimization (PRO) has recently been proposed to deal with optimal decision-making problems where the decision maker’s (DM’s) preference over gains and losses is ambiguous. In this paper, we take a step further to investigate the case that the DM’s preference is random. We propose to use a random utility function to describe the DM’s preference and develop distributional utility preference robust optimization (DUPRO) models when the distribution of the random utility function is ambiguous. We concentrate on data-driven problems where samples of the random parameters are obtainable but the sample size may be relatively small. In the case when the random utility functions are of piecewise linear structure, we propose a bootstrap method to construct the ambiguity set and demonstrate how the resulting DUPRO can be solved by a mixed-integer linear program. The piecewise linear structure is versatile in its ability to incorporate classical non-parametric utility assessment methods into the sample generation of a random utility function. Next, we expand the proposed DUPRO models and computational schemes to address general cases where the random utility functions are not necessarily piecewise linear. We show how the DUPRO models with piecewise linear random utility functions can serve as approximations for the DUPRO models with general random utility functions and allow us to quantify the approximation errors. Finally, we carry out some performance studies of the proposed bootstrap-based DUPRO model and report the preliminary numerical test results. This paper is the first attempt to use distributionally robust optimization methods for PRO problems.

最近，有人提出了效用偏好稳健优化法（PRO），用于处理决策者（DM）对收益和损失的偏好不明确的最优决策问题。在本文中，我们将更进一步研究决策者偏好随机的情况。我们建议使用随机效用函数来描述 DM 的偏好，并开发了随机效用函数分布不明确时的分布效用偏好稳健优化（DUPRO）模型。我们专注于数据驱动的问题，在这种情况下，随机参数的样本是可以获得的，但样本量可能相对较小。在随机效用函数为片断线性结构的情况下，我们提出了一种自举法来构建模糊集，并演示了如何通过混合整数线性规划来求解所得到的 DUPRO。片断线性结构用途广泛，能将经典的非参数效用评估方法纳入随机效用函数的样本生成中。接下来，我们将扩展所提出的 DUPRO 模型和计算方案，以解决随机效用函数不一定是片线性的一般情况。我们展示了具有片线性随机效用函数的 DUPRO 模型如何作为具有一般随机效用函数的 DUPRO 模型的近似值，并允许我们量化近似误差。最后，我们对所提出的基于引导的 DUPRO 模型进行了一些性能研究，并报告了初步的数值测试结果。本文是将分布稳健优化方法用于 PRO 问题的首次尝试。

{"title":"Distributional utility preference robust optimization models in multi-attribute decision making","authors":"Jian Hu, Dali Zhang, Huifu Xu, Sainan Zhang","doi":"10.1007/s10107-024-02114-y","DOIUrl":"https://doi.org/10.1007/s10107-024-02114-y","url":null,"abstract":"Utility preference robust optimization (PRO) has recently been proposed to deal with optimal decision-making problems where the decision maker’s (DM’s) preference over gains and losses is ambiguous. In this paper, we take a step further to investigate the case that the DM’s preference is random. We propose to use a random utility function to describe the DM’s preference and develop distributional utility preference robust optimization (DUPRO) models when the distribution of the random utility function is ambiguous. We concentrate on data-driven problems where samples of the random parameters are obtainable but the sample size may be relatively small. In the case when the random utility functions are of piecewise linear structure, we propose a bootstrap method to construct the ambiguity set and demonstrate how the resulting DUPRO can be solved by a mixed-integer linear program. The piecewise linear structure is versatile in its ability to incorporate classical non-parametric utility assessment methods into the sample generation of a random utility function. Next, we expand the proposed DUPRO models and computational schemes to address general cases where the random utility functions are not necessarily piecewise linear. We show how the DUPRO models with piecewise linear random utility functions can serve as approximations for the DUPRO models with general random utility functions and allow us to quantify the approximation errors. Finally, we carry out some performance studies of the proposed bootstrap-based DUPRO model and report the preliminary numerical test results. This paper is the first attempt to use distributionally robust optimization methods for PRO problems.\u0000","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"31 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

New notions of simultaneous diagonalizability of quadratic forms with applications to QCQPs 二次型同时对角化的新概念及其在 QCQPs 中的应用

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-17 DOI: 10.1007/s10107-024-02120-0

Alex L. Wang, Rujun Jiang

A set of quadratic forms is simultaneously diagonalizable via congruence (SDC) if there exists a basis under which each of the quadratic forms is diagonal. This property appears naturally when analyzing quadratically constrained quadratic programs (QCQPs) and has important implications in globally solving such problems using branch-and-bound methods. This paper extends the reach of the SDC property by studying two new weaker notions of simultaneous diagonalizability. Specifically, we say that a set of quadratic forms is almost SDC (ASDC) if it is the limit of SDC sets and d-restricted SDC (d-RSDC) if it is the restriction of an SDC set in up to d-many additional dimensions. In the context of QCQPs, these properties correspond to problems that may be diagonalized after arbitrarily small perturbations or after the introduction of d additional variables. Our main contributions are complete characterizations of the ASDC pairs and nonsingular triples of symmetric matrices, as well as a sufficient condition for the 1-RSDC property for pairs of symmetric matrices. Surprisingly, we show that every singular symmetric pair is ASDC and that almost every symmetric pair is 1-RSDC. We accompany our theoretical results with preliminary numerical experiments applying these constructions to solve QCQPs within branch-and-bound schemes.

如果存在这样一个基础，即每个二次型都是对角的，那么一组二次型就同时可通过全等对角（SDC）。这一性质在分析二次受限二次方程程序（QCQPs）时自然出现，并对使用分支约束法全局求解此类问题具有重要意义。本文通过研究两个新的较弱的同时对角化概念，扩展了 SDC 特性的范围。具体来说，如果一个二次型集合是 SDC 集合的极限，我们就说它几乎是 SDC (ASDC)；如果它是 SDC 集合在多达 d 个额外维度上的限制，我们就说它是 d 限制 SDC (d-RSDC)。在 QCQPs 的背景下，这些性质对应于经过任意小的扰动或引入 d 个额外变量后可以对角化的问题。我们的主要贡献是完整描述了对称矩阵的 ASDC 对和非奇异三元组，以及对称矩阵对的 1-RSDC 属性的充分条件。令人惊讶的是，我们证明了每个奇异对称对都是 ASDC，而且几乎每个对称对都是 1-RSDC。在得出理论结果的同时，我们还进行了初步的数值实验，将这些构造应用于在分支与边界方案中求解 QCQP。

{"title":"New notions of simultaneous diagonalizability of quadratic forms with applications to QCQPs","authors":"Alex L. Wang, Rujun Jiang","doi":"10.1007/s10107-024-02120-0","DOIUrl":"https://doi.org/10.1007/s10107-024-02120-0","url":null,"abstract":"A set of quadratic forms is simultaneously diagonalizable via congruence (SDC) if there exists a basis under which each of the quadratic forms is diagonal. This property appears naturally when analyzing quadratically constrained quadratic programs (QCQPs) and has important implications in globally solving such problems using branch-and-bound methods. This paper extends the reach of the SDC property by studying two new weaker notions of simultaneous diagonalizability. Specifically, we say that a set of quadratic forms is almost SDC (ASDC) if it is the limit of SDC sets and d-restricted SDC (d-RSDC) if it is the restriction of an SDC set in up to d-many additional dimensions. In the context of QCQPs, these properties correspond to problems that may be diagonalized after arbitrarily small perturbations or after the introduction of d additional variables. Our main contributions are complete characterizations of the ASDC pairs and nonsingular triples of symmetric matrices, as well as a sufficient condition for the 1-RSDC property for pairs of symmetric matrices. Surprisingly, we show that every singular symmetric pair is ASDC and that almost every symmetric pair is 1-RSDC. We accompany our theoretical results with preliminary numerical experiments applying these constructions to solve QCQPs within branch-and-bound schemes.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"44 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Efficient separation of RLT cuts for implicit and explicit bilinear terms 隐式和显式双线性项的 RLT 切分的高效分离

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-16 DOI: 10.1007/s10107-024-02104-0

Ksenia Bestuzheva, Ambros Gleixner, Tobias Achterberg

The reformulation–linearization technique (RLT) is a prominent approach to constructing tight linear relaxations of non-convex continuous and mixed-integer optimization problems. The goal of this paper is to extend the applicability and improve the performance of RLT for bilinear product relations. First, we present a method for detecting bilinear product relations implicitly contained in mixed-integer linear programs, which is based on analyzing linear constraints with binary variables, thus enabling the application of bilinear RLT to a new class of problems. Strategies for filtering product relations are discussed and tested. Our second contribution addresses the high computational cost of RLT cut separation, which presents one of the major difficulties in applying RLT efficiently in practice. We propose a new RLT cutting plane separation algorithm which identifies combinations of linear constraints and bound factors that are expected to yield an inequality that is violated by the current relaxation solution. This algorithm is applicable to RLT cuts generated for all types of bilinear terms, including but not limited to the detected implicit products. A detailed computational study based on independent implementations in two solvers evaluates the performance impact of the proposed methods.

重拟线性化技术（RLT）是构建非凸连续和混合整数优化问题紧密线性松弛的一种重要方法。本文的目标是扩展 RLT 在双线性积关系中的适用性并提高其性能。首先，我们提出了一种检测混合整数线性程序中隐含的双线性乘积关系的方法，该方法基于对二元变量线性约束的分析，从而使双线性 RLT 能够应用于一类新问题。我们还讨论并测试了过滤乘积关系的策略。我们的第二个贡献是解决了 RLT 切分的高计算成本问题，这也是在实践中有效应用 RLT 的主要困难之一。我们提出了一种新的 RLT 切面分离算法，它能识别线性约束和约束因子的组合，这些组合预计会产生当前松弛解违反的不等式。该算法适用于为所有类型的双线性项生成的 RLT 切分，包括但不限于检测到的隐含乘积。基于两个求解器中独立实现的详细计算研究评估了所提方法对性能的影响。

{"title":"Efficient separation of RLT cuts for implicit and explicit bilinear terms","authors":"Ksenia Bestuzheva, Ambros Gleixner, Tobias Achterberg","doi":"10.1007/s10107-024-02104-0","DOIUrl":"https://doi.org/10.1007/s10107-024-02104-0","url":null,"abstract":"The reformulation–linearization technique (RLT) is a prominent approach to constructing tight linear relaxations of non-convex continuous and mixed-integer optimization problems. The goal of this paper is to extend the applicability and improve the performance of RLT for bilinear product relations. First, we present a method for detecting bilinear product relations implicitly contained in mixed-integer linear programs, which is based on analyzing linear constraints with binary variables, thus enabling the application of bilinear RLT to a new class of problems. Strategies for filtering product relations are discussed and tested. Our second contribution addresses the high computational cost of RLT cut separation, which presents one of the major difficulties in applying RLT efficiently in practice. We propose a new RLT cutting plane separation algorithm which identifies combinations of linear constraints and bound factors that are expected to yield an inequality that is violated by the current relaxation solution. This algorithm is applicable to RLT cuts generated for all types of bilinear terms, including but not limited to the detected implicit products. A detailed computational study based on independent implementations in two solvers evaluates the performance impact of the proposed methods.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"26 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141722007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Structural iterative rounding for generalized k-median problems 广义 K 中值问题的结构性迭代舍入

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-10 DOI: 10.1007/s10107-024-02119-7

Anupam Gupta, Benjamin Moseley, Rudy Zhou

This paper considers approximation algorithms for generalized k-median problems. These problems can be informally described as k-median with a constant number of extra constraints, and includes k-median with outliers, and knapsack median. Our first contribution is a pseudo-approximation algorithm for generalized k-median that outputs a 6.387-approximate solution, with a constant number of fractional variables. The algorithm builds on the iterative rounding framework introduced by Krishnaswamy, Li, and Sandeep for k-median with outliers as reported (Krishnaswamy et al. in: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018). The main technical innovation is allowing richer constraint sets in the iterative rounding and using the structure of the resulting extreme points. Using our pseudo-approximation algorithm, we give improved approximation algorithms for k-median with outliers and knapsack median. This involves combining our pseudo-approximation with pre- and post-processing steps to round a constant number of fractional variables at a small increase in cost. Our algorithms achieve approximation ratios (6.994 + epsilon ) and (6.387 + epsilon ) for k-median with outliers and knapsack median, respectively. These improve on the best-known approximation ratio (7.081 + epsilon ) for both problems as reported (Krishnaswamy et al. in: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018).

本文探讨了广义 k 中值问题的近似算法。这些问题可以被非正式地描述为带有恒定数量额外约束的 k-中值问题，包括带有异常值的 k-中值问题和 knapsack 中值问题。我们的第一个贡献是针对广义 k 中值问题提出了一种伪近似算法，它能在分数变量数量不变的情况下输出 6.387 近似解。该算法建立在 Krishnaswamy、Li 和 Sandeep 针对有离群值的 k-median 引入的迭代舍入框架基础上（Krishnaswamy et al：第 50 届 ACM SIGACT 计算理论年度研讨会论文集，2018 年）。主要的技术创新在于允许在迭代舍入中使用更丰富的约束集，并使用由此产生的极值点结构。利用我们的伪逼近算法，我们给出了带离群值的 k-median 和 knapsack median 的改进逼近算法。这包括将我们的伪逼近算法与前处理和后处理步骤相结合，以较小的成本增加对一定数量的小数变量进行舍入。我们的算法分别实现了 k-median with outliers 和 knapsack median 的近似率（6.994 + epsilon ）和（6.387 + epsilon ）。对于这两个问题，这些近似比（7.081 + epsilon ）都有所提高（Krishnaswamy et al：第 50 届 ACM SIGACT 计算理论年度研讨会论文集，2018 年）。

{"title":"Structural iterative rounding for generalized k-median problems","authors":"Anupam Gupta, Benjamin Moseley, Rudy Zhou","doi":"10.1007/s10107-024-02119-7","DOIUrl":"https://doi.org/10.1007/s10107-024-02119-7","url":null,"abstract":"This paper considers approximation algorithms for generalized k-median problems. These problems can be informally described as k-median with a constant number of extra constraints, and includes k-median with outliers, and knapsack median. Our first contribution is a pseudo-approximation algorithm for generalized k-median that outputs a 6.387-approximate solution, with a constant number of fractional variables. The algorithm builds on the iterative rounding framework introduced by Krishnaswamy, Li, and Sandeep for k-median with outliers as reported (Krishnaswamy et al. in: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018). The main technical innovation is allowing richer constraint sets in the iterative rounding and using the structure of the resulting extreme points. Using our pseudo-approximation algorithm, we give improved approximation algorithms for k-median with outliers and knapsack median. This involves combining our pseudo-approximation with pre- and post-processing steps to round a constant number of fractional variables at a small increase in cost. Our algorithms achieve approximation ratios (6.994 + epsilon ) and (6.387 + epsilon ) for k-median with outliers and knapsack median, respectively. These improve on the best-known approximation ratio (7.081 + epsilon ) for both problems as reported (Krishnaswamy et al. in: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018).","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"37 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141567652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On circuit diameter bounds via circuit imbalances 通过电路失衡论电路直径边界

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-08 DOI: 10.1007/s10107-024-02107-x

Daniel Dadush, Zhuan Khye Koh, Bento Natura, László A. Végh

We study the circuit diameter of polyhedra, introduced by Borgwardt, Finhold, and Hemmecke (SIAM J. Discrete Math. 29(1), 113–121 (2015)) as a relaxation of the combinatorial diameter. We show that the circuit diameter of a system ({xin mathbb {R}^n:, Ax=b,, mathbb {0}le xle u}) for (Ain mathbb {R}^{mtimes n}) is bounded by (O(mmin {m, n - m}log (m+kappa _A)+nlog n)), where (kappa _A) is the circuit imbalance measure of the constraint matrix. This yields a strongly polynomial circuit diameter bound if e.g., all entries of A have polynomially bounded encoding length in n. Further, we present circuit augmentation algorithms for LPs using the minimum-ratio circuit cancelling rule. Even though the standard minimum-ratio circuit cancelling algorithm is not finite in general, our variant can solve an LP in (O(mn^2log (n+kappa _A))) augmentation steps.

我们研究由 Borgwardt、Finhold 和 Hemmecke（SIAM J. Discrete Math.29(1), 113-121 (2015)）作为组合直径的放松而引入的。我们证明了一个系统的电路直径（{xin mathbb {R}^n：Ax=b,, mathbb {0}le xle u}) for (Ain mathbb {R}^{mtimes n}) is bounded by (O(mmin {m, n - m}log (m+kappa _A)+nlog n)), where (kappa _A) is the circuit imbalance measure of the constraint matrix.如果 A 的所有条目在 n 中的编码长度都是多项式约束的，那么这就产生了一个强多项式电路直径约束。尽管标准的最小比值电路消除算法在一般情况下不是有限的，但我们的变体可以在（O(mn^2log (n+kappa _A)）增强步骤内解决 LP。

引用次数: 0

Monoidal strengthening and unique lifting in MIQCPs MIQCP 中的单值加强和唯一提升

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-08 DOI: 10.1007/s10107-024-02112-0

Antonia Chmiela, Gonzalo Muñoz, Felipe Serrano

Using the recently proposed maximal quadratic-free sets and the well-known monoidal strengthening procedure, we show how to improve intersection cuts for quadratically-constrained optimization problems by exploiting integrality requirements. We provide an explicit construction that allows an efficient implementation of the strengthened cuts along with computational results showing their improvements over the standard intersection cuts. We also show that, in our setting, there is unique lifting which implies that our strengthening procedure is generating the best possible cut coefficients for the integer variables.

我们利用最近提出的最大无二次型集合和著名的单环加强程序，展示了如何通过利用积分性要求来改进二次型约束优化问题的交叉切分。我们提供了一种明确的构造，可以高效地实现强化切分，同时还提供了计算结果，显示了强化切分与标准交集切分相比的改进。我们还证明，在我们的设置中，存在唯一的提升，这意味着我们的强化程序正在为整数变量生成最佳的切分系数。

引用次数: 0

Optimal methods for convex nested stochastic composite optimization 凸嵌套随机复合优化的最优方法

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-07-05 DOI: 10.1007/s10107-024-02090-3

Zhe Zhang, Guanghui Lan

Recently, convex nested stochastic composite optimization (NSCO) has received considerable interest for its applications in reinforcement learning and risk-averse optimization. However, existing NSCO algorithms have worse stochastic oracle complexities, by orders of magnitude, than those for simpler stochastic optimization problems without nested structures. Additionally, these algorithms require all outer-layer functions to be smooth, a condition violated by some important applications. This raises a question regarding whether the nested composition make stochastic optimization more difficult in terms of oracle complexity. In this paper, we answer the question by developing order-optimal algorithms for convex NSCO problems constructed from an arbitrary composition of smooth, structured non-smooth, and general non-smooth layer functions. When all outer-layer functions are smooth, we propose a stochastic sequential dual (SSD) method to achieve an oracle complexity of (mathcal {O}(1/epsilon ^2)) (resp., (mathcal {O}(1/epsilon ))) when the problem is convex (resp., strongly convex). If any outer-layer function is non-smooth, we propose a non-smooth stochastic sequential dual (nSSD) method to achieve an (mathcal {O}(1/epsilon ^2)) oracle complexity. We provide a lower complexity bound to show the latter (mathcal {O}(1/epsilon ^2)) complexity to be unimprovable, even under a strongly convex setting. All these complexity results seem to be new in the literature, and they indicate that convex NSCO problems have the same order of oracle complexity as problems without nested composition, except in the strongly convex and outer non-smooth cases.

最近，凸嵌套随机复合优化（NSCO）因其在强化学习和风险规避优化中的应用而受到广泛关注。然而，现有的 NSCO 算法的随机甲骨文复杂度比那些没有嵌套结构的简单随机优化问题的复杂度要差，差了几个数量级。此外，这些算法要求所有外层函数都是平滑的，而一些重要应用却违反了这一条件。这就提出了一个问题：嵌套结构是否会增加随机优化的甲骨文复杂度？在本文中，我们通过开发由光滑层函数、结构非光滑层函数和一般非光滑层函数任意组成的凸 NSCO 问题的阶优算法来回答这个问题。当所有外层函数都是平滑的时候，我们提出了一种随机顺序对偶（SSD）方法，当问题是凸的（或者说，强凸的）时，它的oracle复杂度为(mathcal {O}(1/epsilon ^2))（或者说，(mathcal {O}(1/epsilon ))）。如果任何外层函数是非光滑的，我们提出了一种非光滑随机顺序对偶（nSSD）方法，以实现 (mathcal {O}(1/epsilon ^2)) oracle 复杂性。我们提供了一个较低的复杂度约束，以证明后者的 (mathcal {O}(1/epsilon ^2))复杂度是不可改进的，即使在强凸设置下也是如此。所有这些复杂度结果在文献中似乎都是新的，它们表明，除了强凸和外部非光滑情况外，凸NSCO问题与没有嵌套组合的问题具有相同的oracle复杂度阶数。

{"title":"Optimal methods for convex nested stochastic composite optimization","authors":"Zhe Zhang, Guanghui Lan","doi":"10.1007/s10107-024-02090-3","DOIUrl":"https://doi.org/10.1007/s10107-024-02090-3","url":null,"abstract":"Recently, convex nested stochastic composite optimization (NSCO) has received considerable interest for its applications in reinforcement learning and risk-averse optimization. However, existing NSCO algorithms have worse stochastic oracle complexities, by orders of magnitude, than those for simpler stochastic optimization problems without nested structures. Additionally, these algorithms require all outer-layer functions to be smooth, a condition violated by some important applications. This raises a question regarding whether the nested composition make stochastic optimization more difficult in terms of oracle complexity. In this paper, we answer the question by developing order-optimal algorithms for convex NSCO problems constructed from an arbitrary composition of smooth, structured non-smooth, and general non-smooth layer functions. When all outer-layer functions are smooth, we propose a stochastic sequential dual (SSD) method to achieve an oracle complexity of (mathcal {O}(1/epsilon ^2)) (resp., (mathcal {O}(1/epsilon ))) when the problem is convex (resp., strongly convex). If any outer-layer function is non-smooth, we propose a non-smooth stochastic sequential dual (nSSD) method to achieve an (mathcal {O}(1/epsilon ^2)) oracle complexity. We provide a lower complexity bound to show the latter (mathcal {O}(1/epsilon ^2)) complexity to be unimprovable, even under a strongly convex setting. All these complexity results seem to be new in the literature, and they indicate that convex NSCO problems have the same order of oracle complexity as problems without nested composition, except in the strongly convex and outer non-smooth cases.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"59 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141547945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0