Mathematical Programming最新文献_第9页

Frank–Wolfe-type methods for a class of nonconvex inequality-constrained problems 一类非凸不等式约束问题的 Frank-Wolfe 型方法

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-02-03 DOI: 10.1007/s10107-023-02055-y

Abstract

The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a fixed compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single difference-of-convex function, based on new generalized linear-optimization oracles (LO). We show that these LOs can be computed efficiently with closed-form solutions in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is completely different from those existing works in the literature on FW-type methods that deal with fixed feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an away-step oracle to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant.

摘要弗兰克-沃尔夫（Frank-Wolfe，FW）方法实现了在固定紧凑凸集上最小化目标函数线性近似值的高效线性指标，最近在优化和机器学习文献中受到广泛关注。在本文中，我们基于新的广义线性优化神谕（LO），提出了一种新的 FW 型方法，用于在定义为单个凸函数差的水平集的紧凑集上最小化平滑函数。我们证明，在压缩传感和机器学习中出现的一些重要优化模型中，可以通过闭式解高效计算这些 LO。此外，在温和严格的可行性条件下，我们建立了非凸 FW 型方法的后续收敛性。由于我们的广义 LO 的可行区域通常会随着迭代而变化，因此我们的收敛性分析与现有文献中处理子问题间固定可行区域的 FW 型方法完全不同。最后，受用于加速凸问题 FW 型方法的远离步骤的启发，我们进一步设计了一个远离步骤神谕来补充我们的非凸 FW 型方法，并建立了这一变体的后续收敛性。我们给出了使用标准数据集的矩阵完成问题的数值结果，以证明所提出的 FW 型方法及其远离步骤变体的效率。

{"title":"Frank–Wolfe-type methods for a class of nonconvex inequality-constrained problems","authors":"","doi":"10.1007/s10107-023-02055-y","DOIUrl":"https://doi.org/10.1007/s10107-023-02055-y","url":null,"abstract":"<h3>Abstract</h3> The Frank–Wolfe (FW) method, which implements efficient linear oracles that minimize linear approximations of the objective function over a fixed compact convex set, has recently received much attention in the optimization and machine learning literature. In this paper, we propose a new FW-type method for minimizing a smooth function over a compact set defined as the level set of a single difference-of-convex function, based on new generalized linear-optimization oracles (LO). We show that these LOs can be computed efficiently with closed-form solutions in some important optimization models that arise in compressed sensing and machine learning. In addition, under a mild strict feasibility condition, we establish the subsequential convergence of our nonconvex FW-type method. Since the feasible region of our generalized LO typically changes from iteration to iteration, our convergence analysis is completely different from those existing works in the literature on FW-type methods that deal with fixed feasible regions among subproblems. Finally, motivated by the away steps for accelerating FW-type methods for convex problems, we further design an away-step oracle to supplement our nonconvex FW-type method, and establish subsequential convergence of this variant. Numerical results on the matrix completion problem with standard datasets are presented to demonstrate the efficiency of the proposed FW-type method and its away-step variant. ","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"10 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Designing tractable piecewise affine policies for multi-stage adjustable robust optimization 为多阶段可调鲁棒优化设计可操作的片断仿射策略

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-02-03 DOI: 10.1007/s10107-023-02053-0

Simon Thomä, Grit Walther, Maximilian Schiffer

We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.

我们研究了具有非负右侧不确定性的多阶段可调鲁棒优化（ARO）问题的片断仿射策略。首先，我们构建了新的支配性不确定性集，并展示了当不确定性被这些新的不确定性集取代时，如何用线性程序高效地解决多阶段 ARO 问题。然后，我们演示了如何将这一替代问题的解决方案转化为原始问题的解决方案。通过仔细选择支配集，我们证明了我们的策略具有很强的近似边界，并将两阶段问题变体的许多已知边界扩展到了多阶段问题变体。此外，据我们所知，新的界限是首次针对一般多阶段 ARO 问题给出的界限。我们将我们的策略与文献中的其他策略进行了广泛比较，并证明了相对性能保证。在两个数值实验中，我们确定了不同策略的优势和劣势，并提出了有效的调整措施，以解决我们策略中最关键的劣势。总之，实验表明，我们的片断仿射策略的计算速度比仿射策略快几个数量级，同时通常能获得相当甚至更好的结果。

{"title":"Designing tractable piecewise affine policies for multi-stage adjustable robust optimization","authors":"Simon Thomä, Grit Walther, Maximilian Schiffer","doi":"10.1007/s10107-023-02053-0","DOIUrl":"https://doi.org/10.1007/s10107-023-02053-0","url":null,"abstract":"We study piecewise affine policies for multi-stage adjustable robust optimization (ARO) problems with non-negative right-hand side uncertainty. First, we construct new dominating uncertainty sets and show how a multi-stage ARO problem can be solved efficiently with a linear program when uncertainty is replaced by these new sets. We then demonstrate how solutions for this alternative problem can be transformed into solutions for the original problem. By carefully choosing the dominating sets, we prove strong approximation bounds for our policies and extend many previously best-known bounds for the two-staged problem variant to its multi-stage counterpart. Moreover, the new bounds are—to the best of our knowledge—the first bounds shown for the general multi-stage ARO problem considered. We extensively compare our policies to other policies from the literature and prove relative performance guarantees. In two numerical experiments, we identify beneficial and disadvantageous properties for different policies and present effective adjustments to tackle the most critical disadvantages of our policies. Overall, the experiments show that our piecewise affine policies can be computed by orders of magnitude faster than affine policies, while often yielding comparable or even better results.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"12 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139677417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A constant-factor approximation for generalized malleable scheduling under $$M ^{natural }$$ -concave processing speeds 在 $$M ^{natural }$$ -凹处理速度条件下的广义可延展调度的常系数近似值

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-29 DOI: 10.1007/s10107-023-02054-z

Dimitris Fotakis, Jannik Matuschke, Orestis Papadigenopoulos

In generalized malleable scheduling, jobs can be allocated and processed simultaneously on multiple machines so as to reduce the overall makespan of the schedule. The required processing time for each job is determined by the joint processing speed of the allocated machines. We study the case that processing speeds are job-dependent (M ^{natural })-concave functions and provide a constant-factor approximation for this setting, significantly expanding the realm of functions for which such an approximation is possible. Further, we explore the connection between malleable scheduling and the problem of fairly allocating items to a set of agents with distinct utility functions, devising a black-box reduction that allows to obtain resource-augmented approximation algorithms for the latter.

在广义延展性调度中，作业可在多台机器上同时分配和处理，以减少调度的总体时间跨度。每个作业所需的处理时间由所分配机器的联合处理速度决定。我们研究了处理速度是与作业相关的 (M ^{natural })-concave 函数的情况，并为这种情况提供了一个常系数近似值，从而大大扩展了这种近似值可能适用的函数领域。此外，我们还探讨了可变调度与向一组具有不同效用函数的代理公平分配物品问题之间的联系，并设计了一种黑箱还原方法，从而为后者获得了资源增量近似算法。

引用次数: 0

Adjustability in robust linear optimization 稳健线性优化中的可调整性

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-27 DOI: 10.1007/s10107-023-02049-w

Abstract

We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.

摘要我们研究了可调整性的概念--两类动态稳健优化公式之间目标值的差异：一类是在不确定性实现之前做出（静态）决策，另一类是在做出（可调整的）决策之前解决不确定性。这种差异反映了信息和决策时机在不确定条件下优化的价值，并与其他几个概念相关，如稳健优化中决策规则的最优性。我们基于线性目标、线性约束和固定追索权的稳健优化问题的输入数据，建立了一个量化可调整性的理论框架。我们只做了很少的额外假设。特别是，我们没有假设约束分离性或参数非负性，而这些在研究可调整性的文献中是常见的。这使得我们可以研究一些重要但以前未充分研究的问题，如带有相等约束条件的公式，以及同时带有上限和下限约束条件的问题。在发现静态问题和完全可调整问题的重构之间存在有趣联系的基础上，我们的分析给出了当不确定集合为多面体时，可调整性为零的必要条件和充分条件--以替代定理的形式。基于这一尖锐的表征，我们提供了两个高效的混合整数优化公式来验证零可调性。然后，我们开发了一种构造性方法来量化不确定性集为一般时的可调整性，并由此产生了一种高效、严密的多时间算法来约束可调整性。我们通过理论和数值分析证明了这种算法的高效性和严密性。

{"title":"Adjustability in robust linear optimization","authors":"","doi":"10.1007/s10107-023-02049-w","DOIUrl":"https://doi.org/10.1007/s10107-023-02049-w","url":null,"abstract":"<h3>Abstract</h3> We investigate the concept of adjustability—the difference in objective values between two types of dynamic robust optimization formulations: one where (static) decisions are made before uncertainty realization, and one where uncertainty is resolved before (adjustable) decisions. This difference reflects the value of information and decision timing in optimization under uncertainty, and is related to several other concepts such as the optimality of decision rules in robust optimization. We develop a theoretical framework to quantify adjustability based on the input data of a robust optimization problem with a linear objective, linear constraints, and fixed recourse. We make very few additional assumptions. In particular, we do not assume constraint-wise separability or parameter nonnegativity that are commonly imposed in the literature for the study of adjustability. This allows us to study important but previously under-investigated problems, such as formulations with equality constraints and problems with both upper and lower bound constraints. Based on the discovery of an interesting connection between the reformulations of the static and fully adjustable problems, our analysis gives a necessary and sufficient condition—in the form of a theorem-of-the-alternatives—for adjustability to be zero when the uncertainty set is polyhedral. Based on this sharp characterization, we provide two efficient mixed-integer optimization formulations to verify zero adjustability. Then, we develop a constructive approach to quantify adjustability when the uncertainty set is general, which results in an efficient and tight poly-time algorithm to bound adjustability. We demonstrate the efficiency and tightness via both theoretical and numerical analyses.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"32 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139583987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Constrained optimization of rank-one functions with indicator variables 带指标变量的秩一函数的约束优化

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-20 DOI: 10.1007/s10107-023-02047-y

Soroosh Shafiee, Fatma Kılınç-Karzan

Optimization problems involving minimization of a rank-one convex function over constraints modeling restrictions on the support of the decision variables emerge in various machine learning applications. These problems are often modeled with indicator variables for identifying the support of the continuous variables. In this paper we investigate compact extended formulations for such problems through perspective reformulation techniques. In contrast to the majority of previous work that relies on support function arguments and disjunctive programming techniques to provide convex hull results, we propose a constructive approach that exploits a hidden conic structure induced by perspective functions. To this end, we first establish a convex hull result for a general conic mixed-binary set in which each conic constraint involves a linear function of independent continuous variables and a set of binary variables. We then demonstrate that extended representations of sets associated with epigraphs of rank-one convex functions over constraints modeling indicator relations naturally admit such a conic representation. This enables us to systematically give perspective formulations for the convex hull descriptions of these sets with nonlinear separable or non-separable objective functions, sign constraints on continuous variables, and combinatorial constraints on indicator variables. We illustrate the efficacy of our results on sparse nonnegative logistic regression problems.

在各种机器学习应用中都会出现优化问题，其中涉及在对决策变量的支持进行建模限制的约束条件下最小化秩一凸函数。这些问题通常用指标变量来建模，以确定连续变量的支持度。在本文中，我们通过透视重构技术研究了此类问题的紧凑扩展公式。与之前大多数依赖支持度函数参数和断裂编程技术来提供凸壳结果的工作不同，我们提出了一种利用透视函数诱导的隐藏圆锥结构的构造性方法。为此，我们首先建立了一般圆锥混合二元集合的凸壳结果，其中每个圆锥约束都涉及独立连续变量和二元变量集合的线性函数。然后，我们证明了与秩一凸函数表图相关的集合的扩展表示，而这些表图又是以指标关系为模型的约束条件，因此自然会有这样的圆锥表示。这使我们能够系统地给出这些集合的凸壳描述的透视公式，这些集合具有非线性可分或不可分目标函数、连续变量的符号约束以及指示变量的组合约束。我们在稀疏非负逻辑回归问题上说明了我们结果的有效性。

{"title":"Constrained optimization of rank-one functions with indicator variables","authors":"Soroosh Shafiee, Fatma Kılınç-Karzan","doi":"10.1007/s10107-023-02047-y","DOIUrl":"https://doi.org/10.1007/s10107-023-02047-y","url":null,"abstract":"Optimization problems involving minimization of a rank-one convex function over constraints modeling restrictions on the support of the decision variables emerge in various machine learning applications. These problems are often modeled with indicator variables for identifying the support of the continuous variables. In this paper we investigate compact extended formulations for such problems through perspective reformulation techniques. In contrast to the majority of previous work that relies on support function arguments and disjunctive programming techniques to provide convex hull results, we propose a constructive approach that exploits a hidden conic structure induced by perspective functions. To this end, we first establish a convex hull result for a general conic mixed-binary set in which each conic constraint involves a linear function of independent continuous variables and a set of binary variables. We then demonstrate that extended representations of sets associated with epigraphs of rank-one convex functions over constraints modeling indicator relations naturally admit such a conic representation. This enables us to systematically give perspective formulations for the convex hull descriptions of these sets with nonlinear separable or non-separable objective functions, sign constraints on continuous variables, and combinatorial constraints on indicator variables. We illustrate the efficacy of our results on sparse nonnegative logistic regression problems.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"1 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139507788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Characterization of matrices with bounded Graver bases and depth parameters and applications to integer programming 具有有界格拉弗基和深度参数的矩阵特征及在整数编程中的应用

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-20 DOI: 10.1007/s10107-023-02048-x

Marcin Briański, Martin Koutecký, Daniel Král’, Kristýna Pekárková, Felix Schröder

An intensive line of research on fixed parameter tractability of integer programming is focused on exploiting the relation between the sparsity of a constraint matrix A and the norm of the elements of its Graver basis. In particular, integer programming is fixed parameter tractable when parameterized by the primal tree-depth and the entry complexity of A, and when parameterized by the dual tree-depth and the entry complexity of A; both these parameterization imply that A is sparse, in particular, the number of its non-zero entries is linear in the number of columns or rows, respectively. We study preconditioners transforming a given matrix to a row-equivalent sparse matrix if it exists and provide structural results characterizing the existence of a sparse row-equivalent matrix in terms of the structural properties of the associated column matroid. In particular, our results imply that the (ell _1)-norm of the Graver basis is bounded by a function of the maximum (ell _1)-norm of a circuit of A. We use our results to design a parameterized algorithm that constructs a matrix row-equivalent to an input matrix A that has small primal/dual tree-depth and entry complexity if such a row-equivalent matrix exists. Our results yield parameterized algorithms for integer programming when parameterized by the (ell _1)-norm of the Graver basis of the constraint matrix, when parameterized by the (ell _1)-norm of the circuits of the constraint matrix, when parameterized by the smallest primal tree-depth and entry complexity of a matrix row-equivalent to the constraint matrix, and when parameterized by the smallest dual tree-depth and entry complexity of a matrix row-equivalent to the constraint matrix.

关于整数编程固定参数可控性的深入研究，主要集中在利用约束矩阵 A 的稀疏性与其格拉弗基元素的规范之间的关系。特别是，当以 A 的原始树深度和输入复杂度为参数时，以及以 A 的对偶树深度和输入复杂度为参数时，整数编程都是固定参数可控的；这两种参数化都意味着 A 是稀疏的，特别是，其非零条目数分别与列数或行数呈线性关系。如果存在将给定矩阵转换为行等效稀疏矩阵的预处理器，我们将对其进行研究，并根据相关列 matroid 的结构特性提供表征稀疏行等效矩阵存在性的结构性结果。特别是，我们的结果意味着格拉弗基的(ell _1)-norm是由A的一个回路的最大(ell _1)-norm的函数限定的。我们利用我们的结果设计了一种参数化算法，如果存在这样一个行等价矩阵，该算法可以构造一个与输入矩阵A行等价的矩阵，该矩阵具有较小的原始/双树深度和入口复杂度。当以约束矩阵的格拉弗基的（ell _1）-正态为参数时，当以约束矩阵的回路的（ell _1）-正态为参数时，当以与约束矩阵行向等价的矩阵的最小原始树深度和入口复杂度为参数时，以及当以与约束矩阵行向等价的矩阵的最小对偶树深度和入口复杂度为参数时，我们的结果产生了整数编程的参数化算法。

{"title":"Characterization of matrices with bounded Graver bases and depth parameters and applications to integer programming","authors":"Marcin Briański, Martin Koutecký, Daniel Král’, Kristýna Pekárková, Felix Schröder","doi":"10.1007/s10107-023-02048-x","DOIUrl":"https://doi.org/10.1007/s10107-023-02048-x","url":null,"abstract":"An intensive line of research on fixed parameter tractability of integer programming is focused on exploiting the relation between the sparsity of a constraint matrix A and the norm of the elements of its Graver basis. In particular, integer programming is fixed parameter tractable when parameterized by the primal tree-depth and the entry complexity of A, and when parameterized by the dual tree-depth and the entry complexity of A; both these parameterization imply that A is sparse, in particular, the number of its non-zero entries is linear in the number of columns or rows, respectively. We study preconditioners transforming a given matrix to a row-equivalent sparse matrix if it exists and provide structural results characterizing the existence of a sparse row-equivalent matrix in terms of the structural properties of the associated column matroid. In particular, our results imply that the (ell _1)-norm of the Graver basis is bounded by a function of the maximum (ell _1)-norm of a circuit of A. We use our results to design a parameterized algorithm that constructs a matrix row-equivalent to an input matrix A that has small primal/dual tree-depth and entry complexity if such a row-equivalent matrix exists. Our results yield parameterized algorithms for integer programming when parameterized by the (ell _1)-norm of the Graver basis of the constraint matrix, when parameterized by the (ell _1)-norm of the circuits of the constraint matrix, when parameterized by the smallest primal tree-depth and entry complexity of a matrix row-equivalent to the constraint matrix, and when parameterized by the smallest dual tree-depth and entry complexity of a matrix row-equivalent to the constraint matrix.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"29 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139507778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A competitive algorithm for throughput maximization on identical machines 在相同机器上实现吞吐量最大化的竞争性算法

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-10 DOI: 10.1007/s10107-023-02045-0

Benjamin Moseley, Kirk Pruhs, Clifford Stein, Rudy Zhou

This paper considers the basic problem of scheduling jobs online with preemption to maximize the number of jobs completed by their deadline on m identical machines. The main result is an O(1) competitive deterministic algorithm for any number of machines (m >1).

本文研究了一个基本问题，即通过抢占式在线作业调度，在 m 台相同机器上最大限度地提高在截止日期前完成作业的数量。主要结果是针对任意机器数量（m >1）的 O（1）竞争确定性算法。

引用次数: 0

Counterexample and an additional revealing poll step for a result of “analysis of direct searches for discontinuous functions” 反例和 "不连续函数直接搜索分析 "结果的额外揭示性投票步骤

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-08 DOI: 10.1007/s10107-023-02042-3

Abstract

This note provides a counterexample to a theorem announced in the last part of the paper (Vicente and Custódio Math Program 133:299–325, 2012). The counterexample involves an objective function (f: mathbb {R}rightarrow mathbb {R}) which satisfies all the assumptions required by the theorem but contradicts some of its conclusions. A corollary of this theorem is also affected by this counterexample. The main flaw revealed by the counterexample is the possibility that a directional direct search method (dDSM) generates a sequence of trial points ((x_k)_{k in mathbb {N}}) converging to a point (x_*) where f is discontinuous, lower semicontinuous and whose objective function value (f(x_*)) is strictly less than (lim _{krightarrow infty } f(x_k)) . Moreover the dDSM generates trial points in only one of the continuity sets of f near (x_*) . This note also investigates the proof of the theorem to highlight the inexact statements in the original paper. Finally this work introduces a modification of the dDSM that allows, in usual cases, to recover the properties broken by the counterexample.

摘要本注释提供了论文最后一部分（Vicente and Custódio Math Program 133:299-325, 2012）中公布的一个定理的反例。该反例涉及一个目标函数（f: mathbb {R}rightarrow mathbb {R}/），它满足定理所要求的所有假设，但与定理的某些结论相矛盾。该定理的一个推论也受到了这个反例的影响。这个反例揭示的主要缺陷是定向直接搜索法（dDSM）有可能产生一连串的试验点 ((x_k)_{k in mathbb {N}}) 收敛到 f 不连续的点(x_*)、并且其目标函数值（f(x_*)）严格小于（f(x_k)）。此外，dDSM 只在在(x_*)附近的 f 的连续性集合中的一个集合中产生试验点。本注释还研究了定理的证明，以突出原论文中不精确的陈述。最后，本文介绍了对 dDSM 的修改，在通常情况下，它可以恢复被反例破坏的性质。

{"title":"Counterexample and an additional revealing poll step for a result of “analysis of direct searches for discontinuous functions”","authors":"","doi":"10.1007/s10107-023-02042-3","DOIUrl":"https://doi.org/10.1007/s10107-023-02042-3","url":null,"abstract":"<h3>Abstract</h3> This note provides a counterexample to a theorem announced in the last part of the paper (Vicente and Custódio Math Program 133:299–325, 2012). The counterexample involves an objective function (f: mathbb {R}rightarrow mathbb {R}) which satisfies all the assumptions required by the theorem but contradicts some of its conclusions. A corollary of this theorem is also affected by this counterexample. The main flaw revealed by the counterexample is the possibility that a directional direct search method (dDSM) generates a sequence of trial points ((x_k)_{k in mathbb {N}}) converging to a point (x_*) where f is discontinuous, lower semicontinuous and whose objective function value (f(x_*)) is strictly less than (lim _{krightarrow infty } f(x_k)) . Moreover the dDSM generates trial points in only one of the continuity sets of f near (x_*) . This note also investigates the proof of the theorem to highlight the inexact statements in the original paper. Finally this work introduces a modification of the dDSM that allows, in usual cases, to recover the properties broken by the counterexample. ","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"1 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139410419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

No dimension-free deterministic algorithm computes approximate stationarities of Lipschitzians 没有一种无维度确定性算法能计算 Lipschitzians 的近似静止性

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-06 DOI: 10.1007/s10107-023-02031-6

Lai Tian, Anthony Man-Cho So

We consider the oracle complexity of computing an approximate stationary point of a Lipschitz function. When the function is smooth, it is well known that the simple deterministic gradient method has finite dimension-free oracle complexity. However, when the function can be nonsmooth, it is only recently that a randomized algorithm with finite dimension-free oracle complexity has been developed. In this paper, we show that no deterministic algorithm can do the same. Moreover, even without the dimension-free requirement, we show that any finite-time deterministic method cannot be general zero-respecting. In particular, this implies that a natural derandomization of the aforementioned randomized algorithm cannot have finite-time complexity. Our results reveal a fundamental hurdle in modern large-scale nonconvex nonsmooth optimization.

我们考虑的是计算一个 Lipschitz 函数近似静止点的算法复杂度。众所周知，当函数为光滑函数时，简单的确定性梯度法具有有限的无维算法复杂度。然而，当函数可能是非光滑的时候，直到最近才开发出一种具有有限无维度oracle复杂度的随机算法。在本文中，我们证明没有一种确定性算法能做到这一点。此外，即使没有无维度要求，我们也证明了任何有限时间确定性方法都不可能是一般零尊重的。特别是，这意味着上述随机算法的自然去随机化不可能具有有限时间复杂性。我们的结果揭示了现代大规模非凸非光滑优化中的一个基本障碍。

引用次数: 0

Complementary composite minimization, small gradients in general norms, and applications 互补复合最小化、一般规范中的小梯度及其应用

IF 2.7 2区数学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Mathematical Programming

Pub Date : 2024-01-05 DOI: 10.1007/s10107-023-02040-5

Jelena Diakonikolas, Cristóbal Guzmán

Composite minimization is a powerful framework in large-scale convex optimization, based on decoupling of the objective function into terms with structurally different properties and allowing for more flexible algorithmic design. We introduce a new algorithmic framework for complementary composite minimization, where the objective function decouples into a (weakly) smooth and a uniformly convex term. This particular form of decoupling is pervasive in statistics and machine learning, due to its link to regularization. The main contributions of our work are summarized as follows. First, we introduce the problem of complementary composite minimization in general normed spaces; second, we provide a unified accelerated algorithmic framework to address broad classes of complementary composite minimization problems; and third, we prove that the algorithms resulting from our framework are near-optimal in most of the standard optimization settings. Additionally, we show that our algorithmic framework can be used to address the problem of making the gradients small in general normed spaces. As a concrete example, we obtain a nearly-optimal method for the standard (ell _1) setup (small gradients in the (ell _infty ) norm), essentially matching the bound of Nesterov (Optima Math Optim Soc Newsl 88:10–11, 2012) that was previously known only for the Euclidean setup. Finally, we show that our composite methods are broadly applicable to a number of regression and other classes of optimization problems, where regularization plays a key role. Our methods lead to complexity bounds that are either new or match the best existing ones.

复合最小化是大规模凸优化的一个强大框架，其基础是将目标函数解耦为具有不同结构性质的项，从而实现更灵活的算法设计。我们为互补复合最小化引入了一个新的算法框架，其中目标函数解耦为一个（弱）平滑项和一个均匀凸项。由于与正则化的联系，这种特殊形式的解耦在统计学和机器学习中非常普遍。我们工作的主要贡献总结如下。首先，我们介绍了一般规范空间中的互补复合最小化问题；其次，我们提供了一个统一的加速算法框架，以解决各类互补复合最小化问题；第三，我们证明了我们的框架所产生的算法在大多数标准优化设置中接近最优。此外，我们还证明了我们的算法框架可用于解决在一般规范空间中使梯度变小的问题。举个具体的例子，我们得到了标准 (ell _1)设置（在 (ell _infty )规范中的小梯度）的近乎最优方法，基本上与内斯特洛夫（Optima Math Optim Soc Newsl 88:10-11，2012）的约束相匹配，而这一约束以前只在欧几里得设置中已知。最后，我们展示了我们的复合方法广泛适用于许多回归和其他优化问题，其中正则化起着关键作用。我们的方法所得出的复杂度边界要么是全新的，要么与现有的最佳边界相匹配。

{"title":"Complementary composite minimization, small gradients in general norms, and applications","authors":"Jelena Diakonikolas, Cristóbal Guzmán","doi":"10.1007/s10107-023-02040-5","DOIUrl":"https://doi.org/10.1007/s10107-023-02040-5","url":null,"abstract":"Composite minimization is a powerful framework in large-scale convex optimization, based on decoupling of the objective function into terms with structurally different properties and allowing for more flexible algorithmic design. We introduce a new algorithmic framework for complementary composite minimization, where the objective function decouples into a (weakly) smooth and a uniformly convex term. This particular form of decoupling is pervasive in statistics and machine learning, due to its link to regularization. The main contributions of our work are summarized as follows. First, we introduce the problem of complementary composite minimization in general normed spaces; second, we provide a unified accelerated algorithmic framework to address broad classes of complementary composite minimization problems; and third, we prove that the algorithms resulting from our framework are near-optimal in most of the standard optimization settings. Additionally, we show that our algorithmic framework can be used to address the problem of making the gradients small in general normed spaces. As a concrete example, we obtain a nearly-optimal method for the standard (ell _1) setup (small gradients in the (ell _infty ) norm), essentially matching the bound of Nesterov (Optima Math Optim Soc Newsl 88:10–11, 2012) that was previously known only for the Euclidean setup. Finally, we show that our composite methods are broadly applicable to a number of regression and other classes of optimization problems, where regularization plays a key role. Our methods lead to complexity bounds that are either new or match the best existing ones.","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"20 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139373803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0