首页 > 最新文献

Mathematical Methods of Operations Research最新文献

英文 中文
Low-complexity algorithm for restless bandits with imperfect observations 观测不完善的不安定强盗的低复杂度算法
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-09-05 DOI: 10.1007/s00186-024-00868-x
Keqin Liu, Richard Weber, Chengzhong Zhang

We consider a class of restless bandit problems that finds a broad application area in reinforcement learning and stochastic optimization. We consider N independent discrete-time Markov processes, each of which had two possible states: 1 and 0 (‘good’ and ‘bad’). Only if a process is both in state 1 and observed to be so does reward accrue. The aim is to maximize the expected discounted sum of returns over the infinite horizon subject to a constraint that only M ((<N)) processes may be observed at each step. Observation is error-prone: there are known probabilities that state 1 (0) will be observed as 0 (1). From this one knows, at any time t, a probability that process i is in state 1. The resulting system may be modeled as a restless multi-armed bandit problem with an information state space of uncountable cardinality. Restless bandit problems with even finite state spaces are PSPACE-HARD in general. We propose a novel approach for simplifying the dynamic programming equations of this class of restless bandits and develop a low-complexity algorithm that achieves a strong performance and is readily extensible to the general restless bandit model with observation errors. Under certain conditions, we establish the existence (indexability) of Whittle index and its equivalence to our algorithm. When those conditions do not hold, we show by numerical experiments the near-optimal performance of our algorithm in the general parametric space. Furthermore, we theoretically prove the optimality of our algorithm for homogeneous systems.

我们考虑的是一类不安定的强盗问题,它在强化学习和随机优化中有着广泛的应用。我们考虑 N 个独立的离散时间马尔可夫过程,每个过程都有两种可能的状态:1 和 0("好 "和 "坏"):1和0("好 "和 "坏")。只有当进程处于状态 1 并被观察到时,才会产生奖励。我们的目标是在每一步只能观察到 M ((<N))个过程的约束下,最大化无限期内的预期贴现收益总和。观察是容易出错的:状态 1(0)被观察为 0(1)的概率是已知的。由此可以知道,在任何时间 t,进程 i 处于状态 1 的概率。由此产生的系统可以建模为一个不安定的多臂强盗问题,其信息状态空间具有不可计数的卡方性。一般来说,即使是有限状态空间的无休止强盗问题也是 PSPACE-HARD(空间困难)的。我们提出了一种简化该类无休止强盗动态程序方程的新方法,并开发了一种低复杂度算法,该算法性能优异,可随时扩展到具有观测误差的一般无休止强盗模型。在某些条件下,我们建立了惠特尔指数的存在性(可索引性)及其与我们算法的等价性。当这些条件不成立时,我们通过数值实验证明了我们的算法在一般参数空间中接近最优的性能。此外,我们还从理论上证明了我们的算法对于同质系统的最优性。
{"title":"Low-complexity algorithm for restless bandits with imperfect observations","authors":"Keqin Liu, Richard Weber, Chengzhong Zhang","doi":"10.1007/s00186-024-00868-x","DOIUrl":"https://doi.org/10.1007/s00186-024-00868-x","url":null,"abstract":"<p>We consider a class of restless bandit problems that finds a broad application area in reinforcement learning and stochastic optimization. We consider <i>N</i> independent discrete-time Markov processes, each of which had two possible states: 1 and 0 (‘good’ and ‘bad’). Only if a process is both in state 1 and observed to be so does reward accrue. The aim is to maximize the expected discounted sum of returns over the infinite horizon subject to a constraint that only <i>M</i> <span>((&lt;N))</span> processes may be observed at each step. Observation is error-prone: there are known probabilities that state 1 (0) will be observed as 0 (1). From this one knows, at any time <i>t</i>, a probability that process <i>i</i> is in state 1. The resulting system may be modeled as a restless multi-armed bandit problem with an information state space of uncountable cardinality. Restless bandit problems with even finite state spaces are PSPACE-HARD in general. We propose a novel approach for simplifying the dynamic programming equations of this class of restless bandits and develop a low-complexity algorithm that achieves a strong performance and is readily extensible to the general restless bandit model with observation errors. Under certain conditions, we establish the existence (indexability) of Whittle index and its equivalence to our algorithm. When those conditions do not hold, we show by numerical experiments the near-optimal performance of our algorithm in the general parametric space. Furthermore, we theoretically prove the optimality of our algorithm for homogeneous systems.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"15 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142201491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-stage distributionally robust convex stochastic optimization with Bayesian-type ambiguity sets 具有贝叶斯型模糊集的多阶段分布稳健凸随机优化
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-08-14 DOI: 10.1007/s00186-024-00872-1
Wentao Ma, Zhiping Chen

The existent methods for constructing ambiguity sets in distributionally robust optimization often suffer from over-conservativeness and inefficient utilization of available data. To address these limitations and to practically solve multi-stage distributionally robust optimization (MDRO), we propose a data-driven Bayesian-type approach that constructs the ambiguity set of possible distributions from a Bayesian perspective. We demonstrate that our Bayesian-type MDRO problem can be reformulated as a risk-averse multi-stage stochastic programming problem and subsequently investigate its theoretical properties such as consistency, finite sample guarantee, and statistical robustness. Moreover, the reformulation enables us to employ cutting planes algorithms in dynamic settings to solve the Bayesian-type MDRO problem. To illustrate the practicality and advantages of the proposed model and algorithm, we apply it to a distributionally robust inventory control problem and a distributionally robust hydrothermal scheduling problem, and compare it with usual formulations and solution methods to highlight the superior performance of our approach.

在分布稳健优化中,现有的模糊集构建方法往往存在过度保守和可用数据利用效率低的问题。为了解决这些局限性,并切实解决多阶段分布稳健优化(MDRO)问题,我们提出了一种数据驱动的贝叶斯式方法,从贝叶斯的角度构建可能分布的模糊集。我们证明了贝叶斯型 MDRO 问题可以重新表述为风险规避型多阶段随机编程问题,并随后研究了其理论特性,如一致性、有限样本保证和统计稳健性。此外,重新表述使我们能够在动态环境中采用切割平面算法来解决贝叶斯型 MDRO 问题。为了说明所提模型和算法的实用性和优势,我们将其应用于分布鲁棒库存控制问题和分布鲁棒水热调度问题,并将其与通常的公式和求解方法进行比较,以突出我们方法的优越性能。
{"title":"Multi-stage distributionally robust convex stochastic optimization with Bayesian-type ambiguity sets","authors":"Wentao Ma, Zhiping Chen","doi":"10.1007/s00186-024-00872-1","DOIUrl":"https://doi.org/10.1007/s00186-024-00872-1","url":null,"abstract":"<p>The existent methods for constructing ambiguity sets in distributionally robust optimization often suffer from over-conservativeness and inefficient utilization of available data. To address these limitations and to practically solve multi-stage distributionally robust optimization (MDRO), we propose a data-driven Bayesian-type approach that constructs the ambiguity set of possible distributions from a Bayesian perspective. We demonstrate that our Bayesian-type MDRO problem can be reformulated as a risk-averse multi-stage stochastic programming problem and subsequently investigate its theoretical properties such as consistency, finite sample guarantee, and statistical robustness. Moreover, the reformulation enables us to employ cutting planes algorithms in dynamic settings to solve the Bayesian-type MDRO problem. To illustrate the practicality and advantages of the proposed model and algorithm, we apply it to a distributionally robust inventory control problem and a distributionally robust hydrothermal scheduling problem, and compare it with usual formulations and solution methods to highlight the superior performance of our approach.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"15 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142201492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new value for communication situations 交流情况的新价值
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-08-12 DOI: 10.1007/s00186-024-00873-0
Daniel Li Li, Erfang Shan

A communication situation (NvH) consists of a cooperative game (Nv) and a communication hypergraph (NH), for which the Myerson value and the position value are well-known allocation rules. The value defined in this paper treats links in H as imaginal players, for which we define a bipartite graph between N and H according to the structure given by H, and propose an allocation rule called the bipartite value. This value assigns payoff to each player with two parts: as a player and as a member in links. A characterization of the bipartite value is given.

通信情境(N,v,H)由合作博弈(N,v)和通信超图(N,H)组成,其中迈尔森值和位置值是众所周知的分配规则。本文定义的值将 H 中的链接视为意象玩家,为此我们根据 H 给出的结构定义了 N 和 H 之间的双向图,并提出了一种名为双向值的分配规则。该值将每个玩家的报酬分为两部分:作为玩家的报酬和作为链接中成员的报酬。本文给出了双向值的特征。
{"title":"A new value for communication situations","authors":"Daniel Li Li, Erfang Shan","doi":"10.1007/s00186-024-00873-0","DOIUrl":"https://doi.org/10.1007/s00186-024-00873-0","url":null,"abstract":"<p>A communication situation (<i>N</i>, <i>v</i>, <i>H</i>) consists of a cooperative game (<i>N</i>, <i>v</i>) and a communication hypergraph (<i>N</i>, <i>H</i>), for which the Myerson value and the position value are well-known allocation rules. The value defined in this paper treats links in <i>H</i> as imaginal players, for which we define a bipartite graph between <i>N</i> and <i>H</i> according to the structure given by <i>H</i>, and propose an allocation rule called the bipartite value. This value assigns payoff to each player with two parts: as a player and as a member in links. A characterization of the bipartite value is given.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"58 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the relationship between the value function and the efficient frontier of a mixed integer linear optimization problem 论混合整数线性优化问题的价值函数与有效前沿之间的关系
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-08-02 DOI: 10.1007/s00186-024-00871-2
Samira Fallah, Ted K. Ralphs, Natashia L. Boland

In this study, we investigate the connection between the efficient frontier (EF) of a general multiobjective mixed integer linear optimization problem (MILP) and the so-called restricted value function (RVF) of a closely related single-objective MILP. In the first part of the paper, we detail the mathematical structure of the RVF, including characterizing the set of points at which it is differentiable, the gradients at such points, and the subdifferential at all nondifferentiable points. We then show that the EF of the multiobjective MILP is comprised of points on the boundary of the epigraph of the RVF and that any description of the EF suffices to describe the RVF and vice versa. Because of the close relationship of the RVF to the EF, we observe that methods for constructing the so-called value function (VF) of an MILP and methods for constructing the EF of a multiobjective optimization problem are effectively interchangeable. Exploiting this observation, we propose a generalized cutting-plane algorithm for constructing the EF of a multiobjective MILP that arises from an existing algorithm for constructing the classical MILP VF. The algorithm identifies the set of all integer parts of solutions on the EF. We prove that the algorithm converges finitely under a standard boundedness assumption and comes with a performance guarantee if terminated early.

在本研究中,我们探讨了一般多目标混合整数线性优化问题(MILP)的有效前沿(EF)与密切相关的单目标 MILP 的所谓受限值函数(RVF)之间的联系。在本文的第一部分,我们将详细介绍 RVF 的数学结构,包括描述其可微分的点集、这些点上的梯度以及所有不可微分点上的次微分。然后,我们证明多目标 MILP 的 EF 是由 RVF 边界上的点组成的,对 EF 的任何描述都足以描述 RVF,反之亦然。由于 RVF 与 EF 关系密切,我们发现构建 MILP 的所谓值函数 (VF) 的方法和构建多目标优化问题的 EF 的方法实际上是可以互换的。利用这一观察结果,我们提出了一种用于构建多目标 MILP EF 的广义切割面算法,该算法源于构建经典 MILP VF 的现有算法。该算法确定了 EF 上所有整数部分解的集合。我们证明,该算法在标准有界性假设下有限收敛,而且如果提前终止,还能保证性能。
{"title":"On the relationship between the value function and the efficient frontier of a mixed integer linear optimization problem","authors":"Samira Fallah, Ted K. Ralphs, Natashia L. Boland","doi":"10.1007/s00186-024-00871-2","DOIUrl":"https://doi.org/10.1007/s00186-024-00871-2","url":null,"abstract":"<p>In this study, we investigate the connection between the efficient frontier (EF) of a general multiobjective mixed integer linear optimization problem (MILP) and the so-called <i>restricted value function</i> (RVF) of a closely related single-objective MILP. In the first part of the paper, we detail the mathematical structure of the RVF, including characterizing the set of points at which it is differentiable, the gradients at such points, and the subdifferential at all nondifferentiable points. We then show that the EF of the multiobjective MILP is comprised of points on the boundary of the epigraph of the RVF and that any description of the EF suffices to describe the RVF and vice versa. Because of the close relationship of the RVF to the EF, we observe that methods for constructing the so-called value function (VF) of an MILP and methods for constructing the EF of a multiobjective optimization problem are effectively interchangeable. Exploiting this observation, we propose a generalized cutting-plane algorithm for constructing the EF of a multiobjective MILP that arises from an existing algorithm for constructing the classical MILP VF. The algorithm identifies the set of all integer parts of solutions on the EF. We prove that the algorithm converges finitely under a standard boundedness assumption and comes with a performance guarantee if terminated early.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"56 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141880569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An approximation algorithm for multiobjective mixed-integer convex optimization 多目标混合整数凸优化的近似算法
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-07-29 DOI: 10.1007/s00186-024-00870-3
Ina Lammel, Karl-Heinz Küfer, Philipp Süss

In this article we introduce an algorithm that approximates the nondominated sets of multiobjective mixed-integer convex optimization problems. The algorithm constructs an inner and outer approximation of the front exploiting the convexity of the patches for problems with an arbitrary number of criteria. In the algorithm, the problem is decomposed into patches, which are multiobjective convex problems, by fixing the integer assignments. The patch problems are solved using (simplicial) Sandwiching. We identify parts of patches that are dominated by other patches and ensure that these patch parts are not refined further. We prove that the algorithm converges and show a bound on the reduction of the approximation error in the course of the algorithm. We illustrate the behaviour of our algorithm using some numerical examples and compare its performance to an algorithm from literature.

本文介绍了一种近似多目标混合整数凸优化问题非支配集的算法。对于具有任意数量标准的问题,该算法利用补丁的凸性构建前沿的内近似和外近似。在该算法中,通过固定整数赋值,将问题分解为补丁,即多目标凸问题。补丁问题使用(简单)三明治法求解。我们识别出被其他补丁支配的补丁部分,并确保这些补丁部分不再进一步细化。我们证明了算法的收敛性,并展示了算法过程中近似误差减少的界限。我们用一些数值示例说明了我们算法的行为,并将其性能与文献中的算法进行了比较。
{"title":"An approximation algorithm for multiobjective mixed-integer convex optimization","authors":"Ina Lammel, Karl-Heinz Küfer, Philipp Süss","doi":"10.1007/s00186-024-00870-3","DOIUrl":"https://doi.org/10.1007/s00186-024-00870-3","url":null,"abstract":"<p>In this article we introduce an algorithm that approximates the nondominated sets of multiobjective mixed-integer convex optimization problems. The algorithm constructs an inner and outer approximation of the front exploiting the convexity of the patches for problems with an arbitrary number of criteria. In the algorithm, the problem is decomposed into patches, which are multiobjective convex problems, by fixing the integer assignments. The patch problems are solved using (simplicial) Sandwiching. We identify parts of patches that are dominated by other patches and ensure that these patch parts are not refined further. We prove that the algorithm converges and show a bound on the reduction of the approximation error in the course of the algorithm. We illustrate the behaviour of our algorithm using some numerical examples and compare its performance to an algorithm from literature.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"10 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141864727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tropical convexity in location problems 位置问题中的热带凸性
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-07-03 DOI: 10.1007/s00186-024-00869-w
Andrei Comăneci

We investigate location problems where the optimal solution is found within the tropical convex hull of the given input points. Our initial focus is on geodesically star-convex sets, using the asymmetric tropical distance. We introduce the concept of tropically quasiconvex functions, which have sub-level sets with this shape, and are closely related to monotonic functions. Our findings demonstrate that location problems using tropically quasiconvex functions as distance measures will result in an optimal solution within the tropical convex hull of the input points. We also extend this result to cases where the input points are replaced with tropically convex sets. Finally, we explore the applications of our research in phylogenetics, highlighting the properties of consensus methods that arise from our class of location problems.

我们研究了在给定输入点的热带凸壳内找到最优解的定位问题。我们最初的研究重点是使用非对称热带距离的大地星凸集。我们引入了热带准凸函数的概念,它具有这种形状的子级集,与单调函数密切相关。我们的研究结果表明,使用热带准凸函数作为距离度量的定位问题,会在输入点的热带凸壳范围内得到最优解。我们还将这一结果扩展到输入点被替换为热带凸集的情况。最后,我们探讨了我们的研究在系统发育学中的应用,强调了由我们这一类定位问题产生的共识方法的特性。
{"title":"Tropical convexity in location problems","authors":"Andrei Comăneci","doi":"10.1007/s00186-024-00869-w","DOIUrl":"https://doi.org/10.1007/s00186-024-00869-w","url":null,"abstract":"<p>We investigate location problems where the optimal solution is found within the tropical convex hull of the given input points. Our initial focus is on geodesically star-convex sets, using the asymmetric tropical distance. We introduce the concept of tropically quasiconvex functions, which have sub-level sets with this shape, and are closely related to monotonic functions. Our findings demonstrate that location problems using tropically quasiconvex functions as distance measures will result in an optimal solution within the tropical convex hull of the input points. We also extend this result to cases where the input points are replaced with tropically convex sets. Finally, we explore the applications of our research in phylogenetics, highlighting the properties of consensus methods that arise from our class of location problems.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"51 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrete-time stopping games with risk-sensitive discounted cost criterion 具有风险敏感贴现成本标准的离散时间停止博弈
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-07-02 DOI: 10.1007/s00186-024-00864-1
Wenzhao Zhang, Congying Liu

In this paper, we focus on the discrete-time stopping games under the risk-sensitive discounted cost criterion. The state space and the action spaces of all the players are assumed to be Borel spaces. The cost functions are allowed to be unbounded from above and from below. At each decision epoch, each player chooses an action to influence the transition laws, and player 1 incurs a running cost. If players 1 or 2 decides to stop the game, player 1 incurs a corresponding terminated cost. Under suitable hypothesis, we show that the game model has a value which is a unique solution of risk-sensitive stopping optimality equation by an approximation technique. Furthermore, we derive the existence of equilibria.

本文主要研究风险敏感贴现成本准则下的离散时间停止博弈。假设所有博弈者的状态空间和行动空间都是 Borel 空间。成本函数允许自上而下无约束。在每个决策时段,每个参与者都会选择一个行动来影响过渡规律,参与者 1 会产生运行成本。如果玩家 1 或 2 决定停止博弈,则玩家 1 会产生相应的终止成本。在合适的假设条件下,我们通过近似技术证明了博弈模型有一个值是风险敏感停止最优方程的唯一解。此外,我们还推导出了均衡的存在性。
{"title":"Discrete-time stopping games with risk-sensitive discounted cost criterion","authors":"Wenzhao Zhang, Congying Liu","doi":"10.1007/s00186-024-00864-1","DOIUrl":"https://doi.org/10.1007/s00186-024-00864-1","url":null,"abstract":"<p>In this paper, we focus on the discrete-time stopping games under the risk-sensitive discounted cost criterion. The state space and the action spaces of all the players are assumed to be Borel spaces. The cost functions are allowed to be unbounded from above and from below. At each decision epoch, each player chooses an action to influence the transition laws, and player 1 incurs a running cost. If players 1 or 2 decides to stop the game, player 1 incurs a corresponding terminated cost. Under suitable hypothesis, we show that the game model has a value which is a unique solution of risk-sensitive stopping optimality equation by an approximation technique. Furthermore, we derive the existence of equilibria.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"11 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convex optimization via inertial algorithms with vanishing Tikhonov regularization: fast convergence to the minimum norm solution 通过惯性算法的凸优化与消失的 Tikhonov 正则化:快速收敛至最小规范解
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-06-27 DOI: 10.1007/s00186-024-00867-y
Hedy Attouch, Szilárd Csaba László

In a Hilbertian framework, for the minimization of a general convex differentiable function f, we introduce new inertial dynamics and algorithms that generate trajectories and iterates that converge fastly towards the minimizer of f with minimum norm. Our study is based on the non-autonomous version of the Polyak heavy ball method, which, at time t, is associated with the strongly convex function obtained by adding to f a Tikhonov regularization term with vanishing coefficient (varepsilon (t)). In this dynamic, the damping coefficient is proportional to the square root of the Tikhonov regularization parameter (varepsilon (t)). By adjusting the speed of convergence of (varepsilon (t)) towards zero, we will obtain both rapid convergence towards the infimal value of f, and the strong convergence of the trajectories towards the element of minimum norm of the set of minimizers of f. In particular, we obtain an improved version of the dynamic of Su-Boyd-Candès for the accelerated gradient method of Nesterov. This study naturally leads to corresponding first-order algorithms obtained by temporal discretization. In the case of a proper lower semicontinuous and convex function f, we study the proximal algorithms in detail, and show that they benefit from similar properties.

在希尔伯特框架下,针对一般凸可微函数 f 的最小化问题,我们引入了新的惯性动力学和算法,其生成的轨迹和迭代可快速收敛至 f 的最小化且具有最小规范。我们的研究基于非自主版本的波利克重球方法,在时间 t 上,该方法与强凸函数相关联,强凸函数是通过在 f 上添加一个具有消失系数 (varepsilon (t)) 的 Tikhonov 正则化项而得到的。在这种动态中,阻尼系数与 Tikhonov 正则化参数 (varepsilon (t)) 的平方根成正比。通过将 (varepsilon (t)) 的收敛速度调整为零,我们将同时获得向 f 的次极值快速收敛和向 f 最小化集合的最小规范元素强收敛的轨迹。这项研究自然会引出通过时间离散化获得的相应一阶算法。在适当的下半连续凸函数 f 的情况下,我们详细研究了近似算法,并证明它们受益于类似的性质。
{"title":"Convex optimization via inertial algorithms with vanishing Tikhonov regularization: fast convergence to the minimum norm solution","authors":"Hedy Attouch, Szilárd Csaba László","doi":"10.1007/s00186-024-00867-y","DOIUrl":"https://doi.org/10.1007/s00186-024-00867-y","url":null,"abstract":"<p>In a Hilbertian framework, for the minimization of a general convex differentiable function <i>f</i>, we introduce new inertial dynamics and algorithms that generate trajectories and iterates that converge fastly towards the minimizer of <i>f</i> with minimum norm. Our study is based on the non-autonomous version of the Polyak heavy ball method, which, at time <i>t</i>, is associated with the strongly convex function obtained by adding to <i>f</i> a Tikhonov regularization term with vanishing coefficient <span>(varepsilon (t))</span>. In this dynamic, the damping coefficient is proportional to the square root of the Tikhonov regularization parameter <span>(varepsilon (t))</span>. By adjusting the speed of convergence of <span>(varepsilon (t))</span> towards zero, we will obtain both rapid convergence towards the infimal value of <i>f</i>, and the strong convergence of the trajectories towards the element of minimum norm of the set of minimizers of <i>f</i>. In particular, we obtain an improved version of the dynamic of Su-Boyd-Candès for the accelerated gradient method of Nesterov. This study naturally leads to corresponding first-order algorithms obtained by temporal discretization. In the case of a proper lower semicontinuous and convex function <i>f</i>, we study the proximal algorithms in detail, and show that they benefit from similar properties.\u0000</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"196 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic upper bounds for an M/M/C/K retrial queue with a guard channel and guard buffer 带保护通道和保护缓冲区的 M/M/C/K 重审队列的渐近上限
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-06-26 DOI: 10.1007/s00186-024-00865-0
Nesrine Zidani, Natalia Djellab

The paper deals with Markovian multiserver retrial queuing system with exponential abandonments, two types of arrivals: Fresh calls and Handover calls and waiting places in the service area. This model can be used for analysing a cellular mobile network, where the service area is divided into cells. In this paper, the number of customers in the system and in the orbit form a level-dependent quasi-birth-and-death process, whose stationary distribution is expressed in terms of a sequence of rate matrices. First, we derive the Taylor series expansion for nonzero elements of the rate matrices. Then, by the expansion results, we obtain an asymptotic upper bound for the stationary distribution of both the number of busy channels and the number of customers in the orbit. Furthermore, we present some numerical results to examine the performance of the system.

本文论述的是马尔可夫多服务器重试排队系统,该系统具有指数放弃、两种到达类型:新呼叫和移交呼叫以及服务区的等待位置。该模型可用于分析蜂窝移动网络,其中服务区被划分为多个小区。在本文中,系统中和轨道上的客户数构成了一个与等级相关的准生死过程,其静态分布用速率矩阵序列表示。首先,我们推导出速率矩阵非零元素的泰勒级数展开。然后,根据扩展结果,我们得到了繁忙信道数和轨道中客户数的静态分布的渐近上限。此外,我们还给出了一些数值结果,以检验系统的性能。
{"title":"Asymptotic upper bounds for an M/M/C/K retrial queue with a guard channel and guard buffer","authors":"Nesrine Zidani, Natalia Djellab","doi":"10.1007/s00186-024-00865-0","DOIUrl":"https://doi.org/10.1007/s00186-024-00865-0","url":null,"abstract":"<p>The paper deals with Markovian multiserver retrial queuing system with exponential abandonments, two types of arrivals: Fresh calls and Handover calls and waiting places in the service area. This model can be used for analysing a cellular mobile network, where the service area is divided into cells. In this paper, the number of customers in the system and in the orbit form a level-dependent quasi-birth-and-death process, whose stationary distribution is expressed in terms of a sequence of rate matrices. First, we derive the Taylor series expansion for nonzero elements of the rate matrices. Then, by the expansion results, we obtain an asymptotic upper bound for the stationary distribution of both the number of busy channels and the number of customers in the orbit. Furthermore, we present some numerical results to examine the performance of the system.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"168 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence rate of LQG mean field games with common noise 具有共同噪声的 LQG 平均场博弈的收敛率
IF 1.2 4区 数学 Q3 MATHEMATICS, APPLIED Pub Date : 2024-06-25 DOI: 10.1007/s00186-024-00863-2
Jiamin Jian, Qingshuo Song, Jiaxuan Ye

This paper focuses on exploring the convergence properties of a generic player’s trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the N-player game and utilizes the associated mean field games framework.

本文重点探讨了 N 人线性-二次方-高斯纳什博弈中一般博弈者的轨迹和经验度量的收敛特性,其中布朗运动是常见噪声。该研究确定了代表棋手和经验度量的三种不同收敛率。为了研究收敛性,该方法依赖于对 N 人博弈中均衡路径的特定分解,并利用相关的均值场博弈框架。
{"title":"Convergence rate of LQG mean field games with common noise","authors":"Jiamin Jian, Qingshuo Song, Jiaxuan Ye","doi":"10.1007/s00186-024-00863-2","DOIUrl":"https://doi.org/10.1007/s00186-024-00863-2","url":null,"abstract":"<p>This paper focuses on exploring the convergence properties of a generic player’s trajectory and empirical measures in an <i>N</i>-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the <i>N</i>-player game and utilizes the associated mean field games framework.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"131 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mathematical Methods of Operations Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1