Pub Date : 2024-09-05DOI: 10.1007/s00186-024-00868-x
Keqin Liu, Richard Weber, Chengzhong Zhang
We consider a class of restless bandit problems that finds a broad application area in reinforcement learning and stochastic optimization. We consider N independent discrete-time Markov processes, each of which had two possible states: 1 and 0 (‘good’ and ‘bad’). Only if a process is both in state 1 and observed to be so does reward accrue. The aim is to maximize the expected discounted sum of returns over the infinite horizon subject to a constraint that only M((<N)) processes may be observed at each step. Observation is error-prone: there are known probabilities that state 1 (0) will be observed as 0 (1). From this one knows, at any time t, a probability that process i is in state 1. The resulting system may be modeled as a restless multi-armed bandit problem with an information state space of uncountable cardinality. Restless bandit problems with even finite state spaces are PSPACE-HARD in general. We propose a novel approach for simplifying the dynamic programming equations of this class of restless bandits and develop a low-complexity algorithm that achieves a strong performance and is readily extensible to the general restless bandit model with observation errors. Under certain conditions, we establish the existence (indexability) of Whittle index and its equivalence to our algorithm. When those conditions do not hold, we show by numerical experiments the near-optimal performance of our algorithm in the general parametric space. Furthermore, we theoretically prove the optimality of our algorithm for homogeneous systems.
我们考虑的是一类不安定的强盗问题,它在强化学习和随机优化中有着广泛的应用。我们考虑 N 个独立的离散时间马尔可夫过程,每个过程都有两种可能的状态:1 和 0("好 "和 "坏"):1和0("好 "和 "坏")。只有当进程处于状态 1 并被观察到时,才会产生奖励。我们的目标是在每一步只能观察到 M ((<N))个过程的约束下,最大化无限期内的预期贴现收益总和。观察是容易出错的:状态 1(0)被观察为 0(1)的概率是已知的。由此可以知道,在任何时间 t,进程 i 处于状态 1 的概率。由此产生的系统可以建模为一个不安定的多臂强盗问题,其信息状态空间具有不可计数的卡方性。一般来说,即使是有限状态空间的无休止强盗问题也是 PSPACE-HARD(空间困难)的。我们提出了一种简化该类无休止强盗动态程序方程的新方法,并开发了一种低复杂度算法,该算法性能优异,可随时扩展到具有观测误差的一般无休止强盗模型。在某些条件下,我们建立了惠特尔指数的存在性(可索引性)及其与我们算法的等价性。当这些条件不成立时,我们通过数值实验证明了我们的算法在一般参数空间中接近最优的性能。此外,我们还从理论上证明了我们的算法对于同质系统的最优性。
{"title":"Low-complexity algorithm for restless bandits with imperfect observations","authors":"Keqin Liu, Richard Weber, Chengzhong Zhang","doi":"10.1007/s00186-024-00868-x","DOIUrl":"https://doi.org/10.1007/s00186-024-00868-x","url":null,"abstract":"<p>We consider a class of restless bandit problems that finds a broad application area in reinforcement learning and stochastic optimization. We consider <i>N</i> independent discrete-time Markov processes, each of which had two possible states: 1 and 0 (‘good’ and ‘bad’). Only if a process is both in state 1 and observed to be so does reward accrue. The aim is to maximize the expected discounted sum of returns over the infinite horizon subject to a constraint that only <i>M</i> <span>((<N))</span> processes may be observed at each step. Observation is error-prone: there are known probabilities that state 1 (0) will be observed as 0 (1). From this one knows, at any time <i>t</i>, a probability that process <i>i</i> is in state 1. The resulting system may be modeled as a restless multi-armed bandit problem with an information state space of uncountable cardinality. Restless bandit problems with even finite state spaces are PSPACE-HARD in general. We propose a novel approach for simplifying the dynamic programming equations of this class of restless bandits and develop a low-complexity algorithm that achieves a strong performance and is readily extensible to the general restless bandit model with observation errors. Under certain conditions, we establish the existence (indexability) of Whittle index and its equivalence to our algorithm. When those conditions do not hold, we show by numerical experiments the near-optimal performance of our algorithm in the general parametric space. Furthermore, we theoretically prove the optimality of our algorithm for homogeneous systems.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"15 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142201491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-14DOI: 10.1007/s00186-024-00872-1
Wentao Ma, Zhiping Chen
The existent methods for constructing ambiguity sets in distributionally robust optimization often suffer from over-conservativeness and inefficient utilization of available data. To address these limitations and to practically solve multi-stage distributionally robust optimization (MDRO), we propose a data-driven Bayesian-type approach that constructs the ambiguity set of possible distributions from a Bayesian perspective. We demonstrate that our Bayesian-type MDRO problem can be reformulated as a risk-averse multi-stage stochastic programming problem and subsequently investigate its theoretical properties such as consistency, finite sample guarantee, and statistical robustness. Moreover, the reformulation enables us to employ cutting planes algorithms in dynamic settings to solve the Bayesian-type MDRO problem. To illustrate the practicality and advantages of the proposed model and algorithm, we apply it to a distributionally robust inventory control problem and a distributionally robust hydrothermal scheduling problem, and compare it with usual formulations and solution methods to highlight the superior performance of our approach.
{"title":"Multi-stage distributionally robust convex stochastic optimization with Bayesian-type ambiguity sets","authors":"Wentao Ma, Zhiping Chen","doi":"10.1007/s00186-024-00872-1","DOIUrl":"https://doi.org/10.1007/s00186-024-00872-1","url":null,"abstract":"<p>The existent methods for constructing ambiguity sets in distributionally robust optimization often suffer from over-conservativeness and inefficient utilization of available data. To address these limitations and to practically solve multi-stage distributionally robust optimization (MDRO), we propose a data-driven Bayesian-type approach that constructs the ambiguity set of possible distributions from a Bayesian perspective. We demonstrate that our Bayesian-type MDRO problem can be reformulated as a risk-averse multi-stage stochastic programming problem and subsequently investigate its theoretical properties such as consistency, finite sample guarantee, and statistical robustness. Moreover, the reformulation enables us to employ cutting planes algorithms in dynamic settings to solve the Bayesian-type MDRO problem. To illustrate the practicality and advantages of the proposed model and algorithm, we apply it to a distributionally robust inventory control problem and a distributionally robust hydrothermal scheduling problem, and compare it with usual formulations and solution methods to highlight the superior performance of our approach.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"15 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142201492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-12DOI: 10.1007/s00186-024-00873-0
Daniel Li Li, Erfang Shan
A communication situation (N, v, H) consists of a cooperative game (N, v) and a communication hypergraph (N, H), for which the Myerson value and the position value are well-known allocation rules. The value defined in this paper treats links in H as imaginal players, for which we define a bipartite graph between N and H according to the structure given by H, and propose an allocation rule called the bipartite value. This value assigns payoff to each player with two parts: as a player and as a member in links. A characterization of the bipartite value is given.
通信情境(N,v,H)由合作博弈(N,v)和通信超图(N,H)组成,其中迈尔森值和位置值是众所周知的分配规则。本文定义的值将 H 中的链接视为意象玩家,为此我们根据 H 给出的结构定义了 N 和 H 之间的双向图,并提出了一种名为双向值的分配规则。该值将每个玩家的报酬分为两部分:作为玩家的报酬和作为链接中成员的报酬。本文给出了双向值的特征。
{"title":"A new value for communication situations","authors":"Daniel Li Li, Erfang Shan","doi":"10.1007/s00186-024-00873-0","DOIUrl":"https://doi.org/10.1007/s00186-024-00873-0","url":null,"abstract":"<p>A communication situation (<i>N</i>, <i>v</i>, <i>H</i>) consists of a cooperative game (<i>N</i>, <i>v</i>) and a communication hypergraph (<i>N</i>, <i>H</i>), for which the Myerson value and the position value are well-known allocation rules. The value defined in this paper treats links in <i>H</i> as imaginal players, for which we define a bipartite graph between <i>N</i> and <i>H</i> according to the structure given by <i>H</i>, and propose an allocation rule called the bipartite value. This value assigns payoff to each player with two parts: as a player and as a member in links. A characterization of the bipartite value is given.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"58 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141945415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-02DOI: 10.1007/s00186-024-00871-2
Samira Fallah, Ted K. Ralphs, Natashia L. Boland
In this study, we investigate the connection between the efficient frontier (EF) of a general multiobjective mixed integer linear optimization problem (MILP) and the so-called restricted value function (RVF) of a closely related single-objective MILP. In the first part of the paper, we detail the mathematical structure of the RVF, including characterizing the set of points at which it is differentiable, the gradients at such points, and the subdifferential at all nondifferentiable points. We then show that the EF of the multiobjective MILP is comprised of points on the boundary of the epigraph of the RVF and that any description of the EF suffices to describe the RVF and vice versa. Because of the close relationship of the RVF to the EF, we observe that methods for constructing the so-called value function (VF) of an MILP and methods for constructing the EF of a multiobjective optimization problem are effectively interchangeable. Exploiting this observation, we propose a generalized cutting-plane algorithm for constructing the EF of a multiobjective MILP that arises from an existing algorithm for constructing the classical MILP VF. The algorithm identifies the set of all integer parts of solutions on the EF. We prove that the algorithm converges finitely under a standard boundedness assumption and comes with a performance guarantee if terminated early.
在本研究中,我们探讨了一般多目标混合整数线性优化问题(MILP)的有效前沿(EF)与密切相关的单目标 MILP 的所谓受限值函数(RVF)之间的联系。在本文的第一部分,我们将详细介绍 RVF 的数学结构,包括描述其可微分的点集、这些点上的梯度以及所有不可微分点上的次微分。然后,我们证明多目标 MILP 的 EF 是由 RVF 边界上的点组成的,对 EF 的任何描述都足以描述 RVF,反之亦然。由于 RVF 与 EF 关系密切,我们发现构建 MILP 的所谓值函数 (VF) 的方法和构建多目标优化问题的 EF 的方法实际上是可以互换的。利用这一观察结果,我们提出了一种用于构建多目标 MILP EF 的广义切割面算法,该算法源于构建经典 MILP VF 的现有算法。该算法确定了 EF 上所有整数部分解的集合。我们证明,该算法在标准有界性假设下有限收敛,而且如果提前终止,还能保证性能。
{"title":"On the relationship between the value function and the efficient frontier of a mixed integer linear optimization problem","authors":"Samira Fallah, Ted K. Ralphs, Natashia L. Boland","doi":"10.1007/s00186-024-00871-2","DOIUrl":"https://doi.org/10.1007/s00186-024-00871-2","url":null,"abstract":"<p>In this study, we investigate the connection between the efficient frontier (EF) of a general multiobjective mixed integer linear optimization problem (MILP) and the so-called <i>restricted value function</i> (RVF) of a closely related single-objective MILP. In the first part of the paper, we detail the mathematical structure of the RVF, including characterizing the set of points at which it is differentiable, the gradients at such points, and the subdifferential at all nondifferentiable points. We then show that the EF of the multiobjective MILP is comprised of points on the boundary of the epigraph of the RVF and that any description of the EF suffices to describe the RVF and vice versa. Because of the close relationship of the RVF to the EF, we observe that methods for constructing the so-called value function (VF) of an MILP and methods for constructing the EF of a multiobjective optimization problem are effectively interchangeable. Exploiting this observation, we propose a generalized cutting-plane algorithm for constructing the EF of a multiobjective MILP that arises from an existing algorithm for constructing the classical MILP VF. The algorithm identifies the set of all integer parts of solutions on the EF. We prove that the algorithm converges finitely under a standard boundedness assumption and comes with a performance guarantee if terminated early.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"56 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141880569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-29DOI: 10.1007/s00186-024-00870-3
Ina Lammel, Karl-Heinz Küfer, Philipp Süss
In this article we introduce an algorithm that approximates the nondominated sets of multiobjective mixed-integer convex optimization problems. The algorithm constructs an inner and outer approximation of the front exploiting the convexity of the patches for problems with an arbitrary number of criteria. In the algorithm, the problem is decomposed into patches, which are multiobjective convex problems, by fixing the integer assignments. The patch problems are solved using (simplicial) Sandwiching. We identify parts of patches that are dominated by other patches and ensure that these patch parts are not refined further. We prove that the algorithm converges and show a bound on the reduction of the approximation error in the course of the algorithm. We illustrate the behaviour of our algorithm using some numerical examples and compare its performance to an algorithm from literature.
{"title":"An approximation algorithm for multiobjective mixed-integer convex optimization","authors":"Ina Lammel, Karl-Heinz Küfer, Philipp Süss","doi":"10.1007/s00186-024-00870-3","DOIUrl":"https://doi.org/10.1007/s00186-024-00870-3","url":null,"abstract":"<p>In this article we introduce an algorithm that approximates the nondominated sets of multiobjective mixed-integer convex optimization problems. The algorithm constructs an inner and outer approximation of the front exploiting the convexity of the patches for problems with an arbitrary number of criteria. In the algorithm, the problem is decomposed into patches, which are multiobjective convex problems, by fixing the integer assignments. The patch problems are solved using (simplicial) Sandwiching. We identify parts of patches that are dominated by other patches and ensure that these patch parts are not refined further. We prove that the algorithm converges and show a bound on the reduction of the approximation error in the course of the algorithm. We illustrate the behaviour of our algorithm using some numerical examples and compare its performance to an algorithm from literature.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"10 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141864727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-03DOI: 10.1007/s00186-024-00869-w
Andrei Comăneci
We investigate location problems where the optimal solution is found within the tropical convex hull of the given input points. Our initial focus is on geodesically star-convex sets, using the asymmetric tropical distance. We introduce the concept of tropically quasiconvex functions, which have sub-level sets with this shape, and are closely related to monotonic functions. Our findings demonstrate that location problems using tropically quasiconvex functions as distance measures will result in an optimal solution within the tropical convex hull of the input points. We also extend this result to cases where the input points are replaced with tropically convex sets. Finally, we explore the applications of our research in phylogenetics, highlighting the properties of consensus methods that arise from our class of location problems.
{"title":"Tropical convexity in location problems","authors":"Andrei Comăneci","doi":"10.1007/s00186-024-00869-w","DOIUrl":"https://doi.org/10.1007/s00186-024-00869-w","url":null,"abstract":"<p>We investigate location problems where the optimal solution is found within the tropical convex hull of the given input points. Our initial focus is on geodesically star-convex sets, using the asymmetric tropical distance. We introduce the concept of tropically quasiconvex functions, which have sub-level sets with this shape, and are closely related to monotonic functions. Our findings demonstrate that location problems using tropically quasiconvex functions as distance measures will result in an optimal solution within the tropical convex hull of the input points. We also extend this result to cases where the input points are replaced with tropically convex sets. Finally, we explore the applications of our research in phylogenetics, highlighting the properties of consensus methods that arise from our class of location problems.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"51 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1007/s00186-024-00864-1
Wenzhao Zhang, Congying Liu
In this paper, we focus on the discrete-time stopping games under the risk-sensitive discounted cost criterion. The state space and the action spaces of all the players are assumed to be Borel spaces. The cost functions are allowed to be unbounded from above and from below. At each decision epoch, each player chooses an action to influence the transition laws, and player 1 incurs a running cost. If players 1 or 2 decides to stop the game, player 1 incurs a corresponding terminated cost. Under suitable hypothesis, we show that the game model has a value which is a unique solution of risk-sensitive stopping optimality equation by an approximation technique. Furthermore, we derive the existence of equilibria.
{"title":"Discrete-time stopping games with risk-sensitive discounted cost criterion","authors":"Wenzhao Zhang, Congying Liu","doi":"10.1007/s00186-024-00864-1","DOIUrl":"https://doi.org/10.1007/s00186-024-00864-1","url":null,"abstract":"<p>In this paper, we focus on the discrete-time stopping games under the risk-sensitive discounted cost criterion. The state space and the action spaces of all the players are assumed to be Borel spaces. The cost functions are allowed to be unbounded from above and from below. At each decision epoch, each player chooses an action to influence the transition laws, and player 1 incurs a running cost. If players 1 or 2 decides to stop the game, player 1 incurs a corresponding terminated cost. Under suitable hypothesis, we show that the game model has a value which is a unique solution of risk-sensitive stopping optimality equation by an approximation technique. Furthermore, we derive the existence of equilibria.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"11 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-27DOI: 10.1007/s00186-024-00867-y
Hedy Attouch, Szilárd Csaba László
In a Hilbertian framework, for the minimization of a general convex differentiable function f, we introduce new inertial dynamics and algorithms that generate trajectories and iterates that converge fastly towards the minimizer of f with minimum norm. Our study is based on the non-autonomous version of the Polyak heavy ball method, which, at time t, is associated with the strongly convex function obtained by adding to f a Tikhonov regularization term with vanishing coefficient (varepsilon (t)). In this dynamic, the damping coefficient is proportional to the square root of the Tikhonov regularization parameter (varepsilon (t)). By adjusting the speed of convergence of (varepsilon (t)) towards zero, we will obtain both rapid convergence towards the infimal value of f, and the strong convergence of the trajectories towards the element of minimum norm of the set of minimizers of f. In particular, we obtain an improved version of the dynamic of Su-Boyd-Candès for the accelerated gradient method of Nesterov. This study naturally leads to corresponding first-order algorithms obtained by temporal discretization. In the case of a proper lower semicontinuous and convex function f, we study the proximal algorithms in detail, and show that they benefit from similar properties.
在希尔伯特框架下,针对一般凸可微函数 f 的最小化问题,我们引入了新的惯性动力学和算法,其生成的轨迹和迭代可快速收敛至 f 的最小化且具有最小规范。我们的研究基于非自主版本的波利克重球方法,在时间 t 上,该方法与强凸函数相关联,强凸函数是通过在 f 上添加一个具有消失系数 (varepsilon (t)) 的 Tikhonov 正则化项而得到的。在这种动态中,阻尼系数与 Tikhonov 正则化参数 (varepsilon (t)) 的平方根成正比。通过将 (varepsilon (t)) 的收敛速度调整为零,我们将同时获得向 f 的次极值快速收敛和向 f 最小化集合的最小规范元素强收敛的轨迹。这项研究自然会引出通过时间离散化获得的相应一阶算法。在适当的下半连续凸函数 f 的情况下,我们详细研究了近似算法,并证明它们受益于类似的性质。
{"title":"Convex optimization via inertial algorithms with vanishing Tikhonov regularization: fast convergence to the minimum norm solution","authors":"Hedy Attouch, Szilárd Csaba László","doi":"10.1007/s00186-024-00867-y","DOIUrl":"https://doi.org/10.1007/s00186-024-00867-y","url":null,"abstract":"<p>In a Hilbertian framework, for the minimization of a general convex differentiable function <i>f</i>, we introduce new inertial dynamics and algorithms that generate trajectories and iterates that converge fastly towards the minimizer of <i>f</i> with minimum norm. Our study is based on the non-autonomous version of the Polyak heavy ball method, which, at time <i>t</i>, is associated with the strongly convex function obtained by adding to <i>f</i> a Tikhonov regularization term with vanishing coefficient <span>(varepsilon (t))</span>. In this dynamic, the damping coefficient is proportional to the square root of the Tikhonov regularization parameter <span>(varepsilon (t))</span>. By adjusting the speed of convergence of <span>(varepsilon (t))</span> towards zero, we will obtain both rapid convergence towards the infimal value of <i>f</i>, and the strong convergence of the trajectories towards the element of minimum norm of the set of minimizers of <i>f</i>. In particular, we obtain an improved version of the dynamic of Su-Boyd-Candès for the accelerated gradient method of Nesterov. This study naturally leads to corresponding first-order algorithms obtained by temporal discretization. In the case of a proper lower semicontinuous and convex function <i>f</i>, we study the proximal algorithms in detail, and show that they benefit from similar properties.\u0000</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"196 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-26DOI: 10.1007/s00186-024-00865-0
Nesrine Zidani, Natalia Djellab
The paper deals with Markovian multiserver retrial queuing system with exponential abandonments, two types of arrivals: Fresh calls and Handover calls and waiting places in the service area. This model can be used for analysing a cellular mobile network, where the service area is divided into cells. In this paper, the number of customers in the system and in the orbit form a level-dependent quasi-birth-and-death process, whose stationary distribution is expressed in terms of a sequence of rate matrices. First, we derive the Taylor series expansion for nonzero elements of the rate matrices. Then, by the expansion results, we obtain an asymptotic upper bound for the stationary distribution of both the number of busy channels and the number of customers in the orbit. Furthermore, we present some numerical results to examine the performance of the system.
{"title":"Asymptotic upper bounds for an M/M/C/K retrial queue with a guard channel and guard buffer","authors":"Nesrine Zidani, Natalia Djellab","doi":"10.1007/s00186-024-00865-0","DOIUrl":"https://doi.org/10.1007/s00186-024-00865-0","url":null,"abstract":"<p>The paper deals with Markovian multiserver retrial queuing system with exponential abandonments, two types of arrivals: Fresh calls and Handover calls and waiting places in the service area. This model can be used for analysing a cellular mobile network, where the service area is divided into cells. In this paper, the number of customers in the system and in the orbit form a level-dependent quasi-birth-and-death process, whose stationary distribution is expressed in terms of a sequence of rate matrices. First, we derive the Taylor series expansion for nonzero elements of the rate matrices. Then, by the expansion results, we obtain an asymptotic upper bound for the stationary distribution of both the number of busy channels and the number of customers in the orbit. Furthermore, we present some numerical results to examine the performance of the system.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"168 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-25DOI: 10.1007/s00186-024-00863-2
Jiamin Jian, Qingshuo Song, Jiaxuan Ye
This paper focuses on exploring the convergence properties of a generic player’s trajectory and empirical measures in an N-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the N-player game and utilizes the associated mean field games framework.
本文重点探讨了 N 人线性-二次方-高斯纳什博弈中一般博弈者的轨迹和经验度量的收敛特性,其中布朗运动是常见噪声。该研究确定了代表棋手和经验度量的三种不同收敛率。为了研究收敛性,该方法依赖于对 N 人博弈中均衡路径的特定分解,并利用相关的均值场博弈框架。
{"title":"Convergence rate of LQG mean field games with common noise","authors":"Jiamin Jian, Qingshuo Song, Jiaxuan Ye","doi":"10.1007/s00186-024-00863-2","DOIUrl":"https://doi.org/10.1007/s00186-024-00863-2","url":null,"abstract":"<p>This paper focuses on exploring the convergence properties of a generic player’s trajectory and empirical measures in an <i>N</i>-player Linear-Quadratic-Gaussian Nash game, where Brownian motion serves as the common noise. The study establishes three distinct convergence rates concerning the representative player and empirical measure. To investigate the convergence, the methodology relies on a specific decomposition of the equilibrium path in the <i>N</i>-player game and utilizes the associated mean field games framework.</p>","PeriodicalId":49862,"journal":{"name":"Mathematical Methods of Operations Research","volume":"131 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}