Stochastic Systems最新文献

Sharp Waiting-Time Bounds for Multiserver Jobs 多服务器工作的急剧等待时间界限

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-08-26 DOI: 10.1287/stsy.2023.0006

Yige Hong, Weina Wang

Multiserver jobs, which are jobs that occupy multiple servers simultaneously during service, are prevalent in today’s computing clusters. But, little is known about the delay performance of systems with multiserver jobs. We consider queueing models for multiserver jobs in scaling regimes where the system load becomes heavy and meanwhile, the total number of servers in the system and the number of servers that a job needs become large. Prior work has derived upper bounds on the queueing probability in this scaling regime. However, without proper lower bounds, the existing results cannot be used to differentiate between policies. In this paper, we study the delay performance by establishing sharp bounds on the steady-state mean waiting time of multiserver jobs, where the waiting time of a job is the time spent in queueing rather than in service. We first characterize the exact order of the mean waiting time under the first come, first serve (FCFS) policy. Then, we prove a lower bound on the mean waiting time of all policies, which has an order gap with the mean waiting time under FCFS. We show that the lower bound is achievable by a priority policy that we call smallest need first (SNF).Funding: This research was supported in part by the National Science Foundation [Grant ECCS-2145713].Supplemental Material: The online appendix is available at https://doi.org/10.1287/stsy.2023.0006 .

多服务器作业是指在服务过程中同时占用多个服务器的作业，在当今的计算集群中非常普遍。但是，人们对多服务器作业系统的延迟性能知之甚少。我们考虑了多服务器作业在系统负载变得很重，同时系统中服务器总数和作业所需的服务器数量变得很大的扩展状态下的排队模型。之前的工作已经推导出了这种扩展机制下的排队概率上限。但是，由于没有适当的下限，现有结果无法用于区分不同的策略。在本文中，我们通过建立多服务器作业稳态平均等待时间的尖锐界限来研究延迟性能，其中作业的等待时间是指排队时间，而不是服务时间。我们首先描述了先来先服务（FCFS）策略下平均等待时间的精确阶数。然后，我们证明了所有策略下平均等待时间的下限，该下限与 FCFS 下的平均等待时间存在阶差。我们证明，我们称之为 "最小需求优先（SNF）"的优先策略可以实现该下限：本研究得到了美国国家科学基金会[Grant ECCS-2145713]的部分资助：在线附录可在 https://doi.org/10.1287/stsy.2023.0006 上获取。

{"title":"Sharp Waiting-Time Bounds for Multiserver Jobs","authors":"Yige Hong, Weina Wang","doi":"10.1287/stsy.2023.0006","DOIUrl":"https://doi.org/10.1287/stsy.2023.0006","url":null,"abstract":"Multiserver jobs, which are jobs that occupy multiple servers simultaneously during service, are prevalent in today’s computing clusters. But, little is known about the delay performance of systems with multiserver jobs. We consider queueing models for multiserver jobs in scaling regimes where the system load becomes heavy and meanwhile, the total number of servers in the system and the number of servers that a job needs become large. Prior work has derived upper bounds on the queueing probability in this scaling regime. However, without proper lower bounds, the existing results cannot be used to differentiate between policies. In this paper, we study the delay performance by establishing sharp bounds on the steady-state mean waiting time of multiserver jobs, where the waiting time of a job is the time spent in queueing rather than in service. We first characterize the exact order of the mean waiting time under the first come, first serve (FCFS) policy. Then, we prove a lower bound on the mean waiting time of all policies, which has an order gap with the mean waiting time under FCFS. We show that the lower bound is achievable by a priority policy that we call smallest need first (SNF).Funding: This research was supported in part by the National Science Foundation [Grant ECCS-2145713].Supplemental Material: The online appendix is available at https://doi.org/10.1287/stsy.2023.0006 .","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"81 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Asymptotic Optimality of Switched Control Policies in a Simple Parallel Server System Under an Extended Heavy Traffic Condition 大流量扩展条件下简单并行服务器系统中切换控制策略的渐近最优性

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-07-26 DOI: 10.1287/stsy.2022.0022

Rami Atar, Eyal Castiel, Martin I. Reiman

This paper studies a two-class, two-server parallel server system under the recently introduced extended heavy traffic condition, which states that the underlying “static allocation” linear program (LP) is critical, but does not require that it has a unique solution. The main result is the construction of policies that asymptotically achieve previously proved a lower bound, on an expected discounted linear combination of diffusion-scaled queue lengths and are therefore asymptotically optimal (AO). Each extreme point solution to the LP determines a control mode—that is, a set of activities (class-server pairs) that are operational. When there are multiple solutions, these modes can be selected dynamically. It is shown that the number of modes required for AO is either one or two. In the latter case, there is a switching point in the (normalized) workload domain, characterized in terms of a free boundary problem. Our policies are defined by identifying pairs of elementary policies and switching between them at this switching point. They provide the first example in the heavy traffic literature where weak limits under an AO policy are given by a diffusion process where both the drift and diffusion coefficients are discontinuous.Funding: R. Atar is supported by the Israel Science Foundation [Grant 1035/20].

本文研究了最近引入的扩展大流量条件下的两类双服务器并行服务器系统，该条件指出底层 "静态分配 "线性规划（LP）是关键的，但并不要求它有唯一的解。主要结果是构建了一些策略，这些策略可以渐进地达到之前证明的下限，即扩散尺度队列长度的预期折现线性组合，因此是渐进最优的（AO）。LP 的每个极值点解决方案都决定了一种控制模式，即一组可运行的活动（班级-服务器对）。当有多个解时，可以动态选择这些模式。结果表明，AO 所需的模式数量要么是一个，要么是两个。在后一种情况下，（归一化）工作量域中存在一个切换点，其特征是自由边界问题。我们的策略是通过识别成对的基本策略并在该切换点进行切换来定义的。它们提供了大流量文献中的第一个例子，即 AO 政策下的弱限制是由漂移和扩散系数都不连续的扩散过程给出的：R. Atar 由以色列科学基金会 [Grant 1035/20] 资助。

{"title":"Asymptotic Optimality of Switched Control Policies in a Simple Parallel Server System Under an Extended Heavy Traffic Condition","authors":"Rami Atar, Eyal Castiel, Martin I. Reiman","doi":"10.1287/stsy.2022.0022","DOIUrl":"https://doi.org/10.1287/stsy.2022.0022","url":null,"abstract":"This paper studies a two-class, two-server parallel server system under the recently introduced extended heavy traffic condition, which states that the underlying “static allocation” linear program (LP) is critical, but does not require that it has a unique solution. The main result is the construction of policies that asymptotically achieve previously proved a lower bound, on an expected discounted linear combination of diffusion-scaled queue lengths and are therefore asymptotically optimal (AO). Each extreme point solution to the LP determines a control mode—that is, a set of activities (class-server pairs) that are operational. When there are multiple solutions, these modes can be selected dynamically. It is shown that the number of modes required for AO is either one or two. In the latter case, there is a switching point in the (normalized) workload domain, characterized in terms of a free boundary problem. Our policies are defined by identifying pairs of elementary policies and switching between them at this switching point. They provide the first example in the heavy traffic literature where weak limits under an AO policy are given by a diffusion process where both the drift and diffusion coefficients are discontinuous.Funding: R. Atar is supported by the Israel Science Foundation [Grant 1035/20].","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Distributionally Robust Observable Strategic Queues 分布稳健的可观测战略队列

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-05-16 DOI: 10.1287/stsy.2022.0009

Yijie Wang, Madhushini Narayana Prasad, Grani A. Hanasusanto, John J. Hasenbein

This paper presents an extension of Naor’s analysis on the join-or-balk problem in observable M/M/1 queues. Although all other Markovian assumptions still hold, we explore this problem assuming uncertain arrival rates under the distributionally robust settings. We first study the problem with the classical moment ambiguity set, where the support, mean, and mean-absolute deviation of the underlying distribution are known. Next, we extend the model to the data-driven setting, where decision makers only have access to a finite set of samples. We develop three optimal joining threshold strategies from the perspectives of an individual customer, a social optimizer, and a revenue maximizer such that their respective worst-case expected benefit rates are maximized. Finally, we compare our findings with Naor’s original results and the traditional sample average approximation scheme.Funding: This research was supported by the National Science Foundation [Grants 2342505 and 2343869].

本文扩展了 Naor 对可观测 M/M/1 队列中加入或逡巡问题的分析。尽管所有其他马尔可夫假设仍然成立，但我们探讨了在分布稳健设置下假设不确定到达率的问题。我们首先研究了经典矩模糊集问题，在这种情况下，底层分布的支持度、平均值和平均绝对偏差都是已知的。接下来，我们将模型扩展到数据驱动设置，即决策者只能获得有限的样本集。我们从个人客户、社会最优化者和收益最大化者的角度出发，制定了三种最优加入阈值策略，从而使各自的最坏情况预期收益率最大化。最后，我们将研究结果与纳奥尔的原始结果和传统的样本平均近似方案进行了比较：本研究得到了美国国家科学基金会 [2342505 和 2343869] 的资助。

引用次数: 0

The BAR Approach for Multiclass Queueing Networks with SBP Service Policies 采用 SBP 服务策略的多类排队网络的 BAR 方法

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-05-02 DOI: 10.1287/stsy.2023.0011

Anton Braverman, J. G. Dai, Masakiyo Miyazawa

The basic adjoint relationship (BAR) approach is an analysis technique based on the stationary equation of a Markov process. This approach was introduced to study heavy-traffic, steady-state convergence of generalized Jackson networks in which each service station has a single job class. We extend it to multiclass queueing networks operating under static-buffer-priority (SBP) service disciplines. Our extension makes a connection with Palm distributions that allows one to attack a difficulty arising from queue-length truncation, which appears to be unavoidable in the multiclass setting. For multiclass queueing networks operating under SBP service disciplines, our BAR approach provides an alternative to the “interchange of limits” approach that has dominated the literature in the last twenty years. The BAR approach can produce sharp results and allows one to establish steady-state convergence under three additional conditions: stability, state space collapse (SSC) and a certain matrix being “tight.” These three conditions do not appear to depend on the interarrival and service-time distributions beyond their means, and their verification can be studied as three separate modules. In particular, they can be studied in a simpler, continuous-time Markov chain setting when all distributions are exponential. As an example, these three conditions are shown to hold in reentrant lines operating under last-buffer-first-serve discipline. In a two-station, five-class reentrant line, under the heavy-traffic condition, the tight-matrix condition implies both the stability condition and the SSC condition. Whether such a relationship holds generally is an open problem.

基本邻接关系（BAR）方法是一种基于马尔可夫过程静态方程的分析技术。引入这种方法是为了研究广义杰克逊网络的大流量稳态收敛问题，在这种网络中，每个服务站都有一个作业类别。我们将其扩展到在静态缓冲优先（SBP）服务规则下运行的多级排队网络。我们的扩展将 Palm 分布联系起来，从而解决了队列长度截断所带来的难题，这在多类别设置中似乎是不可避免的。对于在 SBP 服务规则下运行的多类队列网络，我们的 BAR 方法提供了一种替代方法，可用于替代过去二十年中占主导地位的 "极限互换 "方法。BAR 方法可以产生尖锐的结果，并允许人们在三个附加条件下建立稳态收敛：稳定性、状态空间坍缩（SSC）和特定矩阵 "紧密"。这三个条件似乎并不超出到达间隔和服务时间分布的范围，其验证可作为三个独立模块进行研究。特别是，当所有分布都是指数分布时，可以在更简单的连续时间马尔可夫链环境中研究它们。举例来说，这三个条件在按照 "最后缓冲-优先服务 "规则运行的可重入线路中都是成立的。在一条两站五班的重入线路中，在大流量条件下，紧矩阵条件意味着稳定性条件和 SSC 条件。这种关系是否普遍成立还是一个未决问题。

{"title":"The BAR Approach for Multiclass Queueing Networks with SBP Service Policies","authors":"Anton Braverman, J. G. Dai, Masakiyo Miyazawa","doi":"10.1287/stsy.2023.0011","DOIUrl":"https://doi.org/10.1287/stsy.2023.0011","url":null,"abstract":"The basic adjoint relationship (BAR) approach is an analysis technique based on the stationary equation of a Markov process. This approach was introduced to study heavy-traffic, steady-state convergence of generalized Jackson networks in which each service station has a single job class. We extend it to multiclass queueing networks operating under static-buffer-priority (SBP) service disciplines. Our extension makes a connection with Palm distributions that allows one to attack a difficulty arising from queue-length truncation, which appears to be unavoidable in the multiclass setting. For multiclass queueing networks operating under SBP service disciplines, our BAR approach provides an alternative to the “interchange of limits” approach that has dominated the literature in the last twenty years. The BAR approach can produce sharp results and allows one to establish steady-state convergence under three additional conditions: stability, state space collapse (SSC) and a certain matrix being “tight.” These three conditions do not appear to depend on the interarrival and service-time distributions beyond their means, and their verification can be studied as three separate modules. In particular, they can be studied in a simpler, continuous-time Markov chain setting when all distributions are exponential. As an example, these three conditions are shown to hold in reentrant lines operating under last-buffer-first-serve discipline. In a two-station, five-class reentrant line, under the heavy-traffic condition, the tight-matrix condition implies both the stability condition and the SSC condition. Whether such a relationship holds generally is an open problem.","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140827875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ergodic Control of Bipartite Matching Queues with Class Change and Matching Failure 具有类别变化和匹配失败的双向匹配队列的遍历控制

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-03-26 DOI: 10.1287/stsy.2022.0008

Amin Khademi, Xin Liu

Motivated by transplant applications, we study a bipartite matching queue with multiclass customers and multitype resources. Customers may change their classes or abandon the system while waiting in queue, and they may decline the offered resource units which results in matching failure. We are interested in designing efficient instantaneous matching policies that allocate resources upon arrival to waiting customers. Our objective is bicriteria and formulated as a cost functional that linearly combines the long-run average expected reward due to successful matches and the long-run average expected cost from customer waiting and abandonment. We first develop a stability condition on the class change and abandonment rates, which requires at least one customer queue with abandonment and that any queue without abandonment have a class transition path to a queue with abandonment. Under this condition, we construct a simple linear program, referred to as the fluid control problem (FCP), which serves as a lower bound for the original stochastic control problem under any admissible policy. We then propose a randomized matching policy based on the solution of the FCP and show that the proposed policy is asymptotically optimal under both the long-run average and ergodic cost criteria. In addition, we apply our method to study two X matching models with two customer classes and two resource types to provide insights on how the class change and matching failure impact the optimal policies.

受移植应用的启发，我们研究了具有多类别客户和多类型资源的双向匹配队列。客户在队列中等待时可能会改变他们的类别或放弃系统，他们也可能拒绝接受所提供的资源单位，从而导致匹配失败。我们对设计高效的瞬时匹配策略很感兴趣，这种策略可以在资源到达时分配给等待的客户。我们的目标是双标准的，并表述为一个成本函数，它线性结合了成功匹配带来的长期平均预期回报以及客户等待和放弃带来的长期平均预期成本。我们首先制定了班级变化率和放弃率的稳定条件，要求至少有一个客户队列出现放弃现象，并且任何未出现放弃现象的队列都有通往出现放弃现象队列的班级转换路径。在此条件下，我们构建了一个简单的线性程序，称为流体控制问题（FCP），它是任何可接受策略下原始随机控制问题的下限。然后，我们根据 FCP 的解提出了一种随机匹配策略，并证明所提出的策略在长期平均成本和遍历成本标准下都是渐近最优的。此外，我们还应用我们的方法研究了具有两个客户类别和两种资源类型的 X 匹配模型，以深入了解类别变化和匹配失败对最优策略的影响。

{"title":"Ergodic Control of Bipartite Matching Queues with Class Change and Matching Failure","authors":"Amin Khademi, Xin Liu","doi":"10.1287/stsy.2022.0008","DOIUrl":"https://doi.org/10.1287/stsy.2022.0008","url":null,"abstract":"Motivated by transplant applications, we study a bipartite matching queue with multiclass customers and multitype resources. Customers may change their classes or abandon the system while waiting in queue, and they may decline the offered resource units which results in matching failure. We are interested in designing efficient instantaneous matching policies that allocate resources upon arrival to waiting customers. Our objective is bicriteria and formulated as a cost functional that linearly combines the long-run average expected reward due to successful matches and the long-run average expected cost from customer waiting and abandonment. We first develop a stability condition on the class change and abandonment rates, which requires at least one customer queue with abandonment and that any queue without abandonment have a class transition path to a queue with abandonment. Under this condition, we construct a simple linear program, referred to as the fluid control problem (FCP), which serves as a lower bound for the original stochastic control problem under any admissible policy. We then propose a randomized matching policy based on the solution of the FCP and show that the proposed policy is asymptotically optimal under both the long-run average and ergodic cost criteria. In addition, we apply our method to study two X matching models with two customer classes and two resource types to provide insights on how the class change and matching failure impact the optimal policies.","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"112 39","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140379098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimization of Inventory and Capacity in Large-Scale Assembly Systems Using Extreme-Value Theory 利用极值理论优化大规模装配系统中的库存和产能

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-03-26 DOI: 10.1287/stsy.2022.0014

M. Meijer, Dennis Schol, Willem van Jaarsveld, M. Vlasiou, Bert Zwart

High-tech systems are typically produced in two stages: (1) production of components using specialized equipment and staff and (2) system assembly/integration. Component production capacity is subject to fluctuations, causing a high risk of shortages of at least one component, which results in costly delays. Companies hedge this risk by strategic investments in excess production capacity and in buffer inventories of components. To optimize these, it is crucial to characterize the relation between component shortage risk and capacity and inventory investments. We suppose that component production capacity and produce demand are normally distributed over finite time intervals, and we accordingly model the production system as a symmetric fork-join queueing network with N statistically identical queues with a common arrival process and independent service processes. Assuming a symmetric cost structure, we subsequently apply extreme value theory to gain analytic insights into this optimization problem. We derive several new results for this queueing network, notably that the scaled maximum of N steady-state queue lengths converges in distribution to a Gaussian random variable. These results translate into asymptotically optimal methods to dimension the system. Tests on a range of problems reveal that these methods typically work well for systems of moderate size. Funding: This work is part of the research program Complexity in High-Tech Manufacturing, (partly) financed by the Dutch Research Council (NWO) [Grant 438.16.121]. The research is also supported by the NWO programs MEERVOUD to M. Vlasiou [Grant 632.003.002] and Talent VICI to B. Zwart [Grant 639.033.413].

高科技系统的生产通常分为两个阶段：(1) 使用专业设备和人员生产组件；(2) 系统组装/集成。元件生产能力会出现波动，造成至少一种元件短缺的高风险，从而导致代价高昂的延误。公司通过对过剩生产能力和元件缓冲库存进行战略投资来规避这一风险。要优化这些投资，关键是要确定零部件短缺风险与产能和库存投资之间的关系。我们假设零部件生产能力和生产需求在有限的时间间隔内呈正态分布，并相应地将生产系统建模为一个对称的叉接排队网络，其中有 N 个统计上相同的队列，它们具有共同的到达过程和独立的服务过程。假定成本结构是对称的，我们随后将应用极值理论来分析这一优化问题。我们得出了该排队网络的几个新结果，特别是 N 个稳态队列长度的缩放最大值在分布上收敛于高斯随机变量。这些结果转化成了系统维度的渐近最优方法。对一系列问题的测试表明，这些方法通常对中等规模的系统效果良好。资助：本研究是高科技制造中的复杂性研究项目的一部分，由荷兰研究理事会（NWO）[438.16.121 号拨款]（部分）资助。M. Vlasiou 的 MEERVOUD 项目 [632.003.002 号资助] 和 B. Zwart 的 Talent VICI 项目 [639.033.413 号资助] 也为本研究提供了支持。

{"title":"Optimization of Inventory and Capacity in Large-Scale Assembly Systems Using Extreme-Value Theory","authors":"M. Meijer, Dennis Schol, Willem van Jaarsveld, M. Vlasiou, Bert Zwart","doi":"10.1287/stsy.2022.0014","DOIUrl":"https://doi.org/10.1287/stsy.2022.0014","url":null,"abstract":"High-tech systems are typically produced in two stages: (1) production of components using specialized equipment and staff and (2) system assembly/integration. Component production capacity is subject to fluctuations, causing a high risk of shortages of at least one component, which results in costly delays. Companies hedge this risk by strategic investments in excess production capacity and in buffer inventories of components. To optimize these, it is crucial to characterize the relation between component shortage risk and capacity and inventory investments. We suppose that component production capacity and produce demand are normally distributed over finite time intervals, and we accordingly model the production system as a symmetric fork-join queueing network with N statistically identical queues with a common arrival process and independent service processes. Assuming a symmetric cost structure, we subsequently apply extreme value theory to gain analytic insights into this optimization problem. We derive several new results for this queueing network, notably that the scaled maximum of N steady-state queue lengths converges in distribution to a Gaussian random variable. These results translate into asymptotically optimal methods to dimension the system. Tests on a range of problems reveal that these methods typically work well for systems of moderate size. Funding: This work is part of the research program Complexity in High-Tech Manufacturing, (partly) financed by the Dutch Research Council (NWO) [Grant 438.16.121]. The research is also supported by the NWO programs MEERVOUD to M. Vlasiou [Grant 632.003.002] and Talent VICI to B. Zwart [Grant 639.033.413].","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"119 23","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140381250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gradient-Based Empirical Risk Minimization Using Local Polynomial Regression 利用局部多项式回归实现基于梯度的经验风险最小化

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-03-26 DOI: 10.1287/stsy.2022.0003

Ali Jadbabaie, Anuran Makur, Devavrat Shah

In this paper, we consider the widely studied problem of empirical risk minimization (ERM) of strongly convex and smooth loss functions using iterative gradient-based methods. A major goal of the existing literature has been to compare different prototypical algorithms, such as batch gradient descent (GD) or stochastic gradient descent (SGD), by analyzing their rates of convergence to ϵ-approximate solutions with respect to the number of gradient computations, which is also known as the oracle complexity. For example, the oracle complexity of GD is [Formula: see text], where n is the number of training samples and p is the parameter space dimension. When n is large, this can be prohibitively expensive in practice, and SGD is preferred due to its oracle complexity of [Formula: see text]. Such standard analyses only utilize the smoothness of the loss function in the parameter being optimized. In contrast, we demonstrate that when the loss function is smooth in the data, we can learn the oracle at every iteration and beat the oracle complexities of GD, SGD, and their variants in important regimes. Specifically, at every iteration, our proposed algorithm, Local Polynomial Interpolation-based Gradient Descent (LPI-GD), first performs local polynomial regression with a virtual batch of data points to learn the gradient of the loss function and then estimates the true gradient of the ERM objective function. We establish that the oracle complexity of LPI-GD is [Formula: see text], where d is the data space dimension, and the gradient of the loss function is assumed to belong to an η-Hölder class with respect to the data. Our proof extends the analysis of local polynomial regression in nonparametric statistics to provide supremum norm guarantees for interpolation in multivariate settings and also exploits tools from the inexact GD literature. Unlike the complexities of GD and SGD, the complexity of our method depends on d. However, our algorithm outperforms GD, SGD, and their variants in oracle complexity for a broad range of settings where d is small relative to n. For example, with typical loss functions (such as squared or cross-entropy loss), when [Formula: see text] for any [Formula: see text] and [Formula: see text] is at the statistical limit, our method can be made to require [Formula: see text] oracle calls for any [Formula: see text], while SGD and GD require [Formula: see text] and [Formula: see text] oracle calls, respectively.Funding: This work was supported in part by the Office of Naval Research [Grant N000142012394], in part by the Army Research Office [Multidisciplinary University Research Initiative Grant W911NF-19-1-0217], and in part by the National Science Foundation [Transdisciplinary Research In Principles Of Data Science, Foundations of Data Science].

在本文中，我们使用基于梯度的迭代方法，研究了强凸平滑损失函数的经验风险最小化（ERM）这一广泛研究的问题。现有文献的一个主要目标是比较不同的原型算法，如批量梯度下降算法（GD）或随机梯度下降算法（SGD），分析它们收敛到ϵ近似解的率与梯度计算次数（也称为oracle复杂度）的关系。例如，GD 的oracle 复杂度为[公式：见正文]，其中 n 是训练样本数，p 是参数空间维数。当 n 较大时，这种方法在实际应用中会过于昂贵，而 SGD 的甲骨文复杂度为[公式：见正文]，因此更受青睐。这种标准分析只能利用被优化参数的损失函数的平滑性。相比之下，我们证明了当数据中的损失函数是平滑的，我们可以在每次迭代中学习神谕，并在重要情况下击败 GD、SGD 及其变体的神谕复杂度。具体来说，在每次迭代时，我们提出的算法--基于局部多项式插值的梯度下降算法（LPI-GD）--首先用一批虚拟数据点进行局部多项式回归，学习损失函数的梯度，然后估计 ERM 目标函数的真实梯度。我们确定 LPI-GD 的算法复杂度为 [公式：见正文]，其中 d 是数据空间维度，损失函数的梯度假定与数据有关，属于 η-Hölder 类。我们的证明扩展了非参数统计中的局部多项式回归分析，为多变量设置中的插值提供了至高规范保证，同时也利用了非精确 GD 文献中的工具。与 GD 和 SGD 的复杂性不同，我们的方法的复杂性取决于 d。然而，在 d 相对于 n 较小的各种情况下，我们的算法在 Oracle 复杂性方面优于 GD、SGD 及其变体。例如，对于典型的损失函数（如平方损失或交叉熵损失），当任意[公式：见正文]的[公式：见正文]和[公式：见正文]处于统计极限时，我们的方法可以使任意[公式：见正文]的[公式：见正文]都不需要[公式：见正文]的神谕调用，而SGD和GD则分别需要[公式：见正文]和[公式：见正文]的神谕调用：这项工作部分得到海军研究办公室[Grant N000142012394]的支持，部分得到陆军研究办公室[Multidisciplinary University Research Initiative Grant W911NF-19-1-0217]的支持，部分得到美国国家科学基金会[Transdisciplinary Research In Principles Of Data Science, Foundations of Data Science]的支持。

{"title":"Gradient-Based Empirical Risk Minimization Using Local Polynomial Regression","authors":"Ali Jadbabaie, Anuran Makur, Devavrat Shah","doi":"10.1287/stsy.2022.0003","DOIUrl":"https://doi.org/10.1287/stsy.2022.0003","url":null,"abstract":"In this paper, we consider the widely studied problem of empirical risk minimization (ERM) of strongly convex and smooth loss functions using iterative gradient-based methods. A major goal of the existing literature has been to compare different prototypical algorithms, such as batch gradient descent (GD) or stochastic gradient descent (SGD), by analyzing their rates of convergence to ϵ-approximate solutions with respect to the number of gradient computations, which is also known as the oracle complexity. For example, the oracle complexity of GD is [Formula: see text], where n is the number of training samples and p is the parameter space dimension. When n is large, this can be prohibitively expensive in practice, and SGD is preferred due to its oracle complexity of [Formula: see text]. Such standard analyses only utilize the smoothness of the loss function in the parameter being optimized. In contrast, we demonstrate that when the loss function is smooth in the data, we can learn the oracle at every iteration and beat the oracle complexities of GD, SGD, and their variants in important regimes. Specifically, at every iteration, our proposed algorithm, Local Polynomial Interpolation-based Gradient Descent (LPI-GD), first performs local polynomial regression with a virtual batch of data points to learn the gradient of the loss function and then estimates the true gradient of the ERM objective function. We establish that the oracle complexity of LPI-GD is [Formula: see text], where d is the data space dimension, and the gradient of the loss function is assumed to belong to an η-Hölder class with respect to the data. Our proof extends the analysis of local polynomial regression in nonparametric statistics to provide supremum norm guarantees for interpolation in multivariate settings and also exploits tools from the inexact GD literature. Unlike the complexities of GD and SGD, the complexity of our method depends on d. However, our algorithm outperforms GD, SGD, and their variants in oracle complexity for a broad range of settings where d is small relative to n. For example, with typical loss functions (such as squared or cross-entropy loss), when [Formula: see text] for any [Formula: see text] and [Formula: see text] is at the statistical limit, our method can be made to require [Formula: see text] oracle calls for any [Formula: see text], while SGD and GD require [Formula: see text] and [Formula: see text] oracle calls, respectively.Funding: This work was supported in part by the Office of Naval Research [Grant N000142012394], in part by the Army Research Office [Multidisciplinary University Research Initiative Grant W911NF-19-1-0217], and in part by the National Science Foundation [Transdisciplinary Research In Principles Of Data Science, Foundations of Data Science].","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140599585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Appointment Requests from Multiple Channels: Characterizing Optimal Set of Appointment Days to Offer with Patient Preferences 来自多种渠道的预约请求：根据患者偏好确定最佳预约日组合

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-03-18 DOI: 10.1287/stsy.2022.0029

Feray Tunçalp, Lerzan Örmeci

We consider the appointment scheduling for a physician in a healthcare facility. Patients, of two types differentiated by their revenues and day preferences, contact the facility through either a call center to be scheduled immediately or a website to be scheduled the following morning. The facility aims to maximize the long-run average revenue, while ensuring that a certain service level is satisfied for patients generating lower revenue. The facility has two decisions: offering a set of appointment days and choosing the patient type to prioritize while contacting the website patients. Model 1 is a periodic Markov Decision Process (MDP) model without the service-level constraint. We establish certain structural properties of Model 1, while providing sufficient conditions for the existence of a preferred patient type and for the nonoptimality of the commonly used offer-all policy. We also demonstrate the importance of patient preference in determining the preferred type. Model 2 is the constrained MDP model that accommodates the service-level constraint and has an optimal randomized policy with a special structure. This allows developing an efficient method to identify a well-performing policy. We illustrate the performance of this policy through numerical experiments, for systems with and without no-shows.Supplemental Material: The online appendix is available at https://doi.org/10.1287/stsy.2022.0029 .

我们考虑的是医疗机构中医生的预约安排。患者分为两类，他们的收入和日期偏好各不相同，他们通过呼叫中心联系医疗机构，要求立即安排就诊时间，或者通过网站联系医疗机构，要求在第二天早上安排就诊时间。医疗机构的目标是最大限度地提高长期平均收入，同时确保为收入较低的患者提供一定的服务水平。该机构有两个决策：提供一组预约日，以及在联系网站患者时选择优先考虑的患者类型。模型 1 是一个没有服务水平约束的周期性马尔可夫决策过程（MDP）模型。我们确定了模型 1 的某些结构特性，同时为首选患者类型的存在和常用的 "全部提供 "政策的非最优性提供了充分条件。我们还证明了患者偏好对确定首选类型的重要性。模型 2 是受约束的 MDP 模型，它考虑了服务水平约束，并具有特殊结构的最优随机政策。这使得我们可以开发出一种高效的方法来确定性能良好的政策。我们通过数值实验来说明这种策略的性能，实验对象包括有无空闲的系统：在线附录见 https://doi.org/10.1287/stsy.2022.0029 。

{"title":"Appointment Requests from Multiple Channels: Characterizing Optimal Set of Appointment Days to Offer with Patient Preferences","authors":"Feray Tunçalp, Lerzan Örmeci","doi":"10.1287/stsy.2022.0029","DOIUrl":"https://doi.org/10.1287/stsy.2022.0029","url":null,"abstract":"We consider the appointment scheduling for a physician in a healthcare facility. Patients, of two types differentiated by their revenues and day preferences, contact the facility through either a call center to be scheduled immediately or a website to be scheduled the following morning. The facility aims to maximize the long-run average revenue, while ensuring that a certain service level is satisfied for patients generating lower revenue. The facility has two decisions: offering a set of appointment days and choosing the patient type to prioritize while contacting the website patients. Model 1 is a periodic Markov Decision Process (MDP) model without the service-level constraint. We establish certain structural properties of Model 1, while providing sufficient conditions for the existence of a preferred patient type and for the nonoptimality of the commonly used offer-all policy. We also demonstrate the importance of patient preference in determining the preferred type. Model 2 is the constrained MDP model that accommodates the service-level constraint and has an optimal randomized policy with a special structure. This allows developing an efficient method to identify a well-performing policy. We illustrate the performance of this policy through numerical experiments, for systems with and without no-shows.Supplemental Material: The online appendix is available at https://doi.org/10.1287/stsy.2022.0029 .","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140168240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters 利用数据位置性提高异构服务器集群的性能

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-02-06 DOI: 10.1287/stsy.2022.0040

Zhisheng Zhao, Debankur Mukherjee, Ruoyu Wu

We consider load balancing in large-scale heterogeneous server systems in the presence of data locality that imposes constraints on which tasks can be assigned to which servers. The constraints are naturally captured by a bipartite graph between the servers and the dispatchers handling assignments of various arrival flows. When a task arrives, the corresponding dispatcher assigns it to a server with the shortest queue among [Formula: see text] randomly selected servers obeying these constraints. Server processing speeds are heterogeneous, and they depend on the server type. For a broad class of bipartite graphs, we characterize the limit of the appropriately scaled occupancy process, both on the process level and in steady state, as the system size becomes large. Using such a characterization, we show that imposing data locality constraints can significantly improve the performance of heterogeneous systems. This is in stark contrast to either heterogeneous servers in a full flexible system or data locality constraints in systems with homogeneous servers, both of which have been observed to degrade the system performance. Extensive numerical experiments corroborate the theoretical results.Funding: This work was partially supported by the National Science Foundation [CCF. 07/2021–06/2024].

我们考虑的是大规模异构服务器系统中的负载平衡问题，因为数据局部性会对哪些任务可以分配给哪些服务器造成限制。服务器和处理各种到达流分配的调度员之间的双向图自然地捕捉到了这些约束。当任务到达时，相应的调度员会将其分配给随机选择的服务器中队列最短的服务器。服务器的处理速度各不相同，取决于服务器的类型。对于一大类双方形图，当系统规模变大时，我们会从进程层面和稳态两方面描述适当比例占用过程的极限。利用这种描述，我们证明了施加数据局部性约束可以显著提高异构系统的性能。这与完全灵活系统中的异构服务器或同构服务器系统中的数据局部性约束形成了鲜明对比，据观察，这两种约束都会降低系统性能。广泛的数值实验证实了理论结果：这项工作得到了美国国家科学基金会[CCF.07/2021-06/2024]的部分资助。

{"title":"Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters","authors":"Zhisheng Zhao, Debankur Mukherjee, Ruoyu Wu","doi":"10.1287/stsy.2022.0040","DOIUrl":"https://doi.org/10.1287/stsy.2022.0040","url":null,"abstract":"We consider load balancing in large-scale heterogeneous server systems in the presence of data locality that imposes constraints on which tasks can be assigned to which servers. The constraints are naturally captured by a bipartite graph between the servers and the dispatchers handling assignments of various arrival flows. When a task arrives, the corresponding dispatcher assigns it to a server with the shortest queue among [Formula: see text] randomly selected servers obeying these constraints. Server processing speeds are heterogeneous, and they depend on the server type. For a broad class of bipartite graphs, we characterize the limit of the appropriately scaled occupancy process, both on the process level and in steady state, as the system size becomes large. Using such a characterization, we show that imposing data locality constraints can significantly improve the performance of heterogeneous systems. This is in stark contrast to either heterogeneous servers in a full flexible system or data locality constraints in systems with homogeneous servers, both of which have been observed to degrade the system performance. Extensive numerical experiments corroborate the theoretical results.Funding: This work was partially supported by the National Science Foundation [CCF. 07/2021–06/2024].","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139902558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Large-System Insensitivity of Zero-Waiting Load Balancing Algorithms 零等待负载平衡算法对大型系统的不敏感性

Q1 Mathematics

Stochastic Systems

Pub Date : 2024-01-22 DOI: 10.1287/stsy.2022.0023

Xin Liu, Kang Gong, Lei Ying

This paper studies the sensitivity (or insensitivity) of a class of load balancing algorithms that achieve asymptotic zero-waiting in the sub-Halfin-Whitt regime, named LB-zero. Most existing results on zero-waiting load balancing algorithms assume the service time distribution is exponential. This paper establishes the large-system insensitivity of LB-zero for jobs whose service time follows a Coxian distribution with a finite number of phases. This result justifies that LB-zero achieves asymptotic zero-waiting for a large class of service time distributions as the Coxian family is dense in the class of positive-valued distributions. To prove this result, this paper develops a new technique, called “iterative state-space peeling” (ISSP). ISSP first identifies an iterative relation between the upper and lower bounds on the queue states and then proves that the system lives near the fixed point of the iterative bounds with a high probability. Based on ISSP, the steady-state distribution of the queue length is further analyzed by applying Stein’s method in the neighborhood of the fixed point. ISSP, like state-space collapse in heavy-traffic analysis, is a general approach that may be used to study other complex stochastic systems.

本文研究了一类负载平衡算法的灵敏度（或不灵敏度），该算法可在亚半文-维特机制下实现渐进零等待，被命名为 LB-zero。关于零等待负载均衡算法的大多数现有结果都假设服务时间分布为指数分布。本文证明了 LB-zero 对服务时间服从有限阶段数的考克斯分布的作业的大系统不敏感性。这一结果证明了 LB-zero 算法在一大类服务时间分布中实现了渐近零等待，因为 Coxian 族在正值分布类中是密集的。为了证明这一结果，本文开发了一种名为 "迭代状态空间剥离"（ISSP）的新技术。ISSP 首先确定队列状态的上界和下界之间的迭代关系，然后证明系统以很高的概率生存在迭代边界的定点附近。在 ISSP 的基础上，在定点附近应用斯坦因方法进一步分析队列长度的稳态分布。ISSP 与重交通分析中的状态空间坍缩一样，是一种通用方法，可用于研究其他复杂的随机系统。

{"title":"Large-System Insensitivity of Zero-Waiting Load Balancing Algorithms","authors":"Xin Liu, Kang Gong, Lei Ying","doi":"10.1287/stsy.2022.0023","DOIUrl":"https://doi.org/10.1287/stsy.2022.0023","url":null,"abstract":"This paper studies the sensitivity (or insensitivity) of a class of load balancing algorithms that achieve asymptotic zero-waiting in the sub-Halfin-Whitt regime, named LB-zero. Most existing results on zero-waiting load balancing algorithms assume the service time distribution is exponential. This paper establishes the large-system insensitivity of LB-zero for jobs whose service time follows a Coxian distribution with a finite number of phases. This result justifies that LB-zero achieves asymptotic zero-waiting for a large class of service time distributions as the Coxian family is dense in the class of positive-valued distributions. To prove this result, this paper develops a new technique, called “iterative state-space peeling” (ISSP). ISSP first identifies an iterative relation between the upper and lower bounds on the queue states and then proves that the system lives near the fixed point of the iterative bounds with a high probability. Based on ISSP, the steady-state distribution of the queue length is further analyzed by applying Stein’s method in the neighborhood of the fixed point. ISSP, like state-space collapse in heavy-traffic analysis, is a general approach that may be used to study other complex stochastic systems.","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139583130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0