Sharp Waiting-Time Bounds for Multiserver Jobs

Stochastic Systems (JCR Q1, Mathematics) | Pub Date: 2024-08-26 | DOI: 10.1287/stsy.2023.0006
Yige Hong, Weina Wang

Abstract

Multiserver jobs, which are jobs that occupy multiple servers simultaneously during service, are prevalent in today’s computing clusters. Yet little is known about the delay performance of systems with multiserver jobs. We consider queueing models for multiserver jobs in scaling regimes where the system load becomes heavy while the total number of servers in the system and the number of servers that a job needs both become large. Prior work has derived upper bounds on the queueing probability in this scaling regime. However, without proper lower bounds, the existing results cannot be used to differentiate between policies. In this paper, we study the delay performance by establishing sharp bounds on the steady-state mean waiting time of multiserver jobs, where the waiting time of a job is the time spent queueing rather than in service. We first characterize the exact order of the mean waiting time under the first come, first serve (FCFS) policy. Then, we prove a lower bound on the mean waiting time of all policies, which has an order gap with the mean waiting time under FCFS. We show that the lower bound is achievable by a priority policy that we call smallest need first (SNF).

Funding: This research was supported in part by the National Science Foundation [Grant ECCS-2145713].

Supplemental Material: The online appendix is available at https://doi.org/10.1287/stsy.2023.0006.
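To make the difference between the two policies named in the abstract concrete, here is a minimal simulation sketch. It is not the paper's model or analysis: the function name `mean_waiting_time`, the job tuple layout, and the three-job instance are all hypothetical, and SNF is assumed here to be non-preemptive (a job in service is never interrupted), which may differ from the policy the authors analyze. Under both policies the scheduler starts the highest-priority waiting job only if enough servers are free, and otherwise waits.

```python
import heapq


def mean_waiting_time(jobs, num_servers, policy):
    """Mean waiting time on a finite list of multiserver jobs.

    jobs: list of (arrival_time, need, service_time), sorted by arrival time.
    policy: "fcfs" (arrival order) or "snf" (smallest server need first).
    Illustrative assumption: non-preemptive service under both policies.
    """
    if not jobs:
        raise ValueError("jobs must be non-empty")
    if any(need > num_servers for _, need, _ in jobs):
        raise ValueError("a job needs more servers than the system has")
    if policy == "fcfs":
        priority = lambda idx: (idx,)
    elif policy == "snf":
        priority = lambda idx: (jobs[idx][1], idx)  # ties broken by arrival
    else:
        raise ValueError("unknown policy: " + policy)

    free = num_servers
    waiting = []       # heap of (priority, job index)
    departures = []    # heap of (departure time, servers released)
    total_wait = 0.0
    next_arrival = 0

    while next_arrival < len(jobs) or waiting:
        t_arr = jobs[next_arrival][0] if next_arrival < len(jobs) else float("inf")
        t_dep = departures[0][0] if departures else float("inf")
        now = min(t_arr, t_dep)
        if t_dep <= t_arr:
            # A job finishes and releases its servers.
            _, released = heapq.heappop(departures)
            free += released
        else:
            # A job arrives and joins the waiting queue.
            heapq.heappush(waiting, (priority(next_arrival), next_arrival))
            next_arrival += 1
        # Start waiting jobs in priority order until the head does not fit.
        while waiting and jobs[waiting[0][1]][1] <= free:
            _, idx = heapq.heappop(waiting)
            arrival, need, service = jobs[idx]
            free -= need
            total_wait += now - arrival
            heapq.heappush(departures, (now + service, need))

    return total_wait / len(jobs)


# Hypothetical instance: 4 servers, three jobs (arrival, need, service).
jobs = [(0, 3, 10), (1, 4, 10), (2, 1, 1)]
print(mean_waiting_time(jobs, 4, "fcfs"))  # 9.0
print(mean_waiting_time(jobs, 4, "snf"))   # 3.0
```

In this toy instance the one-server job is stuck behind the four-server job under FCFS even though a server is idle, whereas SNF lets it start immediately. The example only illustrates the mechanism; it says nothing about the asymptotic orders established in the paper.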