Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management

Andrew Jeffery, Chris Jensen, Richard Mortier
{"title":"Reducing Tail Latencies Through Environment- and Neighbour-aware Thread Management","authors":"Andrew Jeffery, Chris Jensen, Richard Mortier","doi":"arxiv-2407.11582","DOIUrl":null,"url":null,"abstract":"Application tail latency is a key metric for many services, with high\nlatencies being linked directly to loss of revenue. Modern deeply-nested\nmicro-service architectures exacerbate tail latencies, increasing the\nlikelihood of users experiencing them. In this work, we show how CPU\novercommitment by OS threads leads to high tail latencies when applications are\nunder heavy load. CPU overcommitment can arise from two operational factors:\nincorrectly determining the number of CPUs available when under a CPU quota,\nand the ignorance of neighbour applications and their CPU usage. We discuss\ndifferent languages' solutions to obtaining the CPUs available, evaluating the\nimpact, and discuss opportunities for a more unified language-independent\ninterface to obtain the number of CPUs available. We then evaluate the impact\nof neighbour usage on tail latency and introduce a new neighbour-aware\nthreadpool, the friendlypool, that dynamically avoids overcommitment. In our\nevaluation, the friendlypool reduces maximum worker latency by up to\n$6.7\\times$ at the cost of decreasing throughput by up to $1.4\\times$.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"2012 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.11582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Application tail latency is a key metric for many services, with high latencies being linked directly to loss of revenue. Modern deeply-nested micro-service architectures exacerbate tail latencies, increasing the likelihood of users experiencing them. In this work, we show how CPU overcommitment by OS threads leads to high tail latencies when applications are under heavy load. CPU overcommitment can arise from two operational factors: incorrectly determining the number of CPUs available when under a CPU quota, and the ignorance of neighbour applications and their CPU usage. We discuss different languages' solutions to obtaining the CPUs available, evaluating the impact, and discuss opportunities for a more unified language-independent interface to obtain the number of CPUs available. We then evaluate the impact of neighbour usage on tail latency and introduce a new neighbour-aware threadpool, the friendlypool, that dynamically avoids overcommitment. In our evaluation, the friendlypool reduces maximum worker latency by up to $6.7\times$ at the cost of decreasing throughput by up to $1.4\times$.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过环境和邻居感知线程管理减少尾端延迟
应用程序尾端延迟是许多服务的关键指标,高延迟与收入损失直接相关。现代深嵌套微服务架构加剧了尾部延迟,增加了用户遇到尾部延迟的可能性。在这项工作中,我们展示了操作系统线程对 CPU 的过度承诺如何在应用程序处于重负载时导致高尾延迟。CPU 过度承诺可能源于两个操作因素:在 CPU 配额下错误地确定可用 CPU 的数量,以及对相邻应用程序及其 CPU 使用情况的不了解。我们讨论了不同语言获取可用 CPU 的解决方案,评估了其影响,并讨论了建立一个更统一的、与语言无关的接口来获取可用 CPU 数量的可能性。然后,我们评估了邻居使用对尾部延迟的影响,并引入了一种新的邻居感知线程池--友好线程池(friendlypool),它可以动态避免过度承诺。在我们的评估中,友好线程池以降低吞吐量达 1.4 美元/次为代价,将最大工作者延迟降低了 6.7 美元/次。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
HRA: A Multi-Criteria Framework for Ranking Metaheuristic Optimization Algorithms Temporal Load Imbalance on Ondes3D Seismic Simulator for Different Multicore Architectures Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental Study The Landscape of GPU-Centric Communication A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1