A Service-Aware Autoscaling Strategy for Container Orchestration Platforms with Soft Resource Isolation

F. Tonini, C. Natalino, D. Temesgene, Z. Ghebretensae, L. Wosinska, P. Monti
{"title":"A Service-Aware Autoscaling Strategy for Container Orchestration Platforms with Soft Resource Isolation","authors":"F. Tonini, C. Natalino, D. Temesgene, Z. Ghebretensae, L. Wosinska, P. Monti","doi":"10.1109/EuCNC/6GSummit58263.2023.10188268","DOIUrl":null,"url":null,"abstract":"Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.","PeriodicalId":65870,"journal":{"name":"公共管理高层论坛","volume":"17 1","pages":"454-459"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"公共管理高层论坛","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1109/EuCNC/6GSummit58263.2023.10188268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有软资源隔离的容器编排平台的服务感知自动伸缩策略
像Kubernetes (k8)这样的容器编排平台允许轻松部署和管理云原生服务。在部署服务时,服务提供商需要为k8指定适当数量的资源,以便为其用户维持所需的服务质量(QoS)。为了应对来自用户的不同流量需求,他们可以依靠K8s水平Pod自动缩放(HPA)机制。为了确保在需要时有足够的资源可用,标准HPA机制依赖于资源过度供应。通过这种方式,所需的QoS在大部分时间(或全部时间)都得到了实现,但代价是要分配(并为此收费)额外的资源,而这些资源可能在相当长的一段时间内处于空闲状态。k8的软资源隔离提供了一种减少过度配置的方法,它允许服务使用共享资源(即运行服务的机器的空闲资源)来补偿资源的暂时缺乏。但是,在流量高峰期间,这些空闲资源可能不足以满足全部需求,从而降低QoS。HPA不知道有多少需求不能被满足,因此并不总是能够正确地估计所需的额外资源,从而进一步降低了QoS。为了克服这个问题,服务提供者需要利用过度供应,限制共享资源的使用。在本文中,我们提出了一种新的基于服务相关数据的k8资源自动缩放机制,以避免HPA带来的额外退化。所建议的策略还提供了一种调优过度配置和共享资源的方法。仿真结果表明,与HPA机制相比,我们的方法可以减少多达60%的空闲资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
385
期刊最新文献
Undersampling and SNR Degradation in Practical Direct RF Sampling Systems Research Challenges in Trustworthy Artificial Intelligence and Computing for Health: The Case of the PRE-ACT project Inter-Satellite Link Prediction for Non-Terrestrial Networks Using Supervised Learning AI-Powered Edge Computing Evolution for Beyond 5G Communication Networks Phase Modulation-based Fronthaul Network for 5G mmWave FR-2 Signal Transmission over Hybrid Links
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1