An Automatic Scaling System for Online Application with Microservices Architecture

2022 IEEE International Conference on Joint Cloud Computing (JCC) Pub Date : 2022-08-01 DOI:10.1109/JCC56315.2022.00018

Youmei Song, Chao Li, Kuoran Zhuang, Tengyu Ma, Tianyu Wo

{"title":"An Automatic Scaling System for Online Application with Microservices Architecture","authors":"Youmei Song, Chao Li, Kuoran Zhuang, Tengyu Ma, Tianyu Wo","doi":"10.1109/JCC56315.2022.00018","DOIUrl":null,"url":null,"abstract":"Auto-scaling is an efficient technique to handle fluctuations of application workloads by acquiring or releasing resources. However, performing auto-scaling in a microservice system for online applications faces critical challenges, including unpredictably massive microservice requests, without fine-granularity performance metrics, and complex dependencies among services. In this paper, we design a cost-efficient autoscaling system, which pinpoints the scaling-needed services as quickly as possible and makes decisions on the right resource amount allocation toward them. Specifically, we first propose a multi-level microservice monitoring mechanism to capture historical and latest service-level performance metrics, and detect the over-provisioning services and under-provisioning services via jointly considering the changes of latency and throughput. For the overload anomalies, a random walk method is further adopted for detecting the root causes based on the dependency topology of microservices. When anomalies are detected, we design a threshold-based method by incorporating the ARIMI method for predicting resource usage status to allocate or recycle the right number of computation resources for them. Extensive and systematic evaluations of different algorithm modules with real-world and simulated workload data confirm the superiority of our mechanism over multiple algorithms.","PeriodicalId":239996,"journal":{"name":"2022 IEEE International Conference on Joint Cloud Computing (JCC)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Joint Cloud Computing (JCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCC56315.2022.00018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Auto-scaling is an efficient technique to handle fluctuations of application workloads by acquiring or releasing resources. However, performing auto-scaling in a microservice system for online applications faces critical challenges, including unpredictably massive microservice requests, without fine-granularity performance metrics, and complex dependencies among services. In this paper, we design a cost-efficient autoscaling system, which pinpoints the scaling-needed services as quickly as possible and makes decisions on the right resource amount allocation toward them. Specifically, we first propose a multi-level microservice monitoring mechanism to capture historical and latest service-level performance metrics, and detect the over-provisioning services and under-provisioning services via jointly considering the changes of latency and throughput. For the overload anomalies, a random walk method is further adopted for detecting the root causes based on the dependency topology of microservices. When anomalies are detected, we design a threshold-based method by incorporating the ARIMI method for predicting resource usage status to allocate or recycle the right number of computation resources for them. Extensive and systematic evaluations of different algorithm modules with real-world and simulated workload data confirm the superiority of our mechanism over multiple algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于微服务架构的在线应用自动伸缩系统

自动伸缩是一种通过获取或释放资源来处理应用程序工作负载波动的有效技术。然而，在微服务系统中为在线应用程序执行自动扩展面临着严峻的挑战，包括不可预测的大量微服务请求，没有细粒度的性能指标，以及服务之间复杂的依赖关系。在本文中，我们设计了一个经济高效的自动扩展系统，该系统可以快速地确定扩展所需的服务，并对它们做出正确的资源量分配决策。具体而言，我们首先提出了一种多级微服务监控机制，以捕获历史和最新的服务级性能指标，并通过联合考虑延迟和吞吐量的变化来检测服务的过度供应和不足。针对过载异常，进一步采用基于微服务依赖拓扑的随机游走方法检测根本原因。当检测到异常时，我们设计了一种基于阈值的方法，结合ARIMI方法来预测资源使用状态，从而为它们分配或回收适当数量的计算资源。广泛和系统的评估不同的算法模块与现实世界和模拟工作负载数据证实了我们的机制优于多种算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊