Predictive auto-scaling with OpenStack Monasca

Giacomo Lanciano, Filippo Galli, T. Cucinotta, D. Bacciu, A. Passarella
{"title":"Predictive auto-scaling with OpenStack Monasca","authors":"Giacomo Lanciano, Filippo Galli, T. Cucinotta, D. Bacciu, A. Passarella","doi":"10.1145/3468737.3494104","DOIUrl":null,"url":null,"abstract":"Cloud auto-scaling mechanisms are typically based on reactive automation rules that scale a cluster whenever some metric, e.g., the average CPU usage among instances, exceeds a predefined threshold. Tuning these rules becomes particularly cumbersome when scaling-up a cluster involves non-negligible times to bootstrap new instances, as it happens frequently in production cloud services. To deal with this problem, we propose an architecture for auto-scaling cloud services based on the status in which the system is expected to evolve in the near future. Our approach leverages on time-series forecasting techniques, like those based on machine learning and artificial neural networks, to predict the future dynamics of key metrics, e.g., resource consumption metrics, and apply a threshold-based scaling policy on them. The result is a predictive automation policy that is able, for instance, to automatically anticipate peaks in the load of a cloud application and trigger ahead of time appropriate scaling actions to accommodate the expected increase in traffic. We prototyped our approach as an open-source OpenStack component, which relies on, and extends, the monitoring capabilities offered by Monasca, resulting in the addition of predictive metrics that can be leveraged by orchestration components like Heat or Senlin. We show experimental results using a recurrent neural network and a multi-layer perceptron as predictor, which are compared with a simple linear regression and a traditional non-predictive auto-scaling policy. However, the proposed framework allows for the easy customization of the prediction policy as needed.","PeriodicalId":254382,"journal":{"name":"Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3468737.3494104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 3

Abstract

Cloud auto-scaling mechanisms are typically based on reactive automation rules that scale a cluster whenever some metric, e.g., the average CPU usage among instances, exceeds a predefined threshold. Tuning these rules becomes particularly cumbersome when scaling up a cluster involves non-negligible times to bootstrap new instances, as frequently happens in production cloud services. To deal with this problem, we propose an architecture for auto-scaling cloud services based on the status into which the system is expected to evolve in the near future. Our approach leverages time-series forecasting techniques, like those based on machine learning and artificial neural networks, to predict the future dynamics of key metrics, e.g., resource consumption metrics, and applies a threshold-based scaling policy to them. The result is a predictive automation policy that is able, for instance, to automatically anticipate peaks in the load of a cloud application and trigger appropriate scaling actions ahead of time to accommodate the expected increase in traffic. We prototyped our approach as an open-source OpenStack component, which relies on, and extends, the monitoring capabilities offered by Monasca, resulting in the addition of predictive metrics that can be leveraged by orchestration components like Heat or Senlin. We show experimental results using a recurrent neural network and a multi-layer perceptron as predictors, compared with a simple linear regression and a traditional non-predictive auto-scaling policy. The proposed framework, however, allows for easy customization of the prediction policy as needed.
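To illustrate the threshold-on-forecast idea described in the abstract, the following is a minimal, self-contained Python sketch, not the authors' Monasca component: a hypothetical linear_forecast function stands in for the RNN/MLP predictors evaluated in the paper, and a simple policy triggers a scale-up as soon as the predicted metric trajectory crosses the threshold, rather than waiting for the observed value to do so. All names, thresholds, and data below are illustrative assumptions.

```python
# Minimal sketch (assumption: not the paper's actual implementation) of a
# threshold-based scaling policy applied to *forecast* metric values.

from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class ScalingPolicy:
    scale_up_threshold: float    # e.g. 80.0 (% average CPU usage)
    scale_down_threshold: float  # e.g. 20.0
    horizon: int                 # number of future steps to look ahead


def decide(history: Sequence[float],
           forecast: Callable[[Sequence[float], int], Sequence[float]],
           policy: ScalingPolicy) -> int:
    """Return +1 (scale up), -1 (scale down) or 0 (no action) based on the
    predicted metric trajectory over the configured horizon."""
    predicted = forecast(history, policy.horizon)
    if max(predicted) > policy.scale_up_threshold:
        return +1   # anticipate the peak: add instances before it materializes
    if max(predicted) < policy.scale_down_threshold:
        return -1   # sustained low load expected: release instances
    return 0


def linear_forecast(history: Sequence[float], horizon: int) -> Sequence[float]:
    """Naive least-squares linear extrapolation, a stand-in for the RNN/MLP
    predictors; any model with the same signature could be plugged in."""
    n = len(history)
    xs = range(n)
    x_mean = sum(xs) / n
    y_mean = sum(history) / n
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, history)) / \
            sum((x - x_mean) ** 2 for x in xs)
    intercept = y_mean - slope * x_mean
    # Extrapolate the next `horizon` points after the last observation.
    return [intercept + slope * (n - 1 + k) for k in range(1, horizon + 1)]


if __name__ == "__main__":
    cpu_history = [35, 40, 48, 55, 63, 70]  # % average CPU usage, rising trend
    policy = ScalingPolicy(scale_up_threshold=80.0,
                           scale_down_threshold=20.0,
                           horizon=3)
    # Prints 1: the forecast crosses 80% within the horizon, so the policy
    # scales up before the observed metric actually exceeds the threshold.
    print(decide(cpu_history, linear_forecast, policy))
```

In the architecture described by the paper, this kind of decision would instead be driven by predictive metrics published into Monasca and consumed by orchestration components such as Heat or Senlin through their usual alarm-based scaling workflows.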