Scaling of Cloud Resources-Principal Component Analysis and Random Forest Approach

2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE) Pub Date : 2018-12-01 DOI:10.1109/ICSEE.2018.8646134

Omer Anisfeld, Erez Biton, Ruven Milshtein, M. Shifrin, Omer Gurewitz

{"title":"Scaling of Cloud Resources-Principal Component Analysis and Random Forest Approach","authors":"Omer Anisfeld, Erez Biton, Ruven Milshtein, M. Shifrin, Omer Gurewitz","doi":"10.1109/ICSEE.2018.8646134","DOIUrl":null,"url":null,"abstract":"The scaling challenge for a system which constitutes multiple clients, which address application servers deployed on the cloud, becomes more complicate once the applications’ nature imply consistent communication, e.g., video streaming. The effective scaling solution in this case is such that it will assure an acceptable client quality of experience (QoE), typically measured by video delay. In this paper, we provide a solution to the auto-scaling for cloud provider by means of analyzing the impact of various system parameters. The parameters which may impact the QoE on the client side include, but not limited to, average memory consumption, transmission and reception frequency, average CPU consumption on the side of the cloud provider. We perform Principal Component Analysis (PCA) in order to find a projection of the parameters, resulting in a set of features which can be sorted by their measure of impact. Next, we introduce scaling decision mechanism based on Random Forest (RF). Only most influencing features are employed for that, which renders the training process of the RF to be computationally effective. The proposed approach is novel in the sense that the scaling decisions found by the RF are in the projected space found by PCA (instead of having threshold derived directly from the original parameters), which is not necessarily intuitive. However, these features are numerically approved to be the most influencing. Moreover, as long as the features in the projected space are uncorrelated, it allows us to base the RF on only small subset of them, which would be ineffective in the original measurements space, where the correlation is high. In our Kubernetes-based implementation which employs this method, the resulting auto-scaler performs better than the default auto-scaler.","PeriodicalId":254455,"journal":{"name":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEE.2018.8646134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The scaling challenge for a system which constitutes multiple clients, which address application servers deployed on the cloud, becomes more complicate once the applications’ nature imply consistent communication, e.g., video streaming. The effective scaling solution in this case is such that it will assure an acceptable client quality of experience (QoE), typically measured by video delay. In this paper, we provide a solution to the auto-scaling for cloud provider by means of analyzing the impact of various system parameters. The parameters which may impact the QoE on the client side include, but not limited to, average memory consumption, transmission and reception frequency, average CPU consumption on the side of the cloud provider. We perform Principal Component Analysis (PCA) in order to find a projection of the parameters, resulting in a set of features which can be sorted by their measure of impact. Next, we introduce scaling decision mechanism based on Random Forest (RF). Only most influencing features are employed for that, which renders the training process of the RF to be computationally effective. The proposed approach is novel in the sense that the scaling decisions found by the RF are in the projected space found by PCA (instead of having threshold derived directly from the original parameters), which is not necessarily intuitive. However, these features are numerically approved to be the most influencing. Moreover, as long as the features in the projected space are uncorrelated, it allows us to base the RF on only small subset of them, which would be ineffective in the original measurements space, where the correlation is high. In our Kubernetes-based implementation which employs this method, the resulting auto-scaler performs better than the default auto-scaler.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

云资源的缩放——主成分分析和随机森林方法

对于一个由多个客户端组成的系统(这些客户端地址部署在云上的应用服务器)来说，一旦应用程序的性质意味着一致的通信(例如视频流)，扩展挑战就会变得更加复杂。在这种情况下，有效的扩展解决方案是这样的，它将确保可接受的客户端体验质量(QoE)，通常由视频延迟衡量。本文通过分析各种系统参数的影响，为云提供商提供了一种自动扩展的解决方案。可能影响客户端QoE的参数包括(但不限于)云提供商端的平均内存消耗、传输和接收频率、平均CPU消耗。我们执行主成分分析(PCA)以找到参数的投影，从而得到一组可以根据其影响度量进行排序的特征。其次，我们引入了基于随机森林(RF)的尺度决策机制。仅使用影响最大的特征，使得RF的训练过程在计算上是有效的。所提出的方法是新颖的，因为RF发现的缩放决策是在PCA发现的投影空间中(而不是直接从原始参数中获得阈值)，这并不一定是直观的。然而，这些特征在数值上被认为是最具影响力的。此外，只要投影空间中的特征是不相关的，它允许我们仅基于它们的一小部分来建立RF，这在原始测量空间中是无效的，因为相关性很高。在我们基于kubernetes的实现中，使用了这种方法，生成的自动缩放器比默认的自动缩放器性能更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)

自引率

0.00%

发文量