Exploiting User Patience for Scaling Resource Capacity in Cloud Services

2014 IEEE 7th International Conference on Cloud Computing. Pub Date: 2014-06-27. DOI: 10.1109/CLOUD.2014.67

Renato L. F. Cunha, M. Assunção, C. Cardonha, M. Netto


Citations: 10

Abstract



Exploiting User Patience for Scaling Resource Capacity in Cloud Services

An important feature of cloud computing is its elasticity, that is, the ability to have resource capacity dynamically modified according to the current system load. Auto-scaling is challenging because it must account for two conflicting objectives: minimising system capacity available to users and maximising QoS, which typically translates to short response times. Current auto-scaling techniques are based solely on load forecasts and ignore the perception that users have from cloud services. As a consequence, providers tend to provision a volume of resources that is significantly larger than necessary to keep users satisfied. In this article, we propose a scheduling algorithm and an auto-scaling triggering technique that explore user patience in order to identify critical times when auto-scaling is needed and the appropriate volume of capacity by which the cloud platform should either extend or shrink. The proposed technique assists service providers in reducing costs related to resource allocation while keeping the same QoS to users. Our experiments show that it is possible to reduce resource-hour by up to approximately 8% compared to auto-scaling based on system utilisation.
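The abstract does not spell out the scheduling algorithm or the trigger, so the following is only a minimal sketch of the general idea of a patience-aware scaling decision, not the authors' method. It assumes each queued request carries a hypothetical `patience` value (the longest wait its user tolerates) and an estimated `service_time`; the names `Request`, `capacity_delta`, `horizon`, and `min_capacity` are all illustrative assumptions. The rule counts only the work whose patience would run out within the next `horizon` seconds and sizes capacity to finish that work in time, instead of reacting to raw utilisation.

```python
from dataclasses import dataclass
from math import ceil


@dataclass
class Request:
    waited: float        # seconds already spent in the queue
    patience: float      # assumed: longest wait the user tolerates, in seconds
    service_time: float  # assumed: estimated seconds of work on one resource


def capacity_delta(queue, capacity, horizon, min_capacity=1):
    """Return how many resources to add (positive) or release (negative).

    Illustrative rule only: size capacity for the work whose remaining
    patience expires within `horizon` seconds.
    """
    # Total work belonging to requests that will become impatient soon.
    urgent_work = sum(
        r.service_time for r in queue if r.patience - r.waited <= horizon
    )
    # Resources needed to finish that work before the horizon ends.
    needed = max(min_capacity, ceil(urgent_work / horizon))
    return needed - capacity


if __name__ == "__main__":
    queue = [
        Request(waited=50, patience=60, service_time=10),
        Request(waited=45, patience=60, service_time=10),
        Request(waited=55, patience=60, service_time=10),
        Request(waited=0, patience=600, service_time=10),  # still patient
    ]
    print(capacity_delta(queue, capacity=1, horizon=20))
```

Under these assumptions, a request whose user is still far from losing patience does not trigger extra capacity, which is the intuition behind provisioning less than a utilisation-based trigger would.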

