Bursting the Cloud Data Bubble: Towards Transparent Storage Elasticity in IaaS Clouds

2014 IEEE 28th International Parallel and Distributed Processing Symposium Pub Date : 2014-05-19 DOI:10.1109/IPDPS.2014.25

Bogdan Nicolae, Pierre Riteau, K. Keahey

{"title":"Bursting the Cloud Data Bubble: Towards Transparent Storage Elasticity in IaaS Clouds","authors":"Bogdan Nicolae, Pierre Riteau, K. Keahey","doi":"10.1109/IPDPS.2014.25","DOIUrl":null,"url":null,"abstract":"Storage elasticity on IaaS clouds is an important feature for data-intensive workloads: storage requirements can vary greatly during application runtime, making worst-case over-provisioning a poor choice that leads to unnecessarily tied-up storage and extra costs for the user. While the ability to adapt dynamically to storage requirements is thus attractive, how to implement it is not well understood. Current approaches simply rely on users to attach and detach virtual disks to the virtual machine (VM) instances and then manage them manually, thus greatly increasing application complexity while reducing cost efficiency. Unlike such approaches, this paper aims to provide a transparent solution that presents a unified storage space to the VM in the form of a regular POSIX file system that hides the details of attaching and detaching virtual disks by handling those actions transparently based on dynamic application requirements. The main difficulty in this context is to understand the intent of the application and regulate the available storage in order to avoid running out of space while minimizing the performance overhead of doing so. To this end, we propose a storage space prediction scheme that analyzes multiple system parameters and dynamically adapts monitoring based on the intensity of the I/O in order to get as close as possible to the real usage. We show the value of our proposal over static worst-case over-provisioning and simpler elastic schemes that rely on a reactive model to attach and detach virtual disks, using both synthetic benchmarks and real-life data-intensive applications. Our experiments demonstrate that we can reduce storage waste/cost by 30-40% with only 2-5% performance overhead.","PeriodicalId":309291,"journal":{"name":"2014 IEEE 28th International Parallel and Distributed Processing Symposium","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 28th International Parallel and Distributed Processing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPS.2014.25","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

Abstract

Storage elasticity on IaaS clouds is an important feature for data-intensive workloads: storage requirements can vary greatly during application runtime, making worst-case over-provisioning a poor choice that leads to unnecessarily tied-up storage and extra costs for the user. While the ability to adapt dynamically to storage requirements is thus attractive, how to implement it is not well understood. Current approaches simply rely on users to attach and detach virtual disks to the virtual machine (VM) instances and then manage them manually, thus greatly increasing application complexity while reducing cost efficiency. Unlike such approaches, this paper aims to provide a transparent solution that presents a unified storage space to the VM in the form of a regular POSIX file system that hides the details of attaching and detaching virtual disks by handling those actions transparently based on dynamic application requirements. The main difficulty in this context is to understand the intent of the application and regulate the available storage in order to avoid running out of space while minimizing the performance overhead of doing so. To this end, we propose a storage space prediction scheme that analyzes multiple system parameters and dynamically adapts monitoring based on the intensity of the I/O in order to get as close as possible to the real usage. We show the value of our proposal over static worst-case over-provisioning and simpler elastic schemes that rely on a reactive model to attach and detach virtual disks, using both synthetic benchmarks and real-life data-intensive applications. Our experiments demonstrate that we can reduce storage waste/cost by 30-40% with only 2-5% performance overhead.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

打破云数据泡沫:在IaaS云中实现透明存储弹性

IaaS云上的存储弹性是数据密集型工作负载的一个重要特性:在应用程序运行期间，存储需求可能变化很大，这使得最坏情况下的过度配置成为一个糟糕的选择，它会导致不必要的存储捆绑和用户的额外成本。虽然动态适应存储需求的能力很有吸引力，但如何实现它还没有得到很好的理解。目前的方法仅仅依赖于用户将虚拟磁盘附加和分离到虚拟机实例，然后手动管理它们，从而大大增加了应用程序的复杂性，同时降低了成本效率。与这些方法不同，本文旨在提供一种透明的解决方案，以常规POSIX文件系统的形式向VM提供统一的存储空间，通过基于动态应用程序需求透明地处理这些操作，隐藏了附加和分离虚拟磁盘的细节。在这种情况下，主要的困难是理解应用程序的意图并调节可用存储，以避免耗尽空间，同时尽量减少这样做的性能开销。为此，我们提出了一种存储空间预测方案，该方案分析了多个系统参数，并根据I/O的强度动态调整监控，以尽可能接近实际使用情况。我们展示了我们的建议的价值，而不是静态的最坏情况过度配置和更简单的弹性方案，这些方案依赖于一个反应模型来附加和分离虚拟磁盘，使用合成基准测试和真实的数据密集型应用程序。我们的实验表明，我们可以减少30-40%的存储浪费/成本，而性能开销仅为2-5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE 28th International Parallel and Distributed Processing Symposium

自引率

0.00%

发文量