SLA-Aware Provisioning and Scheduling of Cloud Resources for Big Data Analytics

2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM) Pub Date : 2014-10-01 DOI:10.1109/CCEM.2014.7015497

Mohammed Alrokayan, Amir Vahid Dastjerdi, R. Buyya

{"title":"SLA-Aware Provisioning and Scheduling of Cloud Resources for Big Data Analytics","authors":"Mohammed Alrokayan, Amir Vahid Dastjerdi, R. Buyya","doi":"10.1109/CCEM.2014.7015497","DOIUrl":null,"url":null,"abstract":"The stunning growth in data has immensely impacted organizations. Their infrastructure and traditional data management system could not keep up to scale of Big Data. They have to either invest heavily on their infrastructure or move their Big Data analytics to Cloud where they can benefit from both on-demand scalability and contemporary data management techniques. However, to make Cloud hosted Big Data analytics available to wider range of enterprises, we have to carefully capture their preferences in terms of budget and service level objectives. Therefore, this study aims at proposing a SLA and cost-aware resource provisioning and task scheduling approach tailored for Big Data applications in the Cloud. Current approaches assume that data is pre-stored in cluster nodes prior to deployment of Big Data applications. In addition, their focus is purely on task scheduling, and not virtual machine provisioning. We argue that in the Cloud computing context this is not applicable, because the nodes are provisioned dynamically (data cannot be pre-stored) and leaving provisioning to user may lead to under or over provisioning that can both lead to SLA or budget constraint violations. Therefore,in this study we first model the user request, which consist of Big Data analytics jobs with budget and deadline. Then, we model infrastructures as a list of data centers, virtual machines (offered in a pay-as-you-go model), data sources, and network throughputs. After that, to address the aforementioned issues, we propose and compare cost-aware and SLA-based algorithms which provision cloud resources and schedule analytics tasks.","PeriodicalId":143177,"journal":{"name":"2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCEM.2014.7015497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 42

Abstract

The stunning growth in data has immensely impacted organizations. Their infrastructure and traditional data management system could not keep up to scale of Big Data. They have to either invest heavily on their infrastructure or move their Big Data analytics to Cloud where they can benefit from both on-demand scalability and contemporary data management techniques. However, to make Cloud hosted Big Data analytics available to wider range of enterprises, we have to carefully capture their preferences in terms of budget and service level objectives. Therefore, this study aims at proposing a SLA and cost-aware resource provisioning and task scheduling approach tailored for Big Data applications in the Cloud. Current approaches assume that data is pre-stored in cluster nodes prior to deployment of Big Data applications. In addition, their focus is purely on task scheduling, and not virtual machine provisioning. We argue that in the Cloud computing context this is not applicable, because the nodes are provisioned dynamically (data cannot be pre-stored) and leaving provisioning to user may lead to under or over provisioning that can both lead to SLA or budget constraint violations. Therefore,in this study we first model the user request, which consist of Big Data analytics jobs with budget and deadline. Then, we model infrastructures as a list of data centers, virtual machines (offered in a pay-as-you-go model), data sources, and network throughputs. After that, to address the aforementioned issues, we propose and compare cost-aware and SLA-based algorithms which provision cloud resources and schedule analytics tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向大数据分析的基于sla的云资源配置与调度

数据的惊人增长极大地影响了组织。他们的基础设施和传统的数据管理系统已经跟不上大数据的规模。他们要么在基础设施上投入巨资，要么将大数据分析转移到云端，在那里他们可以从按需可扩展性和现代数据管理技术中受益。然而，为了让更广泛的企业可以使用云托管的大数据分析，我们必须在预算和服务水平目标方面仔细捕捉他们的偏好。因此，本研究旨在为云中的大数据应用提出一种SLA和成本意识资源配置和任务调度方法。目前的方法假设数据在部署大数据应用程序之前预先存储在集群节点中。此外，它们的重点纯粹是任务调度，而不是虚拟机供应。我们认为，在云计算上下文中，这是不适用的，因为节点是动态供应的(数据不能预先存储)，将供应留给用户可能会导致供应不足或供应过剩，从而导致违反SLA或预算约束。因此，在本研究中，我们首先对用户请求进行建模，其中包括具有预算和截止日期的大数据分析工作。然后，我们将基础设施建模为数据中心、虚拟机(在即用即付模型中提供)、数据源和网络吞吐量的列表。之后，为了解决上述问题，我们提出并比较了提供云资源和调度分析任务的成本感知算法和基于sla的算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM)

自引率

0.00%

发文量

期刊最新文献

Outsourcing Resource-Intensive Tasks from Mobile Apps to Clouds: Android and Aneka Integration Big Data Infrastructure for Aviation Data Analytics Cloud Based Self Driving Cars Analyzing User Behavior Using Keystroke Dynamics to Protect Cloud from Malicious Insiders A Cloud Computing Service Level Agreement Framework with Negotiation and Secure Monitoring