SLA-Aware Provisioning and Scheduling of Cloud Resources for Big Data Analytics

Mohammed Alrokayan, Amir Vahid Dastjerdi, R. Buyya
{"title":"SLA-Aware Provisioning and Scheduling of Cloud Resources for Big Data Analytics","authors":"Mohammed Alrokayan, Amir Vahid Dastjerdi, R. Buyya","doi":"10.1109/CCEM.2014.7015497","DOIUrl":null,"url":null,"abstract":"The stunning growth in data has immensely impacted organizations. Their infrastructure and traditional data management system could not keep up to scale of Big Data. They have to either invest heavily on their infrastructure or move their Big Data analytics to Cloud where they can benefit from both on-demand scalability and contemporary data management techniques. However, to make Cloud hosted Big Data analytics available to wider range of enterprises, we have to carefully capture their preferences in terms of budget and service level objectives. Therefore, this study aims at proposing a SLA and cost-aware resource provisioning and task scheduling approach tailored for Big Data applications in the Cloud. Current approaches assume that data is pre-stored in cluster nodes prior to deployment of Big Data applications. In addition, their focus is purely on task scheduling, and not virtual machine provisioning. We argue that in the Cloud computing context this is not applicable, because the nodes are provisioned dynamically (data cannot be pre-stored) and leaving provisioning to user may lead to under or over provisioning that can both lead to SLA or budget constraint violations. Therefore,in this study we first model the user request, which consist of Big Data analytics jobs with budget and deadline. Then, we model infrastructures as a list of data centers, virtual machines (offered in a pay-as-you-go model), data sources, and network throughputs. After that, to address the aforementioned issues, we propose and compare cost-aware and SLA-based algorithms which provision cloud resources and schedule analytics tasks.","PeriodicalId":143177,"journal":{"name":"2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCEM.2014.7015497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 42

Abstract

The stunning growth in data has immensely impacted organizations. Their infrastructure and traditional data management system could not keep up to scale of Big Data. They have to either invest heavily on their infrastructure or move their Big Data analytics to Cloud where they can benefit from both on-demand scalability and contemporary data management techniques. However, to make Cloud hosted Big Data analytics available to wider range of enterprises, we have to carefully capture their preferences in terms of budget and service level objectives. Therefore, this study aims at proposing a SLA and cost-aware resource provisioning and task scheduling approach tailored for Big Data applications in the Cloud. Current approaches assume that data is pre-stored in cluster nodes prior to deployment of Big Data applications. In addition, their focus is purely on task scheduling, and not virtual machine provisioning. We argue that in the Cloud computing context this is not applicable, because the nodes are provisioned dynamically (data cannot be pre-stored) and leaving provisioning to user may lead to under or over provisioning that can both lead to SLA or budget constraint violations. Therefore,in this study we first model the user request, which consist of Big Data analytics jobs with budget and deadline. Then, we model infrastructures as a list of data centers, virtual machines (offered in a pay-as-you-go model), data sources, and network throughputs. After that, to address the aforementioned issues, we propose and compare cost-aware and SLA-based algorithms which provision cloud resources and schedule analytics tasks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向大数据分析的基于sla的云资源配置与调度
数据的惊人增长极大地影响了组织。他们的基础设施和传统的数据管理系统已经跟不上大数据的规模。他们要么在基础设施上投入巨资,要么将大数据分析转移到云端,在那里他们可以从按需可扩展性和现代数据管理技术中受益。然而,为了让更广泛的企业可以使用云托管的大数据分析,我们必须在预算和服务水平目标方面仔细捕捉他们的偏好。因此,本研究旨在为云中的大数据应用提出一种SLA和成本意识资源配置和任务调度方法。目前的方法假设数据在部署大数据应用程序之前预先存储在集群节点中。此外,它们的重点纯粹是任务调度,而不是虚拟机供应。我们认为,在云计算上下文中,这是不适用的,因为节点是动态供应的(数据不能预先存储),将供应留给用户可能会导致供应不足或供应过剩,从而导致违反SLA或预算约束。因此,在本研究中,我们首先对用户请求进行建模,其中包括具有预算和截止日期的大数据分析工作。然后,我们将基础设施建模为数据中心、虚拟机(在即用即付模型中提供)、数据源和网络吞吐量的列表。之后,为了解决上述问题,我们提出并比较了提供云资源和调度分析任务的成本感知算法和基于sla的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Outsourcing Resource-Intensive Tasks from Mobile Apps to Clouds: Android and Aneka Integration Big Data Infrastructure for Aviation Data Analytics Cloud Based Self Driving Cars Analyzing User Behavior Using Keystroke Dynamics to Protect Cloud from Malicious Insiders A Cloud Computing Service Level Agreement Framework with Negotiation and Secure Monitoring
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1