Modeling the Performance of MapReduce under Resource Contentions and Task Failures

2013 IEEE 5th International Conference on Cloud Computing Technology and Science. Pub Date: 2013-12-02. DOI: 10.1109/CloudCom.2013.28

Xiaolong Cui, Xuelian Lin, Chunming Hu, Richong Zhang, Chengzhang Wang

Citations: 16

Abstract

MapReduce is a widely used programming model for large-scale data processing. Estimating the performance of a MapReduce job and analyzing its bottlenecks requires a practical performance model. Much work has been done on modeling the performance of MapReduce jobs. However, existing performance models ignore some important factors, such as I/O congestion and task failures across the cluster, which may significantly change the execution cost of a MapReduce job. This paper, aiming to predict the execution time of a MapReduce job, presents an enhanced performance model that takes resource contention and task failures into account. The experimental results show that the model is more accurate than models that do not consider contention and failure factors.


