Self-Configuration of the Number of Concurrently Running MapReduce Jobs in a Hadoop Cluster

2015 IEEE International Conference on Autonomic Computing Pub Date : 2015-07-07 DOI:10.1109/ICAC.2015.54

Bo Zhang, Filip Krikava, Romain Rouvoy, L. Seinturier

引用次数: 11

Abstract

There is a trade-off between the number of concurrently running MapReduce jobs and their corresponding map and reduce tasks within a node in a Hadoop cluster. Leaving this trade-off statically configured to a single value can significantly reduce job response times leaving only sub optimal resource usage. To overcome this problem, we propose a feedback control loop based approach that dynamically adjusts the Hadoop resource manager configuration based on the current state of the cluster. The preliminary assessment based on workloads synthesized from real-world traces shows that the system performance can be improved by about 30% compared to default Hadoop setup.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Hadoop集群MapReduce并发作业数自配置

在Hadoop集群的节点中，并发运行MapReduce作业的数量与其对应的map和reduce任务之间存在权衡。将这种权衡静态配置为单个值可以显著减少作业响应时间，只留下次优的资源使用。为了克服这个问题，我们提出了一种基于反馈控制循环的方法，该方法可以根据集群的当前状态动态调整Hadoop资源管理器配置。基于从实际跟踪中合成的工作负载的初步评估表明，与默认Hadoop设置相比，系统性能可以提高约30%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 IEEE International Conference on Autonomic Computing

自引率

0.00%

发文量

期刊最新文献

A Control-Based Approach to Autonomic Performance Management in Computing Systems Trace Analysis for Fault Detection in Application Servers A Programming System for Autonomic Self-Managing Applications A Taxonomy for Self-∗ Properties in Decentralized Autonomic Computing Transparent Autonomization in Composite Systems