Adapting MapReduce for HPC environments

IEEE International Symposium on High-Performance Parallel Distributed Computing Pub Date : 2011-06-08 DOI:10.1145/1996130.1996166

Zacharia Fadika, Elif Dede, M. Govindaraju, L. Ramakrishnan

引用次数: 7

Abstract

MapReduce is increasingly gaining popularity as a programming model for use in large-scale distributed processing. The model is most widely used when implemented using the Hadoop Distributed File System (HDFS). The use of the HDFS, however, precludes the direct applicability of the model to HPC environments, which use high performance distributed file systems. In such distributed environments, the MapReduce model can rarely make use of full resources, as local disks may not be available for data placement on all the nodes. This work proposes a MapReduce implementation and design choices directly suitable for such HPC environments.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

为HPC环境调整MapReduce

MapReduce作为一种用于大规模分布式处理的编程模型越来越受欢迎。该模型在使用HDFS (Hadoop Distributed File System)实现时使用最为广泛。然而，HDFS的使用阻碍了该模型直接适用于使用高性能分布式文件系统的HPC环境。在这种分布式环境中，MapReduce模型很少能够充分利用资源，因为本地磁盘可能无法用于在所有节点上放置数据。这项工作提出了一个MapReduce实现和设计选择，直接适用于这种高性能计算环境。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE International Symposium on High-Performance Parallel Distributed Computing

自引率

0.00%

发文量