A Distributed NameNode Cluster for a Highly-Available Hadoop Distributed File System

2014 IEEE 33rd International Symposium on Reliable Distributed Systems Pub Date : 2014-10-06 DOI:10.1109/SRDS.2014.61

Yonghwan Kim, Tadashi Araragi, Junya Nakamura, T. Masuzawa

引用次数: 11

Abstract

Recently, Hadoop attracts much attention of engineers and researchers as an emerging and effective framework for Big Data. HDFS (Hadoop Distributed File System) can manage huge amount of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called a NameNode, to manage the entire namespace of the file system. This causes the SPOF (Single Point Of Failure) problem because the file system becomes inaccessible when the NameNode fails. This also causes a bottleneck of efficiency since all the access requests to the file system have to contact the NameNode. Finally the scale up of a namespace is difficult because the NameNode manages all metadata of the namespace on its own memory, which is limited and expensive resource. In this paper, we propose a new HDFS architecture consisting of several NameNodes to resolve all the above problems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高可用性Hadoop分布式文件系统的分布式NameNode集群

最近，Hadoop作为一个新兴的、有效的大数据框架受到了工程师和研究人员的广泛关注。HDFS (Hadoop Distributed File System, Hadoop分布式文件系统)仅使用普通硬件就能以高性能和可靠性管理海量数据。然而，HDFS需要一个主节点，称为NameNode，来管理文件系统的整个命名空间。这将导致单点故障(SPOF)问题，因为当NameNode失败时，文件系统将无法访问。这也会导致效率的瓶颈，因为对文件系统的所有访问请求都必须联系NameNode。最后，扩展名称空间很困难，因为NameNode在它自己的内存上管理名称空间的所有元数据，这是一种有限且昂贵的资源。在本文中，我们提出了一个由多个namenode组成的新的HDFS架构来解决上述所有问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE 33rd International Symposium on Reliable Distributed Systems

自引率

0.00%

发文量

期刊最新文献

Modeling Reliability Requirements in Coordinated Node and Link Mapping Fast Repair for Single Failure in Erasure Coding-Based Distributed Storage Systems A Distributed NameNode Cluster for a Highly-Available Hadoop Distributed File System A Convex Hull Query Processing Method in MANETs LO-FA-MO: Fault Detection and Systemic Awareness for the QUonG Computing System