官术网_书友最值得收藏!

Chapter 2. Storage

After the overview of Hadoop in the previous chapter, we will now start looking at its various component parts in more detail. We will start at the conceptual bottom of the stack in this chapter: the means and mechanisms for storing data within Hadoop. In particular, we will discuss the following topics:

  • Describe the architecture of the Hadoop Distributed File System (HDFS)
  • Show what enhancements to HDFS have been made in Hadoop 2
  • Explore how to access HDFS using command-line tools and the Java API
  • Give a brief description of ZooKeeper—another (sort of) filesystem within Hadoop
  • Survey considerations for storing data in Hadoop and the available file formats

In Chapter 3, Processing – MapReduce and Beyond, we will describe how Hadoop provides the framework to allow data to be processed.

主站蜘蛛池模板: 武宣县| 襄垣县| 平陆县| 昌邑市| 长宁区| 嘉善县| 新余市| 枣庄市| 随州市| 恩施市| 临沭县| 云南省| 武威市| 静乐县| 东乌| 周至县| 阿荣旗| 武陟县| 来凤县| 滕州市| 海兴县| 阿合奇县| 武鸣县| 卓尼县| 汪清县| 大城县| 城步| 屏边| 尚义县| 张家港市| 兴安盟| 泰来县| 南木林县| 桃园县| 西林县| 金乡县| 临颍县| 浦东新区| 镇沅| 平凉市| 闸北区|