Introduction to Hadoop

This chapter introduces Hadoop and its core components, namely the Hadoop Distributed File System (HDFS) and MapReduce. We start with the changes and new features in the Hadoop 3 release, in particular the new features of HDFS and Yet Another Resource Negotiator (YARN) and the changes to client applications. We then install a Hadoop cluster locally and demonstrate new features such as erasure coding (EC) and the timeline service. As a quick note, Chapter 10, Visualizing Big Data, shows you how to create a Hadoop cluster in AWS.

In a nutshell, the following topics will be covered throughout this chapter:

  • HDFS
    • High availability
    • Intra-DataNode balancer
    • EC
    • Port mapping
  • MapReduce
    • Task-level optimization
  • YARN
    • Opportunistic containers
    • Timeline service v.2
    • Docker containerization
  • Other changes
  • Installation of Hadoop 3.1
    • HDFS
    • YARN
    • EC
    • Timeline service v.2