官术网_书友最值得收藏!

Setting up Hadoop v2 on your local machine

This recipe describes how to set up Hadoop v2 on your local machine using the local mode. Local mode is a non-distributed mode that can be used for testing and debugging your Hadoop applications. When running a Hadoop application in local mode, all the required Hadoop components and your applications execute inside a single Java Virtual Machine (JVM) process.

Getting ready

Download and install JDK 1.6 or a higher version, preferably the Oracle JDK 1.7. Oracle JDK can be downloaded from http://www.oracle.com/technetwork/java/javase/downloads/index.html.

How to do it...

Now let's start the Hadoop v2 installation:

  1. Download the most recent Hadoop v2 branch distribution (Hadoop 2.2.0 or later) from http://hadoop.apache.org/releases.html.
  2. Unzip the Hadoop distribution using the following command. You will have to change the x.x. in the filename to the actual release you have downloaded. From this point onward, we will call the unpacked Hadoop directory {HADOOP_HOME}:
    $ tar -zxvf hadoop-2.x.x.tar.gz
    
  3. Now, you can run Hadoop jobs through the {HADOOP_HOME}/bin/hadoop command, and we will elaborate on that further in the next recipe.

How it works...

Hadoop local mode does not start any servers but does all the work within a single JVM. When you submit a job to Hadoop in local mode, Hadoop starts a JVM to execute the job. The output and the behavior of the job is the same as a distributed Hadoop job, except for the fact that the job only uses the current node to run the tasks and the local filesystem is used for the data storage. In the next recipe, we will discover how to run a MapReduce program using the Hadoop local mode.

主站蜘蛛池模板: 杭州市| 腾冲县| 青铜峡市| 禹城市| 陆川县| 宁陵县| 漳平市| 沙河市| 德阳市| 鹰潭市| 阿图什市| 梁平县| 彭水| 共和县| 浦江县| 临洮县| 仁化县| 商都县| 黄平县| 新河县| 伽师县| 石楼县| 栾川县| 蕉岭县| 贺州市| 平凉市| 湟源县| 弥渡县| 沙湾县| 和田县| 合川市| 泌阳县| 太白县| 赫章县| 马龙县| 白玉县| 界首市| 蓬莱市| 罗源县| 汽车| 泊头市|