官术网_书友最值得收藏!

  • Hadoop Beginner's Guide
  • Garry Turkington
  • 179字
  • 2021-07-29 16:51:35

Time for action – changing the base HDFS directory

Let's first set the base directory that specifies the location on the local filesystem under which Hadoop will keep all its data. Carry out the following steps:

  1. Create a directory into which Hadoop will store its data:
    $ mkdir /var/lib/hadoop
    
  2. Ensure the directory is writeable by any user:
    $ chmod 777 /var/lib/hadoop
    
  3. Modify core-site.xml once again to add the following property:
    <property>
    <name>hadoop.tmp.dir</name>
    <value>/var/lib/hadoop</value>
    </property>

What just happened?

As we will be storing data in Hadoop and all the various components are running on our local host, this data will need to be stored on our local filesystem somewhere. Regardless of the mode, Hadoop by default uses the hadoop.tmp.dir property as the base directory under which all files and data are written.

MapReduce, for example, uses a /mapred directory under this base directory; HDFS uses /dfs. The danger is that the default value of hadoop.tmp.dir is /tmp and some Linux distributions delete the contents of /tmp on each reboot. So it's safer to explicitly state where the data is to be held.

主站蜘蛛池模板: 浙江省| 来凤县| 上栗县| 通榆县| 贵南县| 沂源县| 永仁县| 金平| 汉中市| 伊春市| 瑞金市| 新安县| 新干县| 繁峙县| 安西县| 宿迁市| 绥阳县| 华蓥市| 闵行区| 邵东县| 新和县| 梨树县| 周至县| 伊金霍洛旗| 元朗区| 公安县| 游戏| 公安县| 枣阳市| 孝昌县| 济阳县| 宿州市| 天全县| 英吉沙县| 闵行区| 宜丰县| 康定县| 藁城市| 新田县| 方城县| 措勤县|