- Big Data Analytics with Hadoop 3
- Sridhar Alla
- 292字
- 2021-06-25 21:26:05
Starting HDFS
Follow these steps as shown to start HDFS (NameNode and DataNode):
- Format the filesystem:
$ ./bin/hdfs namenode -format
- Start the NameNode daemon and the DataNode daemon:
$ ./sbin/start-dfs.sh
The Hadoop daemon log output is written to the $HADOOP_LOG_DIR directory (defaults to $HADOOP_HOME/logs).
- Browse the web interface for the NameNode; by default it is available at http://localhost:9870/.
- Make the HDFS directories required to execute MapReduce jobs:
$ ./bin/hdfs dfs -mkdir /user
$ ./bin/hdfs dfs -mkdir /user/<username>
- When you're done, stop the daemons with the following:
$ ./sbin/stop-dfs.sh
- Open a browser to check your local Hadoop, which can be launched in the browser as http://localhost:9870/. The following is what the HDFS installation looks like:

- Clicking on the Datanodes tab shows the nodes as shown in the following screenshot:

Figure: Screenshot showing the nodes in the Datanodes tab
- Clicking on the logs will show the various logs in your cluster, as shown in the following screenshot:

- As shown in the following screenshot, you can also look at the various JVM metrics of your cluster components:

- As shown in the following screenshot, you can also check the configuration. This is a good place to look at the entire configuration and all the default settings:

- You can also browse the filesystem of your newly installed cluster, as shown in the following screenshot:

Figure: Screenshot showing the Browse Directory and how you can browse the filesystem in you newly installed cluster
At this point, we should all be able to see and use a basic HDFS cluster. But this is just a HDFS filesystem with some directories and files. We also need a job/task scheduling service to actually use the cluster for computational needs rather than just storage.
推薦閱讀
- 后稀缺:自動化與未來工作
- 21小時學通AutoCAD
- 精通MATLAB神經網絡
- Java開發技術全程指南
- 大數據時代的數據挖掘
- 小型電動機實用設計手冊
- Linux:Powerful Server Administration
- Docker on Amazon Web Services
- Windows Server 2008 R2活動目錄內幕
- R Data Analysis Projects
- 基于人工免疫原理的檢測系統模型及其應用
- 西門子S7-1200/1500 PLC從入門到精通
- Moodle 2.0 Course Conversion(Second Edition)
- Embedded Linux Development using Yocto Projects(Second Edition)
- PostgreSQL 10 High Performance