官术网_书友最值得收藏!

Defining HDFS

The Hadoop Distributed File System (HDFS), as the name suggests, is a distributed filesystem based on the lines of the Google File System written in Java. In practice, HDFS resembles closely any other UNIX filesystem with support for common file operations such as ls, cp, rm, du, cat, and so on. However what makes HDFS stand out, despite its simplicity, is its mechanism to handle node failure in the Hadoop cluster without effectively changing the search time for accessing stored files. The HDFS cluster consists of two major components: DataNodes and NameNode.

HDFS has a unique way of storing data on HDFS clusters (cheap commodity networked commodity computers). It splits the regular file in smaller chunks called blocks and then makes an exact number of copies of such chunks depending on the replication factor for that file. After that, it copies such chunks to different DataNodes of the cluster.

主站蜘蛛池模板: 镇赉县| 郑州市| 长治市| 宿州市| 廉江市| 巴林左旗| 松溪县| 鱼台县| 承德县| 贵州省| 建湖县| 西乌珠穆沁旗| 沂源县| 丰镇市| 壶关县| 通江县| 永安市| 望江县| 建湖县| 兴化市| 甘泉县| 洮南市| 竹山县| 千阳县| 湖口县| 兴化市| 彭泽县| 周口市| 内江市| 堆龙德庆县| 古田县| 双城市| 铁力市| 休宁县| 廊坊市| 河北区| 平谷区| 揭东县| 桦南县| 河源市| 台前县|