- Apache Spark 2.x for Java Developers
- Sourav Gulati Sumit Kumar
- 151字
- 2021-07-02 19:01:52
Defining HDFS
The Hadoop Distributed File System (HDFS), as the name suggests, is a distributed filesystem based on the lines of the Google File System written in Java. In practice, HDFS resembles closely any other UNIX filesystem with support for common file operations such as ls, cp, rm, du, cat, and so on. However what makes HDFS stand out, despite its simplicity, is its mechanism to handle node failure in the Hadoop cluster without effectively changing the search time for accessing stored files. The HDFS cluster consists of two major components: DataNodes and NameNode.
HDFS has a unique way of storing data on HDFS clusters (cheap commodity networked commodity computers). It splits the regular file in smaller chunks called blocks and then makes an exact number of copies of such chunks depending on the replication factor for that file. After that, it copies such chunks to different DataNodes of the cluster.
- Learning LibGDX Game Development(Second Edition)
- Learn Type:Driven Development
- Python for Secret Agents:Volume II
- vSphere High Performance Cookbook
- ASP.NET Core Essentials
- Mastering LibGDX Game Development
- Building Mapping Applications with QGIS
- 零基礎入門學習Python
- 深入淺出Serverless:技術原理與應用實踐
- MATLAB for Machine Learning
- Learning Concurrent Programming in Scala
- Vue.js 3應用開發與核心源碼解析
- Instant Zurb Foundation 4
- After Effects CC案例設計與經典插件(視頻教學版)
- After Effects CC技術大全