- Big Data Analytics with Hadoop 3
- Sridhar Alla
- 121字
- 2021-06-25 21:26:01
Intra-DataNode balancer
HDFS has a way to balance the data blocks across the data nodes, but there is no such balancing inside the same data node with multiple hard disks. Hence, a 12-spindle DataNode can have out of balance physical disks. But why does this matter to performance? Well, by having out of balance disks, the blocks at DataNode level might be the same as other DataNodes but the reads/writes will be skewed because of imbalanced disks. Hence, Hadoop 3.x introduces the intra-node balancer to balance the physical disks inside each data node to reduce the skew of the data.
This increases the reads and writes performed by any process running on the cluster, such as a mapper or reducer.
推薦閱讀
- 中文版Photoshop CS5數碼照片處理完全自學一本通
- ROS機器人編程與SLAM算法解析指南
- Matplotlib 3.0 Cookbook
- 工業控制系統測試與評價技術
- Machine Learning with Apache Spark Quick Start Guide
- Chef:Powerful Infrastructure Automation
- 從零開始學PHP
- Salesforce Advanced Administrator Certification Guide
- 格蠹匯編
- 中文版AutoCAD 2013高手速成
- 基于RPA技術財務機器人的應用與研究
- 中國戰略性新興產業研究與發展·數控系統
- PyTorch深度學習
- Win 7二十一
- 系統安裝、維護與數據備份技巧