官术网_书友最值得收藏!

Intra-DataNode balancer

HDFS has a way to balance the data blocks across the data nodes, but there is no such balancing inside the same data node with multiple hard disks. Hence, a 12-spindle DataNode can have out of balance physical disks. But why does this matter to performance? Well, by having out of balance disks, the blocks at DataNode level might be the same as other DataNodes but the reads/writes will be skewed because of imbalanced disks. Hence, Hadoop 3.x introduces the intra-node balancer to balance the physical disks inside each data node to reduce the skew of the data. 

This increases the reads and writes performed by any process running on the cluster, such as a mapper or reducer.

主站蜘蛛池模板: 五常市| 武胜县| 保定市| 苏尼特右旗| 日喀则市| 临猗县| 塔河县| 唐河县| 宜宾县| 亚东县| 甘谷县| 宜川县| 八宿县| 康马县| 茶陵县| 西峡县| 芒康县| 安丘市| 安图县| 大连市| 文化| 礼泉县| 焦作市| 仙居县| 资溪县| 教育| 芜湖县| 曲靖市| 包头市| 麻江县| 新安县| 文成县| 沙坪坝区| 涟源市| 云林县| 景德镇市| 祁连县| 山阴县| 广东省| 增城市| 应城市|