官术网_书友最值得收藏!

The fundamental premise of Hadoop

The fundamental premise of Hadoop is that instead of attempting to perform a task on a single large machine, the task can be subpided into smaller segments that can then be delegated to multiple smaller machines. These so-called smaller machines would then perform the task on their own portion of the data. Once the smaller machines have completed their tasks to produce the results on the tasks they were allocated, the inpidual units of results would then be aggregated to produce the final result.

Although, in theory, this may appear relatively simple, there are various technical considerations to bear in mind. For example:

  • Is the network fast enough to collect the results from each inpidual server?
  • Can each inpidual server read data fast enough from the disk?
  • If one or more of the servers fail, do we have to start all over?
  • If there are multiple large tasks, how should they be prioritized?

There are many more such considerations that must be considered when working with a distributed architecture of this nature.

主站蜘蛛池模板: 沁源县| 应用必备| 晋中市| 隆德县| 南岸区| 靖江市| 马公市| 青铜峡市| 华容县| 广德县| 泽库县| 浮梁县| 五家渠市| 濮阳县| 沙湾县| 罗源县| 江陵县| 中江县| 民勤县| 旬邑县| 牙克石市| 临邑县| 睢宁县| 龙海市| 明水县| 双桥区| 西昌市| 浪卡子县| 论坛| 五家渠市| 内江市| 株洲市| 武平县| 沈阳市| 阳新县| 玉环县| 霍林郭勒市| 牡丹江市| 潞城市| 大渡口区| 鄂伦春自治旗|