官术网_书友最值得收藏!

The MapReduce framework

MapReduce is a framework used to compute a large amount of data in a Hadoop cluster. MapReduce uses YARN to schedule the mappers and reducers as tasks, using the containers. 

An example of a MapReduce job to count frequencies of words is shown in the following diagram:

MapReduce works closely with YARN to plan the job and the various tasks in the job, requests computing resources from the cluster manager (resource manager), schedules the execution of the tasks on the compute resources of the cluster, and then executes the plan of execution. Using MapReduce, you can read write many different types of files of varying formats and perform very complex computations in a distributed manner. We will see more of this in the next chapter on MapReduce frameworks.

主站蜘蛛池模板: 榆树市| 安岳县| 敦化市| 遵义县| 百色市| 岳阳县| 华蓥市| 浦县| 永定县| 天气| 兴仁县| 建阳市| 德庆县| 盐池县| 南木林县| 达州市| 阳谷县| 开鲁县| 临颍县| 贵阳市| 濮阳市| 邳州市| 宜章县| 宁乡县| 北辰区| 甘肃省| 新泰市| 平山县| 武宁县| 长丰县| 吉水县| 吉首市| 常德市| 辽宁省| 北流市| 定结县| 尼勒克县| 北海市| 包头市| 蒙城县| 晋中市|