官术网_书友最值得收藏!

A dual approach

In this book, we will discuss both the building and the management of local Hadoop clusters in addition to showing how to push the processing into the cloud via EMR.

The reason for this is twofold: firstly, though EMR makes Hadoop much more accessible, there are aspects of the technology that only become apparent when manually administering the cluster. Although it is also possible to use EMR in a more manual mode, we'll generally use a local cluster for such explorations. Secondly, though it isn't necessarily an either/or decision, many organizations use a mixture of in-house and cloud-hosted capacities, sometimes due to a concern of over reliance on a single external provider, but practically speaking, it's often convenient to do development and small-scale tests on local capacity and then deploy at production scale into the cloud.

In a few of the later chapters, where we discuss additional products that integrate with Hadoop, we'll mostly give examples of local clusters, as there is no difference between how the products work regardless of where they are deployed.

主站蜘蛛池模板: 绍兴市| 报价| 上饶县| 安乡县| 阜宁县| 遵化市| 天长市| 东明县| 朝阳区| 景德镇市| 沐川县| 潞西市| 临高县| 衡南县| 噶尔县| 石嘴山市| 辽宁省| 淄博市| 军事| 达尔| 东乡| 双峰县| 商城县| 漠河县| 台北县| 宝山区| 松桃| 金坛市| 乌鲁木齐县| 嘉峪关市| 留坝县| 夏津县| 浙江省| 德令哈市| 海丰县| 昌都县| 政和县| 大兴区| 临邑县| 琼结县| 太谷县|