官术网_书友最值得收藏!

Preparing hardware for Hadoop

One important aspect of Hadoop setup is defining the hardware requirements and sizing before the start of a project. Although Apache Hadoop can run on commodity hardware, most of the implementations utilize server-class hardware for their Hadoop cluster. (Look at powered by Hadoop or go through the Facebook Data warehouse research paper in SIGMOD-2010 for more information).

There is no rule of thumb regarding the minimum hardware requirements for setting up Hadoop, but we would recommend the following configurations while running Hadoop to ensure reasonable performance:

  • CPU ≥ 2 Core 2.5 GHz or more frequency
  • Memory 8 GB RAM
  • Storage 100 GB of free space, for running programs and processing data
  • Good internet connection

There is an official Cloudera blog for cluster sizing information if you need more detail. If you are setting up a virtual machine, you can always opt for dynamically sized disks that can be increased based on your needs. We will look at how to size the cluster in the upcoming Hadoop cluster section.

主站蜘蛛池模板: 子长县| 分宜县| 崇州市| 江安县| 广东省| 庆阳市| 班戈县| 遵义市| 容城县| 肥城市| 华阴市| 岳池县| 涪陵区| 临夏市| 贵南县| 新田县| 温泉县| 台东县| 天门市| 临海市| 鸡西市| 邢台县| 元氏县| 嵩明县| 巢湖市| 宁城县| 内黄县| 全州县| 商都县| 大英县| 舒兰市| 霍山县| 万安县| 临桂县| 驻马店市| 平凉市| 洪湖市| 长兴县| 霍山县| 沭阳县| 安徽省|