官术网_书友最值得收藏!

Planning and sizing clusters

Once you start working on problems and implementing Hadoop clusters, you'll have to deal with the issue of sizing. It's not just the sizing aspect of clusters that needs to be considered, but the SLAs associated with Hadoop runtime as well. A cluster can be categorized based on workloads as follows:

  • Lightweight: This category is intended for low computation and fewer storage requirements, and is more useful for defined datasets with no growth
  • Balanced: A balanced cluster can have storage and computation requirements that grow over time
  • Storage-centric: This category is more focused towards storing data, and less towards computation; it is mostly used for archival purposes, as well as minimal processing
  • Computational-centric: This cluster is intended for high computation which requires CPU or GPU-intensive work, such as analytics, prediction, and data mining

Before we get on to solve the sizing problem of a Hadoop cluster, however, we have to understand the following topics.

主站蜘蛛池模板: 德保县| 德清县| 安乡县| 砀山县| 汤原县| 乌什县| 宁都县| 彰化市| 博白县| 沙雅县| 衢州市| 上思县| 青冈县| 洪湖市| 新竹县| 元氏县| 文登市| 陇南市| SHOW| 新余市| 宁南县| 绥江县| 台东市| 湟源县| 余姚市| 修水县| 宣化县| 兴仁县| 班玛县| 麻城市| 宝清县| 西青区| 会宁县| 毕节市| 淮安市| 讷河市| 新邵县| 塔河县| 丹凤县| 页游| 铅山县|