The Analytics Toolkit

Several platforms are in use today for large-scale data analytics. Broadly, they divide into platforms used primarily for data mining, such as analysis of large datasets using NoSQL platforms, and platforms used for data science, that is, machine learning and predictive analytics. Often a single solution has both characteristics: a robust underlying platform for storing and managing data, with tools built on top of it that provide additional data science capabilities.

In this chapter, we will show you how to install and configure your Analytics Toolkit, a collection of software that we'll use throughout the remaining chapters. We will cover the following topics:

  • Components of the Analytics Toolkit
  • System recommendations
    • Installing on a laptop or workstation
    • Installing on the cloud
  • Installing Hadoop
    • Hadoop distributions
    • Cloudera Distribution of Hadoop (CDH)
  • Installing Spark
  • Installing R and Python