官术网_书友最值得收藏!

Chapter 1. Spark for Machine Learning

This chapter provides an introduction to Apache Spark from a Machine Learning (ML) and data analytics perspective, and also discusses machine learning in relation to Spark computing. Here, we first present an overview of Apache Spark, as well as Spark's advantages for data analytics, in comparison to MapReduce and other computing platforms. Then we discuss five main issues, as below:

  • Machine learning algorithms and libraries
  • Spark RDD and dataframes
  • Machine learning frameworks
  • Spark pipelines
  • Spark notebooks

All of the above are the most important topics that any data scientist or machine learning professional is expected to master, in order to fully take advantage of Apache Spark computing. Specifically, this chapter will cover all of the following six topics.

  • Spark overview and Spark advantages
  • ML algorithms and ML libraries for Spark
  • Spark RDD and dataframes
  • ML Frameworks, RM4Es and Spark computing
  • ML workflows and Spark pipelines
  • Spark notebooks introduction
主站蜘蛛池模板: 韶山市| 小金县| 监利县| 军事| 四会市| 深水埗区| 岗巴县| 汉川市| 上高县| 车险| 嘉峪关市| 府谷县| 云和县| 江安县| 黎城县| 凤庆县| 长子县| 调兵山市| 三穗县| 本溪市| 吉首市| 武功县| 海林市| 仪陇县| 穆棱市| 汝阳县| 从江县| 自贡市| 介休市| 祁门县| 双桥区| 台江县| 平昌县| 高台县| 丰台区| 外汇| 麦盖提县| 浠水县| 井研县| 阿瓦提县| 泗阳县|