- Apache Spark Machine Learning Blueprints
- Alex Liu
- 156字
- 2021-07-16 10:39:48
Chapter 1. Spark for Machine Learning
This chapter provides an introduction to Apache Spark from a Machine Learning (ML) and data analytics perspective, and also discusses machine learning in relation to Spark computing. Here, we first present an overview of Apache Spark, as well as Spark's advantages for data analytics, in comparison to MapReduce and other computing platforms. Then we discuss five main issues, as below:
- Machine learning algorithms and libraries
- Spark RDD and dataframes
- Machine learning frameworks
- Spark pipelines
- Spark notebooks
All of the above are the most important topics that any data scientist or machine learning professional is expected to master, in order to fully take advantage of Apache Spark computing. Specifically, this chapter will cover all of the following six topics.
- Spark overview and Spark advantages
- ML algorithms and ML libraries for Spark
- Spark RDD and dataframes
- ML Frameworks, RM4Es and Spark computing
- ML workflows and Spark pipelines
- Spark notebooks introduction
推薦閱讀
- Hands-On Graph Analytics with Neo4j
- Learning Microsoft Azure Storage
- 自動檢測與轉換技術
- OpenStack Cloud Computing Cookbook(Second Edition)
- 大數據技術與應用
- CompTIA Network+ Certification Guide
- Hands-On Reactive Programming with Reactor
- Salesforce for Beginners
- 實用網絡流量分析技術
- SQL Server數據庫應用基礎(第2版)
- Mastering Ceph
- 從零開始學JavaScript
- 菜鳥起飛電腦組裝·維護與故障排查
- Win 7二十一
- 微控制器的選擇與應用