- Big Data Analytics
- Venkat Ankam
- 111字
- 2021-08-20 10:32:24
Chapter 3. Deep Dive into Apache Spark
Apache Spark is growing at a fast pace in terms of technology, community, and user base. Two new APIs were introduced in 2015: the DataFrame API and DataSet API. These two APIs are built on top of the core API, which is based on RDDs. It is essential to understand the deeper concepts of RDDs including runtime architecture and behavior on various resource managers of Spark.
This chapter is divided into the following sub topics:
- Starting Spark daemons
- Spark core concepts
- Pairing RDDs
- The lifecycle of a Spark program
- Spark applications
- Persistence and caching
- Spark resource managers—Standalone, Yarn, and Mesos
推薦閱讀
- 數(shù)字媒體應(yīng)用教程
- 深入淺出Electron:原理、工程與實踐
- Interactive Data Visualization with Python
- MATLAB定量決策五大類問題
- Java EE 7 Development with NetBeans 8
- HTML5從入門到精通 (第2版)
- Visual Basic程序設(shè)計教程
- The Professional ScrumMaster’s Handbook
- JavaScript程序設(shè)計(第2版)
- Android Studio開發(fā)實戰(zhàn):從零基礎(chǔ)到App上線 (移動開發(fā)叢書)
- 軟件測試技術(shù)
- 計算機應(yīng)用基礎(chǔ)案例教程(第二版)
- Java Web開發(fā)教程:基于Struts2+Hibernate+Spring
- Visual Basic語言程序設(shè)計上機指導(dǎo)與練習(xí)(第3版)
- Python繪圖指南:分形與數(shù)據(jù)可視化(全彩)