- Hands-On Big Data Analytics with PySpark
- Rudy Lai Bart?omiej Potaczek
- 62字
- 2021-06-24 15:52:34
Parallelization with Spark RDDs
Now that we know how to create RDDs within the text file that we received from the internet, we can look at a different way to create this RDD. Let's discuss parallelization with our Spark RDDs.
In this section, we will cover the following topics:
- What is parallelization?
- How do we parallelize Spark RDDs?
Let's start with parallelization.
推薦閱讀
- DB29forLinux,UNIX,Windows數據庫管理認證指南
- Learning Spring Boot
- 新型數據庫系統:原理、架構與實踐
- MySQL從入門到精通(第3版)
- 算法與數據中臺:基于Google、Facebook與微博實踐
- Mastering Machine Learning with R(Second Edition)
- OracleDBA實戰攻略:運維管理、診斷優化、高可用與最佳實踐
- Spark大數據分析實戰
- 深入淺出Greenplum分布式數據庫:原理、架構和代碼分析
- 新基建:數據中心創新之路
- 淘寶、天貓電商數據分析與挖掘實戰(第2版)
- 計算機視覺
- MySQL性能調優與架構設計
- 數據迷霧:洞察數據的價值與內涵
- 數據分析方法及應用:基于SPSS和EXCEL環境