- Hands-On Big Data Analytics with PySpark
- Rudy Lai Bart?omiej Potaczek
- 144字
- 2021-06-24 15:52:32
Installing Pyspark and Setting up Your Development Environment
In this chapter, we are going to introduce Spark and learn the core concepts, such as, SparkContext, and Spark tools such as SparkConf and Spark shell. The only prerequisite is the knowledge of basic Python concepts and the desire to seek insight from big data. We will learn how to analyze and discover patterns with Spark SQL to improve our business intelligence. Also, you will be able to quickly iterate through your solution by setting to PySpark for your own computer. By the end of the book, you will be able to work with real-life messy data sets using PySpark to get practical big data experience.
In this chapter, we will cover the following topics:
- An overview of PySpark
- Setting up Spark on Windows and PySpark
- Core concepts in Spark and PySpark
推薦閱讀
- 漫話大數(shù)據(jù)
- 數(shù)據(jù)浪潮
- 大數(shù)據(jù)算法
- 達(dá)夢(mèng)數(shù)據(jù)庫性能優(yōu)化
- MySQL 8.x從入門到精通(視頻教學(xué)版)
- 數(shù)據(jù)庫技術(shù)及應(yīng)用教程
- SQL優(yōu)化最佳實(shí)踐:構(gòu)建高效率Oracle數(shù)據(jù)庫的方法與技巧
- 數(shù)據(jù)庫原理與設(shè)計(jì)(第2版)
- LabVIEW 完全自學(xué)手冊(cè)
- Flutter Projects
- 辦公應(yīng)用與計(jì)算思維案例教程
- HikariCP連接池實(shí)戰(zhàn)
- Oracle數(shù)據(jù)庫管理、開發(fā)與實(shí)踐
- 二進(jìn)制分析實(shí)戰(zhàn)
- R Machine Learning Essentials