Installing PySpark and Setting Up Your Development Environment

In this chapter, we introduce Spark and its core concepts, such as SparkContext, along with Spark tools such as SparkConf and the Spark shell. The only prerequisites are knowledge of basic Python concepts and a desire to seek insight from big data. We will learn how to analyze data and discover patterns with Spark SQL to improve our business intelligence. You will also be able to iterate quickly on your solutions by setting up PySpark on your own computer. By the end of the book, you will be able to work with real-life, messy datasets using PySpark and gain practical big data experience.

In this chapter, we will cover the following topics:

  • An overview of PySpark
  • Setting up Spark and PySpark on Windows
  • Core concepts in Spark and PySpark