官术网_书友最值得收藏!

What this book covers

Chapter 1Installing Spark and Setting Up Your Cluster, details some common methods for setting up Spark.

Chapter 2, Using the Spark Shell, introduces the command line for Spark. The shell is good for trying out quick program snippets or just figuring out the syntax of a call interactively.

Chapter 3, Building and Running a Spark Application, covers the ways for compiling Spark applications.

Chapter 4Creating a SparkSession Object, describe the programming aspects of the connection to a spark server regarding the Spark session and the enclosed spark context.

Chapter 5Loading and Saving Data in Spark, deals with how we can get data in and out of a spark environment.

Chapter 6Manipulating Your RDD, describes how to program Resilient Distributed Datasets, which is the fundamental data abstraction layer in Spark that makes all the magic possible.

Chapter 7Spark 2.0 Concepts, is a short, interesting chapter that discusses the evolution of Spark and the concepts underpinning the Spark 2.0 release, which is a major milestone.

Chapter 8 , Spark SQL, deals with the SQL interface in Spark. Spark SQL probably is the most widely used feature.

Chapter 9, Foundations of Datasets/DataFrames – The Proverbial Workhorse for DataScientists, is another interesting chapter, which introduces the Datasets/DataFrames that are added in the Spark 2.0 release.

Chapter 10, Spark with Big Data, describes the interfaces with Parquet and HBase.

Chapter 11Machine Learning with Spark ML Pipelines, is my favorite chapter. We talk about regression, classification, clustering, and recommendation in this chapter. This is probably the largest chapter in this book. If you are stranded in a remote island and could take only one chapter with you, this should be the one!

Chapter 12, GraphX, talks about an important capability, processing graphs at scale, and also discusses interesting algorithms such as PageRank.

主站蜘蛛池模板: 女性| 海兴县| 呼玛县| 苏尼特左旗| 正定县| 当雄县| 张家口市| 芜湖县| 泸定县| 伊宁县| 津市市| 枣阳市| 宿松县| 湘乡市| 隆子县| 安仁县| 丹寨县| 白水县| 漾濞| 苏尼特左旗| 保德县| 湖北省| 鲜城| 平泉县| 红河县| 冕宁县| 巫山县| 德安县| 丹东市| 改则县| 开封市| 新疆| 丰镇市| 天等县| 玉山县| 司法| 西林县| 临汾市| 霍邱县| 武威市| 阿鲁科尔沁旗|