Chapter 3. Deep Dive into Apache Spark

Apache Spark is growing at a fast pace in terms of technology, community, and user base. Two new APIs were introduced in 2015: the DataFrame API and the Dataset API. Both are built on top of the core API, which is based on RDDs. To use them well, it is essential to understand RDDs in depth, including Spark's runtime architecture and its behavior on the various resource managers.

This chapter is divided into the following subtopics:

  • Starting Spark daemons
  • Spark core concepts
  • Pairing RDDs
  • The lifecycle of a Spark program
  • Spark applications
  • Persistence and caching
  • Spark resource managers: Standalone, YARN, and Mesos