官术网_书友最值得收藏!

  • Learning Apache Spark 2
  • Muhammad Asif Abbasi
  • 93字
  • 2021-07-09 18:46:01

Chapter 3. ETL with Spark

So we have gone through the architecture of Spark, and have had some detailed level discussions around RDDs. By the end of Chapter 2Transformations and Actions with Spark RDDs, we had focused on PairRDDs and some of the transformations.

This chapter focuses on doing ETL with Apache Spark. We'll cover the following topics, which hopefully will help you with taking the next step on Apache Spark:

  • Understanding the ETL process
  • Commonly supported file formats
  • Commonly supported filesystems
  • Working with NoSQL databases

Let's get started!

主站蜘蛛池模板: 诸暨市| 肥乡县| 行唐县| 楚雄市| 宜兰市| 阜平县| 济南市| 宁安市| 阳谷县| 老河口市| 梅河口市| 开鲁县| 宿州市| 周宁县| 房产| 威宁| 莒南县| 巢湖市| 进贤县| 烟台市| 宜都市| 民县| 甘泉县| 桑植县| 广平县| 闽侯县| 台北县| 东乡县| 长丰县| 常州市| 璧山县| 项城市| 正定县| 盘锦市| 新源县| 绵竹市| 越西县| 西和县| 专栏| 晋中市| 禄劝|