  • Learning Apache Spark 2
  • Muhammad Asif Abbasi
  • 2021-07-09 18:46:01

Chapter 3. ETL with Spark

So far we have gone through the architecture of Spark and discussed RDDs in some detail. By the end of Chapter 2, Transformations and Actions with Spark RDDs, we had focused on PairRDDs and some of the transformations.

This chapter focuses on doing ETL with Apache Spark. We'll cover the following topics, which should help you take the next step with Apache Spark:

  • Understanding the ETL process
  • Commonly supported file formats
  • Commonly supported filesystems
  • Working with NoSQL databases

Let's get started!
