- Learning Spark SQL
- Aurobindo Sarkar
- 151字
- 2021-07-02 18:23:43
Using Spark SQL for Processing Structured and Semistructured Data
In this chapter, we will familiarize you with using Spark SQL with different types of data sources and data storage formats. Spark provides easy and standard structures (that is, RDDs and DataFrames/Datasets) to work with both structured and semistructured data. We include some of the data sources that are most commonly used in big data applications, such as, relational data, NoSQL databases, and files (CSV, JSON, Parquet, and Avro). Spark also allows you to define and use custom data sources. A series of hands-on exercises in this chapter will enable you to use Spark with different types of data sources and data formats.
In this chapter, you shall learn the following topics:
- Understanding data sources in Spark applications
- Using JDBC to work with relational databases
- Using Spark with MongoDB (NoSQL database)
- Working with JSON data
- Using Spark with Avro and Parquet Datasets
推薦閱讀
- Java EE框架整合開發(fā)入門到實(shí)戰(zhàn):Spring+Spring MVC+MyBatis(微課版)
- Django Design Patterns and Best Practices
- Working with Odoo
- C# and .NET Core Test Driven Development
- Orleans:構(gòu)建高性能分布式Actor服務(wù)
- 實(shí)戰(zhàn)Java高并發(fā)程序設(shè)計(jì)(第2版)
- 30天學(xué)通C#項(xiàng)目案例開發(fā)
- 超簡(jiǎn)單:用Python讓Excel飛起來(實(shí)戰(zhàn)150例)
- UI設(shè)計(jì)基礎(chǔ)培訓(xùn)教程(全彩版)
- 面向?qū)ο蟪绦蛟O(shè)計(jì)及C++(第3版)
- Java Web開發(fā)教程:基于Struts2+Hibernate+Spring
- SFML Game Development
- 算法超簡(jiǎn)單:趣味游戲帶你輕松入門與實(shí)踐
- Node.js應(yīng)用開發(fā)
- Android應(yīng)用開發(fā)攻略