Title: Learning Spark SQL
Author: Aurobindo Sarkar
Chapter word count: 74
Last updated: 2021-07-02 18:23:49

Summary

In this chapter, we demonstrated how to use Spark SQL to explore Datasets, perform basic data quality checks, generate samples and pivot tables, and visualize data with Apache Zeppelin.

In the next chapter, we will shift our focus to data munging/wrangling. We will introduce techniques for handling missing data, bad data, duplicate records, and so on, and work through extensive hands-on sessions that use Spark SQL for common data munging tasks.
