
  • Learning Spark SQL
  • Aurobindo Sarkar
  • 74 words
  • 2021-07-02 18:23:49

Summary

In this chapter, we demonstrated using Spark SQL for exploring Datasets, performing basic data quality checks, generating samples and pivot tables, and visualizing data with Apache Zeppelin.

In the next chapter, we will shift our focus to data munging (also called data wrangling). We will introduce techniques for handling missing data, bad data, duplicate records, and so on. We will also work through extensive hands-on sessions that demonstrate Spark SQL on common data munging tasks.
