- Learning Spark SQL
- Aurobindo Sarkar
- 113字
- 2021-07-02 18:23:45
Using Spark SQL for Data Exploration
In this chapter, we will introduce you to using Spark SQL for exploratory data analysis. We will introduce preliminary techniques to compute some basic statistics, identify outliers, and visualize, sample, and pivot data. A series of hands-on exercises in this chapter will enable you to use Spark SQL along with tools such as Apache Zeppelin for developing an intuition about your data.
In this chapter, we shall look at the following topics:
- What is Exploratory Data Analysis (EDA)
- Why is EDA important?
- Using Spark SQL for basic data analysis
- Visualizing data with Apache Zeppelin
- Sampling data with Spark SQL APIs
- Using Spark SQL for creating pivot tables
推薦閱讀
- Oracle從入門到精通(第3版)
- Spring Boot開發與測試實戰
- arc42 by Example
- Building Mobile Applications Using Kendo UI Mobile and ASP.NET Web API
- PostgreSQL 11從入門到精通(視頻教學版)
- Mastering Apache Spark 2.x(Second Edition)
- HTML5+CSS3網頁設計
- Mastering JavaScript Design Patterns(Second Edition)
- Tableau 10 Bootcamp
- Advanced Express Web Application Development
- MySQL程序員面試筆試寶典
- Simulation for Data Science with R
- Python預測分析實戰
- Oracle SOA Suite 12c Administrator's Guide
- HTML5+CSS3+jQuery Mobile+Bootstrap開發APP從入門到精通(視頻教學版)