- Mastering Apache Spark 2.x (Second Edition)
- Romeo Kienzler
- 2021-07-02 18:55:32
The Catalyst Optimizer
The Catalyst Optimizer is one of the most exciting developments in Apache Spark. It frees you from hand-tuning data processing pipelines for efficiency: you describe what to compute, and the optimizer works out an efficient way to execute it.
In this chapter, we introduce the Catalyst Optimizer of Apache Spark SQL, which optimizes queries expressed through SQL, DataFrames, and Datasets.
This chapter will cover the following topics:
- The catalog
- Abstract syntax trees
- The optimization process on logical and physical execution plans
- Code generation
- One practical code walk-through
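To give a flavor of the abstract syntax trees and optimization rules listed above, here is a minimal toy sketch in plain Python. It is not Catalyst's implementation: the `Literal`, `Column`, and `Add` classes and the `fold_constants` function are hypothetical names invented for this illustration. It shows only the general idea of a rule that rewrites an expression tree into a cheaper equivalent, in this case by folding constant sub-expressions.

```python
from dataclasses import dataclass
from typing import Union


# Hypothetical node types for a tiny expression tree (not Spark classes).
@dataclass(frozen=True)
class Literal:
    value: int


@dataclass(frozen=True)
class Column:
    name: str


@dataclass(frozen=True)
class Add:
    left: "Expr"
    right: "Expr"


Expr = Union[Literal, Column, Add]


def fold_constants(expr: Expr) -> Expr:
    """Rewrite the tree bottom-up, replacing Add(Literal, Literal) with one Literal."""
    if isinstance(expr, Add):
        left = fold_constants(expr.left)
        right = fold_constants(expr.right)
        if isinstance(left, Literal) and isinstance(right, Literal):
            return Literal(left.value + right.value)
        return Add(left, right)
    # Literals and column references are already in their simplest form.
    return expr


# Tree for the expression: x + (1 + 2)
tree = Add(Column("x"), Add(Literal(1), Literal(2)))
optimized = fold_constants(tree)
print(optimized)  # Add(left=Column(name='x'), right=Literal(value=3))
```

The rewritten tree computes the same result as the original but evaluates `1 + 2` once, ahead of time, instead of once per row. A real optimizer applies many such rules repeatedly over a much richer tree.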