
What this book covers

Chapter 1, Getting Started with Breeze, serves as an introduction to the Breeze linear algebra library's API.

Chapter 2, Getting Started with Apache Spark DataFrames, introduces the DataFrame, a powerful yet intuitive data abstraction that resembles a relational table.

Chapter 3, Loading and Preparing Data – DataFrame, showcases the loading of datasets into Spark DataFrames from a variety of sources, while also introducing the Parquet serialization format.

Chapter 4, Data Visualization, introduces Apache Zeppelin for interactive data visualization using Spark SQL and Spark UDFs. We also briefly discuss Bokeh-Scala, which is a Scala port of Bokeh (a highly customizable visualization library).

Chapter 5, Learning from Data, focuses on machine learning using Spark MLlib.

Chapter 6, Scaling Up, walks through various deployment alternatives for Spark applications: standalone, YARN, and Mesos.

Chapter 7, Going Further, briefly introduces Spark Streaming and GraphX.
