- Big Data Analytics
- Venkat Ankam
- 204字
- 2021-08-20 10:32:19
Preface
Big Data Analytics aims at providing the fundamentals of Apache Spark and Hadoop, and how they are integrated together with most commonly used tools and techniques in an easy way. All Spark components (Spark Core, Spark SQL, DataFrames, Datasets, Conventional Streaming, Structured Streaming, MLLib, GraphX, and Hadoop core components), HDFS, MapReduce, and Yarn are explored in great depth with implementation examples on Spark + Hadoop clusters.
The Big Data Analytics industry is moving away from MapReduce to Spark. So, the advantages of Spark over MapReduce are explained in great depth to reap the benefits of in-memory speeds. The DataFrames API, the Data Sources API, and the new Dataset API are explained for building Big Data analytical applications. Real-time data analytics using Spark Streaming with Apache Kafka and HBase is covered to help in building streaming applications. New structured streaming concept is explained with an Internet of Things (IOT) use case. Machine learning techniques are covered using MLLib, ML Pipelines and SparkR; Graph Analytics are covered with GraphX and GraphFrames components of Spark.
This book also introduces web based notebooks such as Jupyter, Apache Zeppelin, and data flow tool Apache NiFi to analyze and visualize data, offering Spark as a Service using Livy Server.
- 程序員修煉之道:程序設(shè)計(jì)入門(mén)30講
- 程序員面試白皮書(shū)
- LabVIEW 2018 虛擬儀器程序設(shè)計(jì)
- Learning Real-time Processing with Spark Streaming
- C語(yǔ)言程序設(shè)計(jì)(第3版)
- Visual FoxPro 程序設(shè)計(jì)
- 大學(xué)計(jì)算機(jī)基礎(chǔ)(第2版)(微課版)
- Yii Project Blueprints
- OpenCV 4計(jì)算機(jī)視覺(jué)項(xiàng)目實(shí)戰(zhàn)(原書(shū)第2版)
- C和C++游戲趣味編程
- C#程序設(shè)計(jì)(項(xiàng)目教學(xué)版)
- Unity 3D/2D移動(dòng)開(kāi)發(fā)實(shí)戰(zhàn)教程
- GameMaker Essentials
- 深度學(xué)習(xí)原理與PyTorch實(shí)戰(zhàn)(第2版)
- App Inventor少兒趣味編程動(dòng)手做