Preface

Apache Spark is a flexible in-memory framework for distributed processing of both batch and real-time data. Its unified engine has made it popular for big data use cases.

This book will help you get started quickly with Apache Spark 2.x and write efficient big data applications for a variety of use cases. You will get to grips with the core concepts of Apache Spark as well as its low-level details, and see how they can be used to solve big data problems. You will be introduced to the RDD and DataFrame APIs and their corresponding transformations and actions.

This book will also help you learn Spark's components for machine learning, stream processing, and graph analysis. Toward the end of the book, you'll learn different optimization techniques for writing efficient Spark code.
