Preface

The JVM has become a clear winner in the race between different approaches to scalable data analysis. The power of the JVM, strong typing, simplicity of code, composability, and the availability of highly abstracted distributed computing and machine learning frameworks make Scala a clear contender for the top position in large-scale data analysis. Thanks to its dynamic-looking yet static type system, scientists and programmers coming from a Python background feel at ease with Scala.

This book aims to provide easy-to-use recipes in Apache Spark, a massively scalable distributed computation framework, and Breeze, a linear algebra library on which Spark's machine learning toolkit is built. The book will also help you explore data using interactive visualizations in Apache Zeppelin.

Beyond the handful of frameworks and libraries that we will see in this book, a host of other popular data analysis libraries and frameworks are available for Scala. They are by no means lesser beasts, and they could well fit our use cases. Unfortunately, they aren't covered in this book.
