官术网_书友最值得收藏!

To get the most out of this book

All the examples have been implemented in Scala with some open source libraries, including Apahe Spark MLlib/ML and Deeplearning4j. However, to get the best out of this, you should have a powerful computer and software stack.

A Linux distribution is preferable (for example, Debian, Ubuntu, or CentOS). For example, for Ubuntu, it is recommended to have at least a 14.04 (LTS) 64-bit complete installation on VMware Workstation Player 12 or VirtualBox. You can run Spark jobs on Windows (7/8/10) or macOS X (10.4.7+) as well.

A computer with a Core i5 processor, enough storage (for example, for running Spark jobs, you'll need at least 50 GB of free disk storage for standalone cluster and for the SQL warehouse), and at least 16 GB RAM are recommended. And optionally, if you want to perform the neural network training on the GPU (for the last chapter only), the NVIDIA GPU driver has to be installed with CUDA and CuDNN configured.

The following APIs and tools are required in order to execute the source code in this book:

  • Java/JDK, version 1.8
  • Scala, version 2.11.8
  • Spark, version 2.2.0 or higher
  • Spark csv_2.11, version 1.3.0
  • ND4j backend version nd4j-cuda-9.0-platform for GPU; otherwise, nd4j-native
  • ND4j, version 1.0.0-alpha
  • DL4j, version 1.0.0-alpha          
  • Datavec, version 1.0.0-alpha
  • Arbiter, version 1.0.0-alpha
  • Eclipse Mars or Luna (latest version) or IntelliJ IDEA
  • Maven Eclipse plugin (2.9 or higher)
  • Maven compiler plugin for Eclipse (2.3.2 or higher)
  • Maven assembly plugin for Eclipse (2.4.1 or higher)
主站蜘蛛池模板: 花莲县| 清远市| 齐河县| 区。| 中宁县| 湘西| 固原市| 丹江口市| 资兴市| 金寨县| 贵州省| 西华县| 富阳市| 买车| 都江堰市| 大余县| 鲜城| 新蔡县| 镇原县| 台州市| 仁布县| 临猗县| 绥滨县| 岳阳市| 科技| 中方县| 平昌县| 康马县| 甘泉县| 昌平区| 称多县| 克拉玛依市| 杭州市| 河曲县| 邹平县| 凭祥市| 临洮县| 牡丹江市| 新泰市| 湘潭县| 冀州市|