- Hands-On Deep Learning with Apache Spark
- Guglielmo Iozzia
- 160字
- 2021-07-02 13:34:20
The Apache Spark Ecosystem
Apache Spark (http://spark.apache.org/) is an open source, fast cluster-computing platform. It was originally created by AMPLab at the University of California, Berkeley. Its source code was later donated to the Apache Software Foundation (https://www.apache.org/). Spark comes with a very fast computation speed because data is loaded into distributed memory (RAM) across a cluster of machines. Not only can data be quickly transformed, but also cached on demand for a variety of use cases. Compared to Hadoop MapReduce, it runs programs up to 100 times faster when the data fits in memory, or 10 times faster on disk. Spark provides support for four programming languages: Java, Scala, Python, and R. This book covers the Spark APIs (and deep learning frameworks) for Scala (https://www.scala-lang.org/) and Python (https://www.python.org/) only.
This chapter will cover the following topics:
- Apache Spark fundamentals
- Getting Spark
- Resilient Distributed Dataset (RDD) programming
- Spark SQL, Datasets, and DataFrames
- Spark Streaming
- Cluster mode using a different manager
- 自動控制工程設(shè)計入門
- 計算機(jī)應(yīng)用
- 中文版Photoshop CS5數(shù)碼照片處理完全自學(xué)一本通
- 圖形圖像處理(Photoshop)
- Visual Basic從初學(xué)到精通
- AutoCAD 2012中文版繪圖設(shè)計高手速成
- 網(wǎng)絡(luò)安全與防護(hù)
- 水下無線傳感器網(wǎng)絡(luò)的通信與決策技術(shù)
- 走近大數(shù)據(jù)
- 基于企業(yè)網(wǎng)站的顧客感知服務(wù)質(zhì)量評價理論模型與實證研究
- C#求職寶典
- Visual Basic項目開發(fā)案例精粹
- PHP求職寶典
- 計算機(jī)辦公應(yīng)用培訓(xùn)教程
- 系統(tǒng)安裝、維護(hù)與數(shù)據(jù)備份技巧