書名: Hands-On Big Data Analytics with PySpark作者名: Rudy Lai Bart?omiej Potaczek本章字數: 56字更新時間: 2021-06-24 15:52:34
Loading data on to Spark RDDs
In this section, we are going to look at loading data on to Spark RDDs, and will cover the following topics:
- The UCI machine learning data repository
- Getting data from the repository to Python
- Getting data into Spark
Let's start with an overview of the UCI machine learning data repository.