- Learning Spark SQL
- Aurobindo Sarkar
- 2021-07-02 18:23:45
Defining and using custom data sources in Spark
You can define your own data sources and combine their data with data from more standard sources (for example, relational databases and Parquet files). In Chapter 5, Using Spark SQL in Streaming Applications, we define a custom data source for streaming data from the public APIs available on the Transport for London (TfL) site.
For a good example of defining a data source for Jira and creating a Spark SQL DataFrame from it, refer to the video Spark DataFrames: Simple and Fast Analysis of Structured Data - Michael Armbrust (Databricks) at https://www.youtube.com/watch?v=xWkJCUcD55w.
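To make the idea concrete, here is a minimal sketch of a custom data source and of combining its rows with another DataFrame. It is not the TfL or Jira source mentioned above. It assumes Spark's Data Source V1 interfaces (`RelationProvider`, `BaseRelation`, and `TableScan` in `org.apache.spark.sql.sources`) and a compiled application rather than the shell. The package name `example.counter`, the `CounterRelation` class, the `count` option, and the `owners` DataFrame are all hypothetical names chosen for illustration.

```scala
package example.counter

import org.apache.spark.rdd.RDD
import org.apache.spark.sql.{Row, SQLContext, SparkSession}
import org.apache.spark.sql.sources.{BaseRelation, RelationProvider, TableScan}
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}

// Spark resolves format("example.counter") to the class
// example.counter.DefaultSource, so the class name matters here.
class DefaultSource extends RelationProvider {
  override def createRelation(
      sqlContext: SQLContext,
      parameters: Map[String, String]): BaseRelation = {
    // "count" is a hypothetical option; it defaults to 3 rows.
    val count = parameters.getOrElse("count", "3").toInt
    new CounterRelation(count)(sqlContext)
  }
}

// A relation that produces `count` synthetic rows. A real source would
// read from an external system (an HTTP API, a file format, and so on).
class CounterRelation(count: Int)(@transient val sqlContext: SQLContext)
    extends BaseRelation with TableScan {

  // The schema Spark SQL will expose for this source.
  override def schema: StructType = StructType(Seq(
    StructField("id", IntegerType, nullable = false),
    StructField("label", StringType, nullable = false)))

  // A full scan: every row, every column.
  override def buildScan(): RDD[Row] =
    sqlContext.sparkContext
      .parallelize(1 to count)
      .map(i => Row(i, s"item-$i"))
}

object Demo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("custom-source-sketch")
      .getOrCreate()
    import spark.implicits._

    // Load a DataFrame from the custom source.
    val items = spark.read
      .format("example.counter")
      .option("count", "5")
      .load()

    // Stand-in for data from a more standard source (for example, a
    // Parquet file or a JDBC table).
    val owners = Seq((1, "alice"), (2, "bob")).toDF("id", "owner")

    // Combine the two with an ordinary DataFrame join.
    items.join(owners, Seq("id"), "left").orderBy("id").show()

    spark.stop()
  }
}
```

Once loaded, the DataFrame from the custom source behaves like any other, which is why the join needs no special handling. For larger sources, V1 also offers the `PrunedScan` and `PrunedFilteredScan` traits so that Spark can push column selection and filters down to the source.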