- Mastering Spark for Data Science
- Andrew Morgan Antoine Amend David George Matthew Hallett
- 86字
- 2021-07-09 18:49:33
Summary
In this chapter, we walked through the full setup of an Apache NiFi GDELT ingest pipeline, complete with metadata forks and a brief introduction to visualizing the resulting data. This section is particularly important as GDELT is used extensively throughout the book and the NiFi method is a highly effective way to source data in a scalable and modular way.
In the next chapter, we will get to grips with what to do with the data once it's landed, by looking at schemas and formats.
推薦閱讀
- Excel 2007函數(shù)與公式自學(xué)寶典
- Verilog HDL數(shù)字系統(tǒng)設(shè)計入門與應(yīng)用實(shí)例
- Natural Language Processing Fundamentals
- UTM(統(tǒng)一威脅管理)技術(shù)概論
- VMware Performance and Capacity Management(Second Edition)
- Mastering Elastic Stack
- 嵌入式操作系統(tǒng)原理及應(yīng)用
- 從零開始學(xué)JavaScript
- 智能鼠原理與制作(進(jìn)階篇)
- Ansible 2 Cloud Automation Cookbook
- C#求職寶典
- Hands-On Deep Learning with Go
- C#編程兵書
- 數(shù)據(jù)清洗
- Flash CS3動畫制作