Summary
In this chapter, we discussed the non-functional requirements for data storage solutions. A data lake, which is an evolution of the data warehouse, consists of multiple layers, each with its own requirements and therefore its own technology choices. We discussed the key requirements for a raw data store, where primarily flat files need to be stored in a robust way; for a historical database, where temporal information is saved; and for analytics data stores, where fast querying is necessary. We also explained the requirements for a streaming data engine and for a model development environment. In all cases, requirements management is an ongoing process in an AI project. Rather than setting all the requirements in stone at the start of the project, architects and developers should stay agile, revisiting and revising the requirements after every iteration.
In the next chapter, we will connect the layers of the architecture we have explored in this chapter by creating a data processing pipeline that transforms data from the raw data layer to the historical data layer and to the analytics layer. We will do this to ensure that all the data has been prepared for use in machine learning models. We will also cover data preparation for streaming data scenarios.