- Hands-On Big Data Modeling
- James Lee Tao Wei Suresh Kumar Mukhiya
- 242字
- 2021-06-10 18:58:48
Data ingestion
Data ingestion refers to the process of procuring data into the system. This can be done via manual, semi-automatic, or automatic methods.
In a smaller system, users prefer to have some kind of web form or visual interface that takes input in order to put the data into the system. However, when it comes to a larger system, such as a hospital management system, an airline management system, a government and public record management system, or a social media site, users often prefer to automate the data ingestion process as much as possible. So, when it comes to data ingestion, we need to explore a bunch of questions, such as the following:
- How many data sources are there?
- How many large data items are available?
- Will the number of data sources grow over time?
- What is the rate at which data will be consumed?
It is quite important to note that the size of an individual record is small, but the volume of data is quite enormous. When it comes to data ingestion, developers like to create a bunch of policies, called ingestion policies, that guide the handling of errors during the data ingestion, as well as the data incompleteness, and so on. Data ingestion (along with its policies) is an integral part of a big data system.
- 大數(shù)據(jù)導(dǎo)論:思維、技術(shù)與應(yīng)用
- Hands-On Deep Learning with Apache Spark
- 輕松學(xué)Java Web開(kāi)發(fā)
- 流處理器研究與設(shè)計(jì)
- JSF2和RichFaces4使用指南
- Blender Compositing and Post Processing
- Java Web整合開(kāi)發(fā)全程指南
- Kubernetes for Developers
- 水下無(wú)線(xiàn)傳感器網(wǎng)絡(luò)的通信與決策技術(shù)
- AVR單片機(jī)工程師是怎樣煉成的
- 典型Hadoop云計(jì)算
- Xilinx FPGA高級(jí)設(shè)計(jì)及應(yīng)用
- 空間機(jī)器人智能感知技術(shù)
- Hands-On Business Intelligence with Qlik Sense
- Practical AWS Networking