官术网_书友最值得收藏!

Introduction to big data modeling

Having a good idea of what big data and its characteristics are, let's now dig into what big data modeling is. Say we have the dataset, which we classify as big data, and before doing any analysis on the dataset, we need to have an idea of how the data looks. The goal of data modeling is to formally explore the nature of data so that you can figure out what kind of storage you need, and what kind of processing you can do on it.

Data modeling is a technique that helps to give meaningful insight into data by defining and categorizing it, and establishing official definitions and descriptors so that the data can be utilized by all information systems in a company.

We can hold at least two primary reasons for performing data modeling:

  • Strategic data modeling facilitates the overall information systems development strategy
  • Data modeling can help in the development of new databases

The data modeling for strategic outlining suggests defining what kind of data you will need for your company processes, while modeling in the context of analysis is more focused on representing data that exists and finding ways to classify it. In the case of big data, that process probably requires finding similarities between data from disparate sources and confirming that they, in fact, describe the same thing. In either case, the end goal is to generate a representation of your data that can be replicated in your database architecture.

主站蜘蛛池模板: 通城县| 郑州市| 汉沽区| 云安县| 称多县| 康定县| 黔江区| 无极县| 贵州省| 五寨县| 信丰县| 五峰| 海原县| 嫩江县| 同江市| 奉新县| 石河子市| 大渡口区| 克东县| 漯河市| 赣榆县| 都江堰市| 桃源县| 宁城县| 岐山县| 南京市| 瑞丽市| 南宫市| 高青县| 福建省| 友谊县| 岫岩| 云和县| 玉门市| 建昌县| 古浪县| 天全县| 县级市| 延川县| 麻城市| 铁岭市|