
How to handle big data

Now that we are convinced the relational model is not a good fit for big data, let's look at ways to handle it. The following approaches paved the way for the various NoSQL databases:

  • Clustering: Spread the data across the nodes of a cluster, so that different nodes hold different subsets of the data, and replicate each subset on multiple nodes so that the cluster can survive node failures. Spreading the data improves performance, and replicating it provides fault tolerance.

A node is an instance of database software running on a server. Multiple instances of the same database could be running on the same server.

  • Flexible schema: Unlike in the relational model, schemas should be flexible and should evolve with the data.
  • Relax consistency: We should embrace the concept of eventual consistency, which means that data will eventually be propagated to all the nodes in the cluster (in the case of replication). Eventual consistency allows data to be replicated across nodes with minimal overhead. This allows for fast writes without the need for distributed locking.
  • Denormalization of data: Denormalize data to optimize queries. The cost is that multiple copies of the same data have to be written and maintained.
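
The clustering idea from the list above can be made concrete with a small sketch. It is only an illustration under simplifying assumptions, not how any particular NoSQL database is implemented: the names `replica_nodes` and `Cluster`, the node names, and the hash-modulo placement rule are all hypothetical, and real systems typically use more elaborate schemes (for example, consistent hashing and repair of replicas that missed a write). The sketch shows each key being stored on a subset of nodes, copied `replication_factor` times, and still readable when one of those nodes is down.

```python
import hashlib


def replica_nodes(key, nodes, replication_factor=2):
    """Return the nodes that should hold copies of `key` (hypothetical placement rule)."""
    if not 1 <= replication_factor <= len(nodes):
        raise ValueError("replication_factor must be between 1 and len(nodes)")
    # Use a stable hash so the same key always maps to the same starting node.
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    start = int(digest, 16) % len(nodes)
    # The first copy goes to `start`; further copies go to the following nodes.
    return [nodes[(start + i) % len(nodes)] for i in range(replication_factor)]


class Cluster:
    """Toy in-memory cluster: each node is just a dict of key -> value."""

    def __init__(self, node_names, replication_factor=2):
        self.node_names = list(node_names)
        self.replication_factor = replication_factor
        self.storage = {name: {} for name in self.node_names}
        self.down = set()  # names of nodes that are currently unavailable

    def write(self, key, value):
        # Write to every available replica. Assumption: replicas that are
        # down simply miss the write; this sketch has no repair mechanism.
        for name in replica_nodes(key, self.node_names, self.replication_factor):
            if name not in self.down:
                self.storage[name][key] = value

    def read(self, key):
        # Return the value from the first available replica that has the key.
        for name in replica_nodes(key, self.node_names, self.replication_factor):
            if name not in self.down and key in self.storage[name]:
                return self.storage[name][key]
        raise KeyError(key)


cluster = Cluster(["node-a", "node-b", "node-c"], replication_factor=2)
cluster.write("user:42", {"name": "Ada"})

# Simulate the failure of the first replica; the read falls back to the second.
first_replica = replica_nodes("user:42", cluster.node_names, 2)[0]
cluster.down.add(first_replica)
print(cluster.read("user:42"))
```

With three nodes and a replication factor of 2, every key lives on two of the three nodes, so each node holds only part of the data and any single node can fail without making a key unreadable.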