
Why we are talking about big data now if data has always existed

By the early 2000s, rapid advances in computing and related technologies, such as storage, allowed users to collect and store data with unprecedented efficiency. The internet added further impetus by providing a platform with seemingly unlimited capacity to exchange information on a global scale. Technology advanced at a breathtaking pace and led to major paradigm shifts, powered by tools such as social media, connected devices such as smartphones, and the availability of broadband connections and, by extension, user participation, even in remote parts of the world.

Most of this data consists of information generated by web-based sources, such as social networks like Facebook and video-sharing sites like YouTube. In big data parlance, this is known as unstructured data: data that is not in a fixed format, such as a spreadsheet, and that cannot be easily stored in a traditional database system.

The simultaneous advances in computing capabilities meant that, although data was being generated at a very high rate, it was still computationally feasible to analyze it. Machine learning algorithms that were once considered intractable, because of both the data volume and their algorithmic complexity, could now be run using new paradigms such as cluster or multinode processing, in a much simpler manner than before, when they would have required special-purpose machines.

Chart of data generated per minute. Credit: DOMO Inc.
