官术网_书友最值得收藏!

Big Data

"We don't have better algorithms, We just have more data."

- Peter Norvig, Research Director, Google

Data in dictionary terms is defined as facts and statistics collected together for reference or analysis. Storage mechanisms have greatly evolved with human evolution—sculptures, handwritten texts on leaves, punch cards, magnetic tapes, hard drives, floppy disks, CDs, DVDs, SSDs, human DNA, and more. With each new medium, we are able to store more and more data in less space; it's a transition in the right direction. With the advent of the internet and the Internet of Things (IoT), data volumes have been growing exponentially.

Data volumes are exploding; more data has been created in the past two years than in the entire history of the human race.

The term Big Data was coined to represent growing volumes of data. Along with volume, the term also incorporates three more attributes, velocity, variety, and value, as follows:

  • Volume: This represents the ever increasing and exponentially growing amount of data. We are now collecting data through more and more interfaces between man-made and natural objects. For example, a patient's routine visit to a clinic now generates electronic data in the tune of megabytes. An average smartphone user generates a data footprint of at least a few GB per day. A flight traveling from one point to another generates half a terabyte of data.
  • Velocity: This represents the amount of data generated with respect to time and a need to analyze that data in near-real time for some mission critical operations. There are sensors that collect data from natural phenomenon, and the data is then processed to predict hurricanes/earthquakes. Healthcare is a great example of the velocity of the data generation; analysis and action is mission critical:

  • Variety: This represents variety in data formats. Historically, most electronic datasets were structured and fit into database tables (columns and rows). However, more than 80% of the electronic data we now generate is not in structured format, for example, images, video files, and voice data files. With Big Data, we are in a position to analyze the vast majority of structured/unstructured and semi-structured datasets. 
  • Value: This is the most important aspect of Big Data. The data is only as valuable as its utilization in the generation of actionable insight. Remember the results pyramid where actions lead to results. There is no disagreement that data holds the key to actionable insight; however, systems need to evolve quickly to be able to analyze the data, understand the patterns within the data, and, based on the contextual details, provide solutions that ultimately create value.
主站蜘蛛池模板: 双流县| 武定县| 南投县| 满洲里市| 石家庄市| 贺州市| 柳河县| 贵定县| 怀柔区| 龙游县| 东至县| 望奎县| 岑巩县| 祥云县| 平利县| 延长县| 额济纳旗| 太仆寺旗| 乌什县| 施甸县| 松滋市| 黄冈市| 开化县| 化隆| 鸡东县| 霍林郭勒市| 岳池县| 肥城市| 兴化市| 西吉县| 泾阳县| 上虞市| 青海省| 会东县| 吴江市| 杂多县| 商水县| 富裕县| 奇台县| 巴彦县| 乐昌市|