Chapter 1. What It's All About

This book is about Hadoop, an open source framework for large-scale data processing. Before we get into the details of the technology and its use in later chapters, it is important to spend a little time exploring the trends that led to Hadoop's creation and its enormous success.

Hadoop was not created in a vacuum. It exists because of an explosion in the amount of data being created and consumed, and because of a shift that sees this data deluge reach small startups as well as huge multinationals. At the same time, other trends have changed how software and systems are deployed, with cloud resources used alongside, or even in preference to, more traditional infrastructure.

This chapter explores some of these trends and explains the specific problems Hadoop seeks to solve and the drivers that shaped its design.

In the rest of this chapter we shall:

  • Learn about the big data revolution
  • Understand what Hadoop is and how it can extract value from data
  • Look into cloud computing and understand what Amazon Web Services provides
  • See how powerful the combination of big data processing and cloud computing can be
  • Get an overview of the topics covered in the rest of this book

So let's get on with it!
