官术网_书友最值得收藏!

Apache Storm

Apache Storm has emerged as the platform of choice for industry leaders to develop distributed, real-time, data processing platforms. It provides a set of primitives that can be used to develop applications that can process a very large amount of data in real time in a highly scalable manner.

Storm is to real-time processing what Hadoop is to batch processing. It is open source software, and managed by Apache Software Foundation. It has been deployed to meet real-time processing needs by companies such as Twitter, Yahoo!, and Flipboard. Storm was first developed by Nathan Marz at BackType, a company that provided social search applications. Later, BackType was acquired by Twitter, and it is a critical part of their infrastructure. Storm can be used for the following use cases:

  • Stream processing: Storm is used to process a stream of data and update a variety of databases in real time. This processing occurs in real time and the processing speed needs to match the input data speed.
  • Continuous computation: Storm can do continuous computation on data streams and stream the results to clients in real time. This might require processing each message as it comes in or creating small batches over a short time. An example of continuous computation is streaming trending topics on Twitter into browsers.
  • Distributed RPC: Storm can parallelize an intense query so that you can compute it in real time.
  • Real-time analytics: Storm can analyze and respond to data that comes from different data sources as they happen in real time.

In this chapter, we will cover the following topics:

  • What is a Storm?
  • Features of Storm
  • Architecture and components of a Storm cluster
  • Terminologies of Storm
  • Programming language
  • Operation modes
主站蜘蛛池模板: 淳化县| 阿城市| 勃利县| 漯河市| 利津县| 合作市| 宜都市| 高邑县| 普格县| 赤水市| 揭阳市| 怀仁县| 拜泉县| 墨竹工卡县| 洪泽县| 苍南县| 正定县| 乐都县| 修武县| 繁峙县| 确山县| 蚌埠市| 泰兴市| 永年县| 岑溪市| 偃师市| 华池县| 曲松县| 广安市| 富川| 凤庆县| 武汉市| 沁源县| 哈尔滨市| 肥西县| 鄢陵县| 沧源| 丰都县| 敦化市| 睢宁县| 青川县|