官术网_书友最值得收藏!

  • Elasticsearch for Hadoop
  • Vishal Shukla
  • 330字
  • 2021-07-09 21:34:28

Preface

The core components of Hadoop have been around from 2004-2006 as MapReduce. Hadoop's ability to scale and process data in a distributed manner has resulted in its broad acceptance across industries. Very large organizations are able to realize the value that Hadoop brings in: crunching terabytes and petabytes of data, ingesting social data, and utilizing commodity hardware to store huge volume of data. However, big data solutions must fulfill its appetite for the speed, especially when you query across unstructured data.

This book will introduce you to Elasticsearch, a powerful distributed search and analytics engine, which can make sense of your massive data in real time. Its rich querying capabilities can help in performing complex full-text search, geospatial analysis, and detect anomalies in your data. Elasticsearch-Hadoop, also widely known as ES-Hadoop, is a two-way connector between Elasticsearch and Hadoop. It opens the doors to flow your data easily to and from the Hadoop ecosystem and Elasticsearch. It can flow the streaming data from Apache Storm or Apache Spark to Elasticsearch and let you analyze it in real time.

The aim of the book is to give you practical skills on how you can harness the power of Elasticsearch and Hadoop. I will walk you through the step-by-step process of how to discover your data and find interesting insights out of massive amount of data. You will learn how to integrate Elasticsearch seamlessly with Hadoop ecosystem tools, such as Pig, Hive, Cascading, Apache Storm, and Apache Spark. This book will enable you to use Elasticsearch to build your own analytics dashboard. It will also enable you to use powerful analytics and the visualization platform, Kibana, to give different shapes, size, and colors to your data.

I have chosen interesting datasets to give you the real-world data exploration experience. So, you can quickly use these tools and techniques to build your domain-specific solutions. I hope that reading this book turns out to be fun and a great learning experience for you.

主站蜘蛛池模板: 崇文区| 柳林县| 台州市| 鲁山县| 宝山区| 巨鹿县| 阳朔县| 河津市| 县级市| 麻栗坡县| 兴海县| 东兴市| 尖扎县| 长兴县| 阳信县| 视频| 平凉市| 洪雅县| 嘉祥县| 呈贡县| 靖宇县| 周至县| 巴青县| 周至县| 五大连池市| 凉城县| 平原县| 井冈山市| 麦盖提县| 连山| 屏边| 朝阳区| 酉阳| 河北区| 营山县| 邻水| 仁化县| 罗城| 六盘水市| 新竹县| 汉中市|