官术网_书友最值得收藏!

Preface

This book is a practical guide to realizing the power of analytics in Big Data. It walks the Big Data technologist in you through the process of getting the data ready, applying analytics, and creating a value out of the data. All of this is done using appropriate design patterns in Pig. We have chosen Pig to demonstrate how useful it is, which is evident from the following:

  • The inherent amenability of Pig through its simple language constructs, which can be learned very easily, and its extensibility and applicability to structured and unstructured Big Data makes it the preferred choice over others.
  • The ease and speed with which patterns can be implemented by Pig to derive meaning out of the apparent randomness in any Big Data is commendable.
  • This book guides system architects and developers so they become more proficient at creating complex analytics solutions using Pig. It does so by exposing them to a variety of Pig design patterns, UDFs, tools, and best practices.

By reading this book, you will achieve the following goals:

  • Simplify the process of creating complex data pipelines by performing data movement across platforms, data ingestion, profiling, validation, transformations, data reduction, and egress; you'll also be able to use Pig in these design patterns
  • Create solutions that use patterns for exploratory analysis of multistructured unmodeled data to derive structure from it and move the data to downstream systems for further analysis
  • Decipher how Pig can coexist with other tools in the Hadoop ecosystem to create Big Data solutions using design patterns
主站蜘蛛池模板: 西吉县| 鸡西市| 芜湖县| 土默特右旗| 南川市| 盐池县| 屏边| 房产| 万州区| 丹棱县| 嘉峪关市| 滨州市| 游戏| 武宣县| 延吉市| 富平县| 穆棱市| 苍山县| 和田市| 丰原市| 黔西县| 苏尼特左旗| 鄂尔多斯市| 南康市| 永福县| 靖州| 榆树市| 福清市| 庐江县| 固安县| 容城县| 潍坊市| 池州市| 饶阳县| 阿图什市| 梅州市| 桂平市| 汪清县| 大名县| 故城县| 泌阳县|