This book is designed for data architects, developers, managers, and business users who want to modernize their data architectures leveraging the HDInsight distribution of Hadoop. It guides you through the business values of big data, the main points of current EDW (Enterprise Data Warehouse), steps for building the next generation Data Lake, and development tools with real life examples.
The book explains the journey to a Data Lake with a modular approach for ingesting, transforming, and reporting on a Data Lake leveraging HDInsight platform and Excel for powerful analysis and reporting.