官术网_书友最值得收藏!

Chapter 1. Getting Ready with Predictive Analytics

Analytics, predictive analytics, and data visualization are trendy topics today. The reasons are:

  • Today a lot of internal and external data is available
  • Technology to use this data has evolved a lot
  • It is commonly accepted that there is a lot of value that can be extracted from data

As in many trendy topics, there is a lot of confusion around it. In this chapter, we will cover the following concepts:

  • Introducing the key concepts of the book and tools we're going to use
  • Defining analytics, predictive analytics, and data visualization
  • Explaining the purpose of this book and the methodology we'll follow
  • Covering the installation of the environment we'll use to create examples of applications in each chapter

After this chapter, we'll learn how to use our data to make predictions that will add value to our organizations. Before starting a data project, you always need to understand how your project will add value to the organization. In an analytics project, the two main sources of value are cost reduction and revenue increase. When you're working on a fraud detection project, your objective is to reduce fraud; this will lead into a cost reduction that will improve the margin of the organization. Finally, to understand the value of your data solution, you need to evaluate the cost of your solution. The real value added to an organization is the difference between the provided value and the total cost.

Working with data to create predictive solutions sounds very glamorous, but before that we'll learn how to use Rattle to load data, to avoid some problems related to the quality of the data, and to explore it. Rattle is a tool for statisticians, and, sometimes, we need a tool that provides us with a business approach to data exploration. We'll learn how to use Qlik Sense Desktop to do this.

After learning how to explore and understand data, we'll now learn how to create predictive systems. We'll pide these systems into unsupervised learning methods and supervised learning methods. We'll explain the difference later in this book.

To achieve a better understanding, in this book we'll create three different solutions using the most common predictive techniques: Clustering, Decision Trees, and Linear Regression.

To present data to the user, we need to create an application that helps the user to understand the data and take decisions; for this reason we'll look at the basics of data visualization. Data Visualization, Predictive Analytics and most of the topics of this book are huge knowledge areas. In this book we'll introduce you to these topics and at the end of each chapter you will find a section called Further learning where you will find references to continue learning.

主站蜘蛛池模板: 濉溪县| 武清区| 韶关市| 丰都县| 开封市| 西乡县| 交城县| 博罗县| 河曲县| 沅陵县| 舟曲县| 陆川县| 咸阳市| 文化| 尼勒克县| 微山县| 灵丘县| 崇阳县| 长寿区| 潼关县| 马山县| 花垣县| 通山县| 文成县| 民权县| 虹口区| 新邵县| 临沭县| 城步| 清流县| 浠水县| 盖州市| 尼玛县| 六安市| 敦化市| 彰武县| 邹城市| 平顺县| 双流县| 本溪| 泰顺县|