官术网_书友最值得收藏!

Chapter 7. Data Analysis Application Examples

In this chapter, we want to get you acquainted with typical data preparation tasks and analysis techniques, because being fluent in preparing, grouping, and reshaping data is an important building block for successful data analysis.

While preparing data seems like a mundane task – and often it is – it is a step we cannot skip, although we can strive to simplify it by using tools such as Pandas.

Why is preparation necessary at all? Because most useful data will come from the real world and will have deficiencies, contain errors or will be fragmentary.

There are more reasons why data preparation is useful: it gets you in close contact with the raw material. Knowing your input helps you to spot potential errors early and build confidence in your results.

Here are a few data preparation scenarios:

  • A client hands you three files, each containing time series data about a single geological phenomenon, but the observed data is recorded on different intervals and uses different separators
  • A machine learning algorithm can only work with numeric data, but your input only contains text labels
  • You are handed the raw logs of a web server of an up and coming service and your task is to make suggestions on a growth strategy, based on existing visitor behavior
主站蜘蛛池模板: 五台县| 桃园市| 江孜县| 体育| 瑞安市| 汝城县| 缙云县| 邢台县| 云和县| 甘德县| 乌什县| 乐东| 静海县| 上高县| 京山县| 高台县| 仙游县| 庆阳市| 土默特左旗| 缙云县| 融水| 靖宇县| 耿马| 始兴县| 绥棱县| 合江县| 沧州市| 乌鲁木齐县| 昆山市| 沅江市| 镇江市| 建宁县| 郴州市| 抚宁县| 富蕴县| 小金县| 夏河县| 忻州市| 平武县| 忻城县| 姚安县|