
Summary

In this chapter, we revisited the fundamental theory behind data analysis and exploratory data analysis (EDA). Data analysis involves steps such as data requirements, data collection, data processing, data cleaning, exploratory data analysis, modeling and algorithms, data production, and communication, and EDA is one of the most prominent of these steps. It is crucial to identify the type of data under analysis. Different disciplines store different kinds of data for different purposes. For example, medical researchers store patients' data, universities store students' and teachers' data, and the real estate industry stores house and building datasets. A dataset contains many observations about a particular object. Most datasets can be divided into numerical and categorical data. There are four types of data measurement scales: nominal, ordinal, interval, and ratio.
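The distinction between numerical and categorical data, and the four measurement scales, can be sketched with pandas. This is a minimal illustration rather than code from the chapter: the `patients` table, its column names, and its values are all hypothetical, invented only to show one column per scale.

```python
import pandas as pd

# Hypothetical patient records, invented purely for illustration.
patients = pd.DataFrame({
    # Nominal: labels with no inherent order.
    "blood_type": ["A", "O", "B", "O"],
    # Ordinal: labels with a meaningful order but no fixed spacing.
    "pain_level": pd.Categorical(
        ["low", "high", "medium", "low"],
        categories=["low", "medium", "high"],
        ordered=True,
    ),
    # Interval: differences are meaningful, but zero is arbitrary.
    "temperature_c": [36.6, 38.1, 37.2, 36.9],
    # Ratio: a true zero exists, so ratios are meaningful.
    "weight_kg": [70.5, 82.0, 64.3, 90.1],
})

# Split the columns into numerical and categorical groups by dtype.
numerical_columns = patients.select_dtypes(include="number").columns.tolist()
categorical_columns = patients.select_dtypes(exclude="number").columns.tolist()

print("Numerical:", numerical_columns)
print("Categorical:", categorical_columns)

# An ordered categorical supports comparisons such as max().
print("Highest pain level:", patients["pain_level"].max())
```

Note that the dtype alone separates numerical from categorical columns. It cannot tell interval from ratio, or nominal from ordinal, unless the ordering is declared explicitly as it is for `pain_level` here. That judgment still depends on knowing what the data represents.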

Throughout this book, we will use several Python libraries, including NumPy, pandas, SciPy, and Matplotlib, to perform exploratory data analysis ranging from simple to complex. In the next chapter, we will learn about various types of visualization aids for exploratory data analysis.
