官术网_书友最值得收藏!

Preface

Data is a collection of discrete objects, events, and facts in the form of numbers, text, pictures, videos, objects, audio, and other entities. Processing data provides a great deal of information. But the million-dollar question is—how do we get meaningful information from data? The answer to this question is Exploratory Data Analysis (EDA), which is the process of investigating datasets, elucidating subjects, and visualizing outcomes. EDA is an approach to data analysis that applies a variety of techniques to maximize specific insights into a dataset, reveal an underlying structure, extract significant variables, detect outliers and anomalies, test assumptions, develop models, and determine best parameters for future estimations. This book, Hands-On Exploratory Data Analysis with Python, aims to provide practical knowledge about the main pillars of EDA, including data cleansing, data preparation, data exploration, and data visualization. Why visualization? Well, several research studies have shown that portraying data in graphical form makes complex statistical data analyses and business intelligence more marketable. 

You will get the opportunity to explore open source datasets including healthcare datasets, demographics datasets, a Titanic dataset, a wine quality dataset, automobile datasets, a Boston housing pricing dataset, and many others. Using these real-life datasets, you will get hands-on practice in understanding data, summarize data's characteristics, and visualizing data for business intelligence purposes. This book expects you to use pandas, a powerful library for working with data, and other core Python libraries including NumPy, scikit-learn, SciPyStatsModels for regression, and Matplotlib for visualization.

主站蜘蛛池模板: 新昌县| 房山区| 米泉市| 全南县| 正蓝旗| 长海县| 韩城市| 嘉定区| 班戈县| 武冈市| 贵港市| 梁山县| 金乡县| 阳江市| 北川| 长武县| 汶上县| 芒康县| 平和县| 津南区| 柳江县| 南宁市| 黑水县| 松原市| 保山市| 叶城县| 白玉县| 雷波县| 德钦县| 建平县| 东兴市| 会昌县| 嘉兴市| 纳雍县| 拜城县| 望江县| 和静县| 房产| 泸溪县| 八宿县| 钦州市|