- Ensemble Machine Learning Cookbook
- Dipayan Sarkar Vijayalakshmi Natarajan
- 2021-07-02 13:21:54
How it works...
In Step 1, we read the data and described it, which gave us summary statistics for the dataset. In Step 2, we looked at the number of variables of each data type.
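As a minimal sketch of Steps 1 and 2, the snippet below builds a small toy DataFrame (the column names and values are hypothetical stand-ins for the house price data, which is not reproduced here), then uses describe() for summary statistics and dtypes.value_counts() to count columns per data type.

```python
import pandas as pd

# Hypothetical toy data standing in for the house price dataset.
df = pd.DataFrame({
    "LotArea": [8450, 9600, 11250, 9550],
    "OverallQual": [7, 6, 7, 7],
    "Neighborhood": ["CollgCr", "Veenker", "CollgCr", "Crawfor"],
    "SalePrice": [208500, 181500, 223500, 140000],
})

# Summary statistics (count, mean, std, min, quartiles, max) for numeric columns.
summary = df.describe()

# Number of columns for each data type.
dtype_counts = df.dtypes.value_counts()

print(summary)
print(dtype_counts)
```

By default, describe() summarizes only the numeric columns, so Neighborhood does not appear in the summary.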
In Step 3, we created two variables, numerical_features and categorical_features, to hold the names of the numerical and categorical variables respectively. We used these two variables in the later steps, where we worked with numerical and categorical features separately.
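One possible way to build these two lists is with select_dtypes(). This is a sketch under assumptions: the toy DataFrame is hypothetical, and the recipe itself may have derived the lists differently.

```python
import pandas as pd

# Hypothetical toy data standing in for the house price dataset.
df = pd.DataFrame({
    "LotArea": [8450, 9600, 11250, 9550],
    "OverallQual": [7, 6, 7, 7],
    "Neighborhood": ["CollgCr", "Veenker", "CollgCr", "Crawfor"],
    "SalePrice": [208500, 181500, 223500, 140000],
})

# Names of the numeric columns.
numerical_features = df.select_dtypes(include="number").columns.tolist()

# Names of every remaining (non-numeric) column.
categorical_features = df.select_dtypes(exclude="number").columns.tolist()

print(numerical_features)
print(categorical_features)
```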
In Step 4 and Step 5, we used the seaborn library to plot our charts. We also introduced the melt() function from pandas, which reshapes a DataFrame from wide to long format so that it can be fed to the FacetGrid() function of the seaborn library. Here, we showed how you can draw the distribution plots for all the numerical variables in a single pass. We also showed you how to use the same FacetGrid() function to plot the distribution of SalePrice by each categorical variable.
We generated the correlation matrix in Step 6 using the corr() function of the DataFrame object. However, we noticed that with too many variables, the display does not make it easy for you to identify the correlations. In Step 7, we plotted the correlation matrix heatmap by using the heatmap() function from the seaborn library.
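A minimal sketch of Steps 6 and 7, assuming a hypothetical all-numeric toy DataFrame. The annot, cmap, vmin, and vmax arguments are illustrative choices, not necessarily the ones used in the recipe.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import pandas as pd
import seaborn as sns

# Hypothetical numeric toy data standing in for the house price dataset.
df = pd.DataFrame({
    "LotArea": [8450, 9600, 11250, 9550],
    "OverallQual": [7, 6, 7, 7],
    "SalePrice": [208500, 181500, 223500, 140000],
})

# Pairwise Pearson correlations between all numeric columns.
corr_matrix = df.corr()

# Color-coded view of the same matrix; annot=True prints each coefficient in its cell.
ax = sns.heatmap(corr_matrix, annot=True, cmap="coolwarm", vmin=-1, vmax=1)
```

Fixing the color scale to the range -1 to 1 keeps the colors comparable across datasets, which helps when the matrix is too large to read as numbers.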
In Step 8, we used a scatter plot matrix to see how the numerical variables correlate with the sale prices of houses. We generated the scatter plot matrix using the regplot() function from the seaborn library. Note that we passed the parameter fit_reg=False to remove the regression line from the scatter plots.
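The effect of fit_reg=False can be sketched with a plain loop over subplots. This is a simplified layout under assumptions: the toy data and column names are hypothetical, and the recipe may arrange its panels differently.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical numeric toy data standing in for the house price dataset.
df = pd.DataFrame({
    "LotArea": [8450, 9600, 11250, 9550],
    "OverallQual": [7, 6, 7, 7],
    "SalePrice": [208500, 181500, 223500, 140000],
})
predictors = ["LotArea", "OverallQual"]

fig, axes = plt.subplots(1, len(predictors), figsize=(8, 3))

# One scatter plot per predictor against SalePrice.
# fit_reg=False suppresses the fitted regression line, leaving only the points.
for ax, column in zip(axes, predictors):
    sns.regplot(x=column, y="SalePrice", data=df, fit_reg=False, ax=ax)

fig.tight_layout()
```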
In Step 9, we repeated Step 8 to see the relationship of the numerical variables with the sale prices of the houses in a numerical format, instead of scatter plots. We also arranged the output in descending order by applying the [::-1] slice, which reverses the sorted correlation values.
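One way to produce such a descending list is shown below. It is a sketch under assumptions: the toy data is hypothetical, and the exact expression used in the recipe is not reproduced here.

```python
import pandas as pd

# Hypothetical numeric toy data standing in for the house price dataset.
df = pd.DataFrame({
    "LotArea": [8450, 9600, 11250, 9550],
    "OverallQual": [7, 6, 7, 7],
    "SalePrice": [208500, 181500, 223500, 140000],
})

# Correlation of every numeric column with SalePrice.
# sort_values() sorts ascending; the [::-1] slice reverses that into descending order.
correlations = df.corr()["SalePrice"].sort_values()[::-1]

print(correlations)
```

Calling sort_values(ascending=False) gives the same ordering without the slice. SalePrice itself appears first because its correlation with itself is 1.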