官术网_书友最值得收藏!

Introduction

Various statistical distributions have been invented, which are the equivalent of the wheel for data analysts. Just as whatever I think of comes out differently in print, data in our world doesn't follow strict mathematical laws. Nevertheless, after visualizing our data, we can see that the data follows (to certain extent) a distribution. Even without visualization, we can find a candidate distribution using rules of thumb. The next step is to try to fit the data to a known distribution. If the data is very complex, possibly due to a high number of variables, it is useful to estimate its kernel density (also useful with one variable). In all scenarios, it is good to estimate the confidence intervals or p-values of our results. When we have at least two variables, it is sometimes appropriate to have a look at the correlation between variables. In this chapter, we will apply three types of correlation.

主站蜘蛛池模板: 启东市| 无棣县| 维西| 鸡东县| 琼结县| 台山市| 辽源市| 荆门市| 天门市| 台南县| 周口市| 红桥区| 大悟县| 永仁县| 自贡市| 涪陵区| 美姑县| 弋阳县| 灌阳县| 黄平县| 进贤县| 河西区| 闻喜县| 报价| 富源县| 福海县| 瑞昌市| 隆化县| 湖南省| 通许县| 通道| 台山市| 淄博市| 临泉县| 浏阳市| 成安县| 子长县| 永川市| 徐闻县| 浑源县| 襄城县|