- Python:Advanced Predictive Analytics
- Ashish Kumar Joseph Babcock
- 203字
- 2021-07-02 20:09:26
Chapter 4. Statistical Concepts for Predictive Modelling
There are a few statistical concepts, such as hypothesis testing, p-values, normal distribution, correlation, and so on without which grasping the concepts and interpreting the results of predictive models becomes very difficult. Thus, it is very critical to understand these concepts, before we delve into the realm of predictive modelling.
In this chapter, we will be going through and learning these statistical concepts so that we can use them in the upcoming chapters. This chapter will cover the following topics:
- Random sampling and central limit theorem: Understanding the concept of random sampling through an example and illustrating the central limit theorem's application through an example. These two concepts form the backbone of hypothesis testing.
- Hypothesis testing: Understanding the meaning of the terms, such as null hypothesis, alternate hypothesis, confidence intervals, p-value, significance level, and so on. A step-by-step guide to implement a hypothesis test, followed by an example.
- Chi-square testing: Calculation of chi-square statistic. A description of usage of chi-square tests with a couple of examples.
- Correlation: The meaning and significance of correlations between two variables, the meaning and significance of correlation coefficients and calculating and visualizing the correlation between variables of a dataset.
推薦閱讀
- Greenplum:從大數(shù)據(jù)戰(zhàn)略到實(shí)現(xiàn)
- Hands-On Machine Learning with Microsoft Excel 2019
- MongoDB管理與開發(fā)精要
- Redis應(yīng)用實(shí)例
- R數(shù)據(jù)科學(xué)實(shí)戰(zhàn):工具詳解與案例分析(鮮讀版)
- 數(shù)據(jù)庫系統(tǒng)原理及應(yīng)用教程(第4版)
- 數(shù)據(jù)驅(qū)動設(shè)計(jì):A/B測試提升用戶體驗(yàn)
- 數(shù)亦有道:Python數(shù)據(jù)科學(xué)指南
- 網(wǎng)站數(shù)據(jù)庫技術(shù)
- 基于OPAC日志的高校圖書館用戶信息需求與檢索行為研究
- IPython Interactive Computing and Visualization Cookbook(Second Edition)
- 貫通SQL Server 2008數(shù)據(jù)庫系統(tǒng)開發(fā)
- 聯(lián)動Oracle:設(shè)計(jì)思想、架構(gòu)實(shí)現(xiàn)與AWR報(bào)告
- 數(shù)據(jù)庫與數(shù)據(jù)處理:Access 2010實(shí)現(xiàn)
- Oracle高性能SQL引擎剖析:SQL優(yōu)化與調(diào)優(yōu)機(jī)制詳解