Summary

In this chapter, we skimmed through the basic concepts of statistics. Here is a brief summary of the concepts we learned:

  • Hypothesis testing is used to test the statistical significance of a hypothesis. The hypothesis that already exists or is assumed to be true is the null hypothesis; the one that is uncertain or is being proposed as an alternative premise is the alternative hypothesis.
  • To conduct the test, one calculates a test statistic and the associated p-value. The null hypothesis is rejected when the p-value falls below the chosen significance level.
  • Hypothesis testing (p-values) is used to test the significance of the estimates of the coefficients calculated by the model.
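  • The chi-square test is used to test the association between a categorical predictor and the output variable. It indicates association, not causation. It can also be used to check whether observed data is consistent with an expected distribution, that is, whether the data is fair or fake.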
  • The correlation coefficient ranges from -1 to 1. The closer it is to either extreme, the stronger the linear relationship between the two variables; a value near 0 indicates little or no linear relationship.
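
The points above can be sketched in a few lines of Python. This is a minimal illustration that uses only the standard library, not the chapter's own code: the function names (`pearson_r`, `z_test_p_value`, `chi_square_2x2`) and all of the numbers are made up for the example, the z-test assumes a known population standard deviation, and the chi-square helper handles only a 2x2 table (one degree of freedom).

```python
import math
from statistics import NormalDist, mean


def pearson_r(x, y):
    """Pearson correlation coefficient; the result lies between -1 and 1."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def z_test_p_value(sample, mu0, sigma):
    """Two-sided z-test of H0: population mean == mu0, with known sigma."""
    z = (mean(sample) - mu0) / (sigma / math.sqrt(len(sample)))
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


def chi_square_2x2(table):
    """Chi-square statistic and p-value for a 2x2 contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            stat += (table[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, P(chi2 > stat) equals erfc(sqrt(stat / 2)).
    p = math.erfc(math.sqrt(stat / 2))
    return stat, p


# Hypothetical data, chosen only to exercise the functions.
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 70]
print("correlation:", pearson_r(hours, scores))

z, p = z_test_p_value([5.1, 4.9, 5.6, 5.8, 5.4], mu0=5.0, sigma=0.5)
print("z statistic:", z, "p-value:", p)

stat, p = chi_square_2x2([[20, 5], [5, 20]])
print("chi-square statistic:", stat, "p-value:", p)
```

In each test, a small p-value (commonly below 0.05) is evidence against the null hypothesis. For tables larger than 2x2 or for small expected counts, a statistics library would be a better fit than this hand-rolled helper.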

Linear regression belongs to the family of supervised algorithms, so called because the dataset on which they are built has an output variable. In a sense, this output variable governs, or supervises, the development of the model, hence the name. More on this is covered in the next chapter.
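
As a small preview of what "supervised" means in practice, the sketch below fits a straight line by ordinary least squares. It is only an illustration under stated assumptions: `fit_line` is a hypothetical helper, the data is made up, and there is a single predictor. The point is that the fit cannot be computed without the known output values `y`.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for y ≈ intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    intercept = my - slope * mx
    return slope, intercept


# Made-up data: x is the predictor, y is the known output that supervises the fit.
x = [1.0, 2.0, 3.0, 4.0]
y = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(x, y)
print("slope:", slope, "intercept:", intercept)
```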
