- R Programming By Example
- Omar Trejo Navarro
- 256字
- 2021-07-02 21:30:44
Checking model assumptions
Linear models, as with any kind of models, require that we check their assumptions to justify their application. The accuracy and interpretability of the results comes from adhering to a model's assumptions. Sometimes these will be rigorous assumptions in the sense that if they are not strictly met, then the model is not considered to be valid at all. Other times, we will be working with more flexible assumptions in which a degree of criteria from the analyst will come into play.
For those of you interested, a great article about models' assumptions is David Robinson's, K-means clustering is not free lunch, 2015 (http://varianceexplained.org/r/kmeans-free-lunch/).
For linear models, the following are some of the core assumptions:
- Linearity: There is a linear relation among the variables
- Normality: Residuals are normally distributed
- Homoscedasticity: Residuals have constant variance
- No collinearity: Variables are not linear combinations of each other
- Independence: Residuals are independent or at least not correlated
We will show how to briefly check four of the them: linearity, normality, homoscedasticity, and no collinearity. We should mention that the independence assumption is probably the most difficult assumption to test, and you can generally handle it with common sense and understanding how the data was collected. We will not get into that here as it's more in the statistics side of things and we want to keep the book focused on programming techniques. For the statistically-interested reader, we recommend looking at Jeffrey M. Wooldridge's, Introductory Econometrics, 2013 and Joshua D. Angrist and Jorn-Steffen Pischke's, Mostly Harmless Econometrics, 2008.
- 智能傳感器技術(shù)與應(yīng)用
- 21天學(xué)通JavaScript
- 計(jì)算機(jī)圖形學(xué)
- 并行數(shù)據(jù)挖掘及性能優(yōu)化:關(guān)聯(lián)規(guī)則與數(shù)據(jù)相關(guān)性分析
- Getting Started with MariaDB
- Mastering Salesforce CRM Administration
- 數(shù)據(jù)庫(kù)原理與應(yīng)用技術(shù)學(xué)習(xí)指導(dǎo)
- 流處理器研究與設(shè)計(jì)
- 電氣控制與PLC技術(shù)應(yīng)用
- Dreamweaver CS6精彩網(wǎng)頁(yè)制作與網(wǎng)站建設(shè)
- 統(tǒng)計(jì)挖掘與機(jī)器學(xué)習(xí):大數(shù)據(jù)預(yù)測(cè)建模和分析技術(shù)(原書(shū)第3版)
- 啊哈C!思考快你一步
- 重估:人工智能與賦能社會(huì)
- 電腦上網(wǎng)入門(mén)
- 大數(shù)據(jù)素質(zhì)讀本