- Artificial Intelligence for Big Data
- Anand Deshpande Manish Kumar
- 247字
- 2021-06-25 21:57:14
Polynomial regression
While in linear regression, the correlation between the independent and the dependent variables is best represented with a straight line, the real-life datasets are more complex and do not represent a linear relationship between cause and effect. The straight line equation does not fit the data points and hence cannot create an effective predictive model.
In such cases, we can consider using a higher-order quadratic equation for the predictor function. Given x as an independent variable and y as a dependent variable, the polynomial function takes the following forms:
These can be visualized with a small set of sample data as follows:
Figure 3.11 Polynomial prediction function
Note that the straight line cannot accurately represent the relationship between x and y. As we model the prediction function with higher-order functions, R2 is improved. This means the model is able to be more accurate.
We may think that it will be best to use the highest possible order equation for the prediction function in order to get the best fitting model. However, that is not right because as we create the regression line that goes through all the data points, the model fails to accurately predict the outcomes for any data outside of the training sample (test data). This problem is called overfitting. On the other end, we may also encounter the problem of underfitting. This is when the model does not fit the training data well and hence performs poorly with the test data.
- Mastering Ninject for Dependency Injection
- SQL Server 2008數據庫應用技術(第二版)
- 卷積神經網絡的Python實現
- 文本數據挖掘:基于R語言
- 數據庫程序員面試筆試真題庫
- 一個64位操作系統的設計與實現
- 數據庫設計與應用(SQL Server 2014)(第二版)
- Hadoop大數據開發案例教程與項目實戰(在線實驗+在線自測)
- 新手學會計(2013-2014實戰升級版)
- Doris實時數倉實戰
- Gideros Mobile Game Development
- Rust High Performance
- 高效使用Redis:一書學透數據存儲與高可用集群
- 一本書講透數據治理:戰略、方法、工具與實踐
- Swift 2 By Example