官术网_书友最值得收藏!

Variance of an estimator

At the beginning of this chapter, we have defined the data generating process pdata, and we have assumed that our dataset X has been drawn from this distribution; however, we don't want to learn existing relationships limited to X, but we expect our model to be able to generalize correctly to any other subset drawn from pdata. A good measure of this ability is provided by the variance of the estimator:

The variance can be also defined as the square of the standard error (analogously to the standard deviation). A high variance implies dramatic changes in the accuracy when new subsets are selected, because the model has probably reached a very high training accuracy through an over-learning of a limited set of relationships, and it has almost completely lost its ability to generalize. 

主站蜘蛛池模板: 当阳市| 邯郸市| 化州市| 长白| 赫章县| SHOW| 博白县| 海伦市| 长沙县| 资源县| 永登县| 禄劝| 鹤岗市| 盐亭县| 新乐市| 铜川市| 沁源县| 明溪县| 庆云县| 台州市| 海南省| 三穗县| 苍山县| 遂川县| 信丰县| 开鲁县| 河间市| 班戈县| 仁化县| 高陵县| 石城县| 饶阳县| 永胜县| 上犹县| 和田县| 平武县| 昭平县| 陵川县| 彰化县| 周口市| 英吉沙县|