官术网_书友最值得收藏!

  • Deep Learning By Example
  • Ahmed Menshawy
  • 274字
  • 2021-06-24 18:52:43

Apparent (training set) error

This the first type of error that you don't have to care about minimizing. Getting a small value for this type of error doesn't mean that your model will work well over the unseen data (generalize). To better understand this type of error, we'll give a trivial example of a class scenario. The purpose of solving problems in the classroom is not to be able to solve the same problem again in the exam, but to be able to solve other problems that won’t necessarily be similar to the ones you practiced in the classroom. The exam problems could be from the same family of the classroom problems, but not necessarily identical.

Apparent error is the ability of the trained model to perform on the training set for which we already know the true outcome/output. If you manage to get 0 error over the training set, then it is a good indicator for you that your model (mostly) won't work well on unseen data (won't generalize). On the other hand, data science is about using a training set as a base knowledge for the learning algorithm to work well on future unseen data.

In Figure 3, the red curve represents the apparent error. Whenever you increase the model's ability to memorize things (such as increasing the model complexity by increasing the number of explanatory features), you will find that this apparent error approaches zero. It can be shown that if you have as many features as observations/samples, then the apparent error will be zero:

Figure 13: Apparent error (red curve) and generalization/true error (light blue)
主站蜘蛛池模板: 教育| 海阳市| 南华县| 容城县| 五华县| 邵武市| 四子王旗| 易门县| 武义县| 昭苏县| 临武县| 兴和县| 卫辉市| 田林县| 屏南县| 合江县| 赞皇县| 威海市| 潞城市| 墨江| 余干县| 五家渠市| 赤水市| 嘉定区| 隆回县| 大埔县| 平远县| 浙江省| 祁门县| 邵阳县| 江津市| 宜黄县| 大石桥市| 白银市| 遵义市| 聊城市| 于都县| 普陀区| 太仆寺旗| 延庆县| 惠水县|