官术网_书友最值得收藏!

Learning curves

Another method of assessing a model's performance is by evaluating the model's growth of learning or the model's ability to improve learning (obtain a better score) with additional experience (for example, more rounds of cross-validation).

Note

Learning is the act of acquiring new, or modifying and reinforcing existing, knowledge.

The information indicating a model's result or score with a data file population can be combined with other scores to show a line or curve, which is known as a model's learning curve.

A learning curve is a graphical representation of the growth of learning (the scores shown in a vertical axis) with practice (the individual data files or rounds shown in the horizontal axis).

This can also be conceptualized as:

  • The same task repeated in a series
  • A body of knowledge learned over time

The following figure illustrates a hypothetical learning curve, showing the improved learning of a predictive model using resultant scores by cross-validation round:

Learning curves

Source link: https://en.wikipedia.org/wiki/File:Alanf777_Lcd_fig01.png

Tip

It's funny; one might know that the familiar expression it's a steep learning curve is intended to describe an activity that is tough to learn, but in statistics, a learning curve with a steep start would actually represent a rapidly improving progress.

Learning curves relating model performance to experience are commonly found to be used when performing model assessments.

As we have mentioned earlier in this section, performance (or the scores) is meant to be the accuracy of a model while experience (or round) may be the number of training examples, datasets, or iterations used in optimizing the model parameters.

Plot and ping

Using two generic R functions, we can demonstrate a simple learning curve visualization. Ping will open an image file which will hold our learning curve visualization so we can easily include it in a document later, and plot will draw our graphic.

The following are our example R code statements:

# -- 5 rounds of numeric test scores saved in a vector named "v"
v <-c(74,79, 88, 90, 99)

# -- create an image file for the visualization for later use
png(file = "c:/simple example/learning curve.png", type = c("windows", "cairo", "cairo-png"))

# -- plot the model scores round by round
plot(v, type = "o", col = "red", xlab = "Round", ylab = "Score", main = "Learning Curve")

# -- close output 
dev.off()

The preceding statements create the following graphic as a file:

Plot and ping
主站蜘蛛池模板: 丰顺县| 松潘县| 迁安市| 津南区| 上栗县| 浦北县| 台中县| 阿拉善左旗| 景东| 东丽区| 鹤壁市| 邵阳县| 镇坪县| 米泉市| 长白| 芜湖市| 南涧| 勐海县| 汶川县| 台前县| 嘉善县| 六安市| 鄂州市| 濉溪县| 鲁甸县| 云浮市| 盘锦市| 湘阴县| 淄博市| 揭阳市| 乌鲁木齐市| 梅州市| 黔西县| 邮箱| 抚宁县| 安平县| 木兰县| 集贤县| 宜丰县| 长泰县| 郁南县|