官术网_书友最值得收藏!

Understanding supervised learning

We have previously established that the goal of supervised learning is always to predict the labels (or target values) of some data. However, depending on the nature of these labels, supervised learning can come in two distinct forms:

  • Classification: Supervised learning is called classification whenever we use the data to predict categories. A good example of this is when we try to predict whether an image contains a cat or a dog. Here, the labels of the data are categorical, either one or the other, but never a mixture of categories. For example, a picture contains either a cat or a dog, never 50 percent cat and 50 percent dog (before you ask, no, here we do not consider pictures of the cartoon character CatDog), and our job is simply to tell which one it is. When there are only two choices, it is called two-class or binary classification. When there are more than two categories, as when predicting what the weather will be like the next day, it is known as multi-class classification.
  • Regression: Supervised learning is called regression whenever we use the data to predict real values. A good example of this is when we try to predict stock prices. Rather than predicting stock categories, the goal of regression is to predict a target value as accurately as possible; for example, to predict the stock prices with as little an error as possible.

Perhaps the easiest way to figure out whether we are dealing with a classification or regression problem is to ask ourselves the following question: What are we actually trying to predict? The answer is given in the following figure:

Differentiating between classification and regression problems
主站蜘蛛池模板: 信丰县| 渝北区| 湖口县| 建宁县| 兖州市| 南江县| 永兴县| 乌鲁木齐县| 遂昌县| 清流县| 吉林省| 孙吴县| 肥乡县| 综艺| 绥江县| 双牌县| 边坝县| 紫阳县| 上栗县| 梅州市| 汉沽区| 翁源县| 五家渠市| 来凤县| 广灵县| 介休市| 三原县| 奉节县| 绥滨县| 大城县| 日土县| 鸡西市| 荥经县| 凤阳县| 崇左市| 资阳市| 化州市| 岑巩县| 晋宁县| 陇西县| 错那县|