官术网_书友最值得收藏!

Classification

Classification is one of the classical problems in machine learning. Data under consideration could belong to one class or another, for example, if the images provided are data, they could be pictures of cats or dogs. Thus, the classes, in this case, are cats and dogs. Classification means identifying the label or class of the objects under consideration. Classification falls under the umbrella of supervised machine learning. In classification problems, a training dataset is provided that has features or inputs and their corresponding outputs or labels. Using this training dataset, a model is trained; in other words, the parameters of the model are computed. The trained model is then used on new data to find its correct labels.

Classification problems can be of two types: binary class or multiclass. Binary class means that the data is to be classified into two distinct and discrete labels; for example, the patient has cancer or the patient does not have cancer, and the images are of cats or dogs and so on. Multiclass means that the data is to be classified among multiple classes, for example, an email classification problem will divide emails into social media emails, work-related emails, personal emails, family-related emails, spam emails, shopping offer emails, and so on. Another example would be of pictures of digits; each picture could be labeled between 0 and 9, depending on what digit the picture represents. In this chapter, we will look at examples of both kinds of classification.

The most popular method for classification is logistic regression. Logistic regression is a probabilistic and linear classifier. The probability that the vector of input features belongs to a specific class can be described mathematically by the following equation: 

In the preceding equation, the following applies:

  • Y represents the output
  • i represents one of the classes
  • x represents the inputs
  • w represents the weights
  • b represents the biases
  • z represents the regression equation 
  • ? represents the smoothing function (or model, in our case)

The ?(z) function represents the probability that x belongs to class i when w and b are given. Thus, the model has to be trained to maximize the value of this probability.

主站蜘蛛池模板: 砚山县| 泗水县| 柳州市| 五家渠市| 镇宁| 祁阳县| 山东省| 和林格尔县| 七台河市| 景洪市| 海盐县| 海城市| 渭源县| 雷山县| 嘉兴市| 榆林市| 泰州市| 巫山县| 克什克腾旗| 武功县| 石柱| 腾冲县| 肇东市| 耒阳市| 临汾市| 建德市| 庐江县| 南乐县| 镇安县| 昔阳县| 遂昌县| 西丰县| 同江市| 黔东| 香港 | 水富县| 友谊县| 广丰县| 安远县| 乳山市| 浮山县|