  • The Data Science Workshop
  • Anthony So Thomas V. Joseph Robert Thas John Andrew Worsley Dr. Samuel Asare
  • 294 words
  • 2021-06-11 18:27:22

Introduction

Earlier chapters introduced the two broad categories of machine learning: supervised learning and unsupervised learning. Supervised learning can be further divided into two types of problems: regression and classification. The last chapter covered regression problems. In this chapter, we will look at classification problems.

Take a look at Figure 3.1:

Figure 3.1: Overview of machine learning algorithms

Classification problems are among the most common use cases you will encounter in the real world. Unlike regression problems, where a real-valued number is predicted, classification problems deal with assigning an example to a category. Classification use cases take forms such as the following:

  • Predicting whether a customer will buy the recommended product
  • Identifying whether a credit transaction is fraudulent
  • Determining whether a patient has a disease
  • Analyzing images of animals and predicting whether the image is of a dog, cat, or panda
  • Analyzing text reviews and capturing the underlying emotion such as happiness, anger, sorrow, or sarcasm

Looking at the preceding examples, there is a subtle difference between the first three and the last two. The first three revolve around binary decisions:

  • Customers can either buy the product or not.
  • Credit card transactions can be fraudulent or legitimate.
  • Patients can be diagnosed as positive or negative for a disease.

Use cases like the first three, where a binary decision is made, are called binary classification problems. The last two, in contrast, assign an example to one of several classes or categories. Such problems are called multiclass classification problems. This chapter deals with binary classification problems. Multiclass classification is covered next, in Chapter 4, Multiclass Classification with RandomForest.
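
The distinction between binary and multiclass problems comes down to how many distinct classes the target can take. The following is a minimal sketch of that idea in plain Python. The helper name problem_type and the sample label lists are hypothetical and chosen only for illustration; they are not part of the book's code.

```python
def problem_type(labels):
    """Return 'binary' for two distinct classes, 'multiclass' for more.

    Hypothetical helper for illustration only.
    """
    n_classes = len(set(labels))  # count the distinct target classes
    if n_classes < 2:
        raise ValueError("a classification target needs at least two classes")
    return "binary" if n_classes == 2 else "multiclass"


# Made-up labels mirroring the use cases listed above
fraud_labels = ["fraudulent", "legitimate", "legitimate", "fraudulent"]
animal_labels = ["dog", "cat", "panda", "dog"]

print(problem_type(fraud_labels))   # binary
print(problem_type(animal_labels))  # multiclass
```

The fraud example has exactly two possible outcomes, so it is a binary problem, while the animal example has three, so it is a multiclass problem.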
