
Supervised learning

Supervised learning is the simplest and best-known machine learning task. It relies on a set of labeled examples, in which the category that each input belongs to is already known, as shown in the following diagram:

The preceding diagram shows a typical supervised learning workflow. An actor (for example, a data scientist or data engineer) performs Extract, Transform, Load (ETL) and the necessary feature engineering (including feature extraction, selection, and so on) to obtain data with features and labels that can be fed into the model. The actor then splits the data into training, validation (also called development), and test sets. The training set is used to train an ML model, the validation set is used to check the trained model for overfitting and to tune regularization, and the test set (that is, unseen data) is used to evaluate the model's performance.
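The roles of the three sets can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the toy dataset, the k-nearest-neighbors classifier, and the helper names (`make_dataset`, `knn_predict`, `accuracy`) are hypothetical and do not come from the text.

```python
import random
from collections import Counter


def make_dataset(n, seed=0):
    """Generate a hypothetical labeled dataset: two noisy 2-D clusters."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        label = rng.choice([0, 1])
        center = 0.0 if label == 0 else 2.0
        features = (rng.gauss(center, 1.0), rng.gauss(center, 1.0))
        data.append((features, label))
    return data


def squared_distance(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def knn_predict(train, x, k):
    """Predict the majority label among the k training points nearest to x."""
    nearest = sorted(train, key=lambda item: squared_distance(item[0], x))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(train, examples, k):
    """Fraction of examples whose predicted label matches the true label."""
    correct = sum(1 for x, y in examples if knn_predict(train, x, k) == y)
    return correct / len(examples)


# The generated examples are already in random order, so slicing is a random split.
data = make_dataset(300)
train, validation, test = data[:180], data[180:210], data[210:]

# A large gap between training and validation accuracy hints at overfitting.
best_k, best_val_acc = None, -1.0
for k in (1, 5, 15):  # odd values avoid tied votes between the two classes
    train_acc = accuracy(train, train, k)
    val_acc = accuracy(train, validation, k)
    print(f"k={k}: train accuracy={train_acc:.2f}, validation accuracy={val_acc:.2f}")
    if val_acc > best_val_acc:
        best_k, best_val_acc = k, val_acc

# The test set is touched only once, after the model has been chosen.
test_acc = accuracy(train, test, best_k)
print(f"chosen k={best_k}, test accuracy={test_acc:.2f}")
```

With k=1, every training point is its own nearest neighbor, so training accuracy is perfect regardless of how well the model generalizes. This is why the model is judged on the validation set rather than on the training set.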

However, if the performance is not satisfactory, the actor can tune the model further through hyperparameter optimization to find the best model. Finally, the actor deploys the best model in a production-ready environment. The following diagram summarizes these steps in a nutshell:

In the overall life cycle, many actors might be involved (for example, a data engineer, a data scientist, or an ML engineer), performing each step independently or collaboratively. Supervised learning covers classification and regression tasks. Classification predicts which class a data point belongs to, that is, a discrete value: the label of the class attribute. Regression, on the other hand, predicts continuous values, that is, a numeric value for the target attribute.
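The difference between the two task types shows up in the kind of output each model returns. The following minimal sketch assumes a nearest-centroid classifier and a least-squares line as stand-ins. The data and the names `fit_centroids`, `classify`, `fit_line`, and `predict` are hypothetical, chosen only for illustration.

```python
def fit_centroids(examples):
    """Compute the mean feature value per class from (feature, label) pairs."""
    sums, counts = {}, {}
    for x, label in examples:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def classify(centroids, x):
    """Classification: return the discrete label of the nearest centroid."""
    return min(centroids, key=lambda label: abs(centroids[label] - x))


def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(line, x):
    """Regression: return a continuous numeric value."""
    slope, intercept = line
    return slope * x + intercept


# Classification: labels are categories.
labeled = [(1.0, "small"), (2.0, "small"), (8.0, "large"), (9.0, "large")]
centroids = fit_centroids(labeled)
print(classify(centroids, 3.0))  # a class label

# Regression: targets are numbers.
line = fit_line([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
print(predict(line, 5.0))  # a numeric value
```

Here `classify` can only return one of the labels seen during training, while `predict` can return any real number, including values that never appeared in the training targets.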

In supervised learning, the input dataset is split randomly into three sets, for example, 60% for the training set, 10% for the validation set, and the remaining 30% for the test set.
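A random 60/10/30 split like the one described above could be written as follows. This is a minimal sketch using only the Python standard library. The function name `split_dataset`, its parameters, and the fixed seed are assumptions made for illustration.

```python
import random


def split_dataset(examples, train_frac=0.6, val_frac=0.1, seed=42):
    """Shuffle a copy of the examples and cut it into train/validation/test."""
    shuffled = list(examples)  # copy so the caller's data is left untouched
    random.Random(seed).shuffle(shuffled)  # fixed seed makes the split repeatable
    n_train = round(len(shuffled) * train_frac)
    n_val = round(len(shuffled) * val_frac)
    train = shuffled[:n_train]
    validation = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # whatever remains goes to the test set
    return train, validation, test


examples = list(range(100))  # stand-in for 100 labeled examples
train_set, validation_set, test_set = split_dataset(examples)
print(len(train_set), len(validation_set), len(test_set))
```

Shuffling before slicing matters when the source data is ordered (for example, sorted by label or by date), because otherwise one of the sets could end up with a skewed sample.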
