官术网_书友最值得收藏!

Problems with the existing approach

We got the baseline score using the AdaBoost and GradientBoosting classifiers. Now, we need to increase the accuracy of these classifiers. In order to do that, we first list all the areas that can be improvised but that we haven't worked upon extensively. We also need to list possible problems with the baseline approach. Once we have the list of the problems or the areas on which we need to work, it will be easy for us to implement the revised approach.

Here, I'm listing some of the areas, or problems, that we haven't worked on in our baseline iteration:

  • Problem: We haven't used cross-validation techniques extensively in order to check the overfitting issue.
    • Solution: If we use cross-validation techniques properly, then we will know whether our trained ML model suffers from overfitting or not. This will help us because we don't want to build a model that can't even be generalized properly.
  • Problem: We also haven't focused on hyperparameter tuning. In our baseline approach, we mostly use the default parameters. We define these parameters during the declaration of the classifier. You can refer to the code snippet given in Figure 1.52, where you can see the classifier taking some parameters that are used when it trains the model. We haven't changed these parameters.
    • Solution: We need to tune these hyperparameters in such a way that we can increase the accuracy of the classifier. There are various hyperparameter-tuning techniques that we need to use.

In the next section, we will look at how these optimization techniques actually work as well as discuss the approach that we are going to take. So let's begin!

主站蜘蛛池模板: 重庆市| 邵武市| 余干县| 清镇市| 铜山县| 含山县| 黎城县| 孝义市| 华阴市| 阿拉善盟| 榆林市| 思茅市| 河东区| 辰溪县| 井陉县| 奇台县| 西华县| 深泽县| 固安县| 霍邱县| 高密市| 宝鸡市| 高邑县| 商南县| 新巴尔虎右旗| 无极县| 五家渠市| 钟祥市| 襄城县| 名山县| 咸丰县| 正镶白旗| 唐山市| 泸水县| 台前县| 周宁县| 尉氏县| 建德市| 老河口市| 武乡县| 巫溪县|