官术网_书友最值得收藏!

Training different regression models

The following screenshot shows a dataframe where we are going to save performance. We are going to run four models, namely logistic regression, bagging, random forest, and boosting:

We are going to use the following evaluation metrics in this case:

  • accuracy: This metric measures how often the model predicts defaulters and non-defaulters correctly
  • precision: This metric will be when the model predicts the default and how often the model is correct
  • recall: This metric will be the proportion of actual defaulters that the model will correctly predict

The most important of these is the recall metric. The reason behind this is that we want to maximize the proportion of actual defaulters that the model identifies, and so the model with the best recall is selected.

主站蜘蛛池模板: 阜宁县| 双峰县| 乌审旗| 丰城市| 富锦市| 道真| 富顺县| 固镇县| 翼城县| 秦皇岛市| 扎囊县| 武胜县| 延寿县| 蒲城县| 炎陵县| 本溪| 东方市| 江城| 夏津县| 九寨沟县| 中阳县| 朝阳区| 平远县| 屏山县| 富源县| 平利县| 鹤岗市| 孟连| 汉中市| 高唐县| 达州市| 新乡市| 思南县| 黎川县| 响水县| 广德县| 新民市| 罗定市| 郎溪县| 靖江市| 浦北县|