  • Statistics for Data Science
  • James D. Miller
  • 299 words
  • 2021-07-02 14:58:55

Boosting

Boosting is a widely used data science technique for improving the accuracy of a weak learning process.

Weak learners are data science processes that produce results only slightly better than random guessing. A typical weak learner is a single threshold on one feature, that is, a 1-level decision tree (often called a decision stump).
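To make the idea of a 1-level decision tree concrete, here is a minimal sketch in plain Python. The function names `fit_stump` and `predict_stump` and the six-point dataset are made up for illustration; they are not taken from the book.

```python
# Illustrative sketch of a weak learner: a single threshold on one feature.
# All names and data here are hypothetical.

def fit_stump(xs, ys):
    """Try every threshold and direction; keep the one with the lowest error.

    xs: list of numbers (one feature); ys: list of labels in {-1, +1}.
    Returns (threshold, polarity, error_rate).
    """
    best = None
    for threshold in sorted(set(xs)):
        for polarity in (1, -1):
            # Predict `polarity` at or above the threshold, the opposite below.
            preds = [polarity if x >= threshold else -polarity for x in xs]
            error = sum(p != y for p, y in zip(preds, ys)) / len(ys)
            if best is None or error < best[2]:
                best = (threshold, polarity, error)
    return best


def predict_stump(stump, x):
    threshold, polarity, _ = stump
    return polarity if x >= threshold else -polarity


xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [-1, -1, 1, -1, 1, 1]   # no single threshold separates these labels

stump = fit_stump(xs, ys)
print(stump)  # best threshold, direction, and training error rate
```

On this toy data the best single threshold still misclassifies one of the six points, which is the kind of limited accuracy that boosting sets out to improve.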

Specifically, boosting is aimed at reducing bias and variance in supervised learning.

Before going further with boosting, let's take note of what we mean by bias and variance.

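Data scientists describe bias as a level of favoritism present in the data collection process that produces uneven, misleading results; it can arise in a variety of ways. A sampling method is called biased if it systematically favors some outcomes over others. For a supervised learner, the term also covers systematic error in the model itself: a learner that is too simple for the data misses the target in the same direction no matter which sample it is trained on.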

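Variance may be defined (by a data scientist) simply as a measure of spread around a variable's mean: the average of the squared distances between each result and the average result. For a learner, high variance means that its results change substantially from one training sample to the next.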
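As a small worked example, taking variance in its standard statistical sense (the average squared deviation from the mean), the calculation can be sketched as follows. The sample values are invented for illustration.

```python
import statistics

# Hypothetical sample values, chosen only to keep the arithmetic simple.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

mean = sum(values) / len(values)

# Population variance: average of the squared distances from the mean.
variance = sum((v - mean) ** 2 for v in values) / len(values)

print(mean, variance)

# The standard library computes the same quantity.
print(statistics.pvariance(values))
```

Note that `statistics.pvariance` divides by the number of values, while `statistics.variance` divides by one less than that (the sample variance), so the two give different answers on the same data.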

The boosting method can be described as a data scientist repeatedly running through a data science process (that has been identified as a weak learning process), with each iteration running on a different random sample of the original population recordset. In boosting, the iterations run in sequence, and each one gives more weight to the examples that the earlier iterations handled poorly. All the results produced by each run (classifiers or, in gradient boosting, fits to the residuals left by earlier runs) are then combined into a single merged result, typically a weighted sum.

This concept of using a random subset of the original recordset for each iteration originates from bootstrap sampling in bagging and has a similar variance-reducing effect on the combined model.

In addition, some data scientists consider boosting a means to convert weak learners into strong ones; in fact, to some, the process of boosting simply means turning a weak learner into a strong learner.
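The sketch below illustrates the weak-to-strong idea on a toy problem. It is an assumption-laden illustration, not the book's own procedure: it uses AdaBoost-style reweighting of examples instead of random resampling, and the names `fit_weighted_stump`, `boost`, and `predict`, along with the data, are hypothetical.

```python
import math

# Illustrative AdaBoost-style sketch with threshold weak learners.
# All names and data are hypothetical.

def stump_predict(threshold, polarity, x):
    return polarity if x >= threshold else -polarity


def fit_weighted_stump(xs, ys, weights):
    """Return the (threshold, polarity, weighted_error) with the lowest error."""
    best = None
    for threshold in sorted(set(xs)):
        for polarity in (1, -1):
            error = sum(w for x, y, w in zip(xs, ys, weights)
                        if stump_predict(threshold, polarity, x) != y)
            if best is None or error < best[2]:
                best = (threshold, polarity, error)
    return best


def boost(xs, ys, rounds):
    """Fit `rounds` weak learners in sequence; return (alpha, threshold, polarity) tuples."""
    n = len(xs)
    weights = [1.0 / n] * n                  # start with equal weight per example
    ensemble = []
    for _ in range(rounds):
        threshold, polarity, error = fit_weighted_stump(xs, ys, weights)
        error = min(max(error, 1e-10), 1 - 1e-10)      # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - error) / error)    # more accurate learners get a larger say
        ensemble.append((alpha, threshold, polarity))
        # Raise the weight of misclassified examples, lower the rest, then renormalize.
        weights = [w * math.exp(-alpha * y * stump_predict(threshold, polarity, x))
                   for x, y, w in zip(xs, ys, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


def predict(ensemble, x):
    """Combine the weak learners by a weighted vote."""
    score = sum(alpha * stump_predict(threshold, polarity, x)
                for alpha, threshold, polarity in ensemble)
    return 1 if score >= 0 else -1


xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [-1, -1, 1, -1, 1, 1]

single = boost(xs, ys, rounds=1)     # one weak learner on its own
boosted = boost(xs, ys, rounds=3)    # three weak learners combined

print([predict(single, x) for x in xs])
print([predict(boosted, x) for x in xs])
```

On this toy data a single threshold misclassifies one point, while the weighted vote of three thresholds reproduces every label. Production libraries implement boosting with many refinements that this sketch omits.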
