- Statistics for Data Science
- James D. Miller
- 299字
- 2021-07-02 14:58:55
Boosting
In a manner of speaking, boosting is a process generally accepted in data science for improving the accuracy of a weak learning data science process.
Specifically, boosting is aimed at reducing bias and variance in supervised learning.
What do we mean by bias and variance? Before going on further about boosting, let's take note of what we mean by bias and variance.
Data scientists describe bias as a level of favoritism that is present in the data collection process, resulting in uneven, disingenuous results and can occur in a variety of different ways. A sampling method is called biased if it systematically favors some outcomes over others.
A variance may be defined (by a data scientist) simply as the distance from a variable mean (or how far from the average a result is).
The boosting method can be described as a data scientist repeatedly running through a data science process (that has been identified as a weak learning process), with each iteration running on different and random examples of data sampled from the original population recordset. All the results (or classifiers or residue) produced by each run are then combined into a single merged result (that is a gradient).
This concept of using a random subset of the original recordset for each iteration originates from bootstrap sampling in bagging and has a similar variance-reducing effect on the combined model.
In addition, some data scientists consider boosting a means to convert weak learners into strong ones; in fact, to some, the process of boosting simply means turning a weak learner into a strong learner.
- 零起步輕松學(xué)單片機(jī)技術(shù)(第2版)
- 虛擬儀器設(shè)計(jì)測(cè)控應(yīng)用典型實(shí)例
- 會(huì)聲會(huì)影X5視頻剪輯高手速成
- Mobile DevOps
- 大數(shù)據(jù)改變世界
- 自動(dòng)檢測(cè)與傳感技術(shù)
- 傳感器技術(shù)應(yīng)用
- 自動(dòng)生產(chǎn)線的拆裝與調(diào)試
- Ruby on Rails敏捷開發(fā)最佳實(shí)踐
- Excel 2007技巧大全
- Mastering Game Development with Unreal Engine 4(Second Edition)
- 軟件工程及實(shí)踐
- Mastering OpenStack(Second Edition)
- 計(jì)算機(jī)硬件技術(shù)基礎(chǔ)(第2版)
- 計(jì)算機(jī)仿真技術(shù)