官术网_书友最值得收藏!

Summary

The chapter began with an introduction to some of the most important datasets that will be used in the rest of the book. The datasets covered a range of analytical problems including classification, regression, time series, survival, clustering, and a dataset in which identifying an outlier is important. Important families of classification models were then introduced in the statistical/machine learning models section. Following the introduction of a variety of models, we immediately saw the shortcoming, in that we don't have a model for all seasons. Model performance varies from dataset to dataset. Depending on the initialization, the performance of certain models (such as neural networks) is affected. Consequently, there is a need to find a way to ensure that the models can be improved upon in most scenarios.

This paves the way for the ensemble method, which forms the title of this book. We will elaborate on this method in the rest of the book. This chapter closed with quick statistical tests that will help in carrying out model comparisons. Resampling forms the core of ensemble methods, and we will look at the important jackknife and bootstrap methods in the next chapter.

主站蜘蛛池模板: 秦皇岛市| 阿克苏市| 无锡市| 冕宁县| 漯河市| 方城县| 象山县| 华蓥市| 建宁县| 琼海市| 绍兴市| 泾阳县| 榕江县| 武清区| 尚志市| 桐城市| 通河县| 永康市| 突泉县| 南江县| 建始县| 海丰县| 论坛| 图木舒克市| 涿鹿县| 郁南县| 东乌珠穆沁旗| 正镶白旗| 云阳县| 松阳县| 澄江县| 阿拉善左旗| 阿克苏市| 尚志市| 乌海市| 云和县| 乐陵市| 德清县| 阳原县| 天长市| 迁西县|