
Summary

In a sense, this was a very theoretical chapter, as we introduced generic concepts with simple examples. We went over a few operations with a classic dataset. This, by now, is considered a very small problem. However, it has the advantage that we were able to plot it out and see what we were doing in detail. This is something that will be lost when we move on to problems with many dimensions and many thousands of examples. The intuitions we gained here will all still be valid.

Classification means generalizing from examples to build a model (that is, a rule that can automatically be applied to new, unclassified objects). It is one of the fundamental tools in machine learning, and we will see many more examples of this in forthcoming chapters.

We also learned that the training error is a misleading, over-optimistic estimate of how well the model does. We must, instead, evaluate the model on testing data that was not used for training. To avoid wasting too many examples on testing, we can use a cross-validation schedule, which gets us the best of both worlds (at the cost of more computation).
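As a reminder of what this looks like in practice, here is a minimal sketch of k-fold cross-validation written with NumPy only. It is not the code from this chapter: the data is synthetic, the model is a simple one-feature threshold rule, and the names `fit_threshold` and `cross_validate` are hypothetical helpers chosen for illustration.

```python
import numpy as np

def fit_threshold(features, labels):
    """Return the threshold on a single feature with the best training accuracy."""
    best_acc, best_t = -1.0, None
    for t in np.unique(features):
        # Predict True whenever the feature value is above the candidate threshold
        acc = np.mean((features > t) == labels)
        if acc > best_acc:
            best_acc, best_t = acc, t
    return best_t

def cross_validate(features, labels, n_folds=5, seed=0):
    """Return (mean training accuracy, mean testing accuracy) over n_folds folds."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(features))
    folds = np.array_split(order, n_folds)
    train_accs, test_accs = [], []
    for i in range(n_folds):
        test_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(n_folds) if j != i])
        # Fit on the training folds only; the held-out fold is never seen here
        t = fit_threshold(features[train_idx], labels[train_idx])
        train_accs.append(np.mean((features[train_idx] > t) == labels[train_idx]))
        test_accs.append(np.mean((features[test_idx] > t) == labels[test_idx]))
    return float(np.mean(train_accs)), float(np.mean(test_accs))

# Synthetic two-class data (an assumption for illustration, not the chapter's dataset)
rng = np.random.default_rng(42)
labels = rng.random(200) < 0.5
features = rng.normal(loc=np.where(labels, 1.0, 0.0), scale=1.0)

train_acc, test_acc = cross_validate(features, labels)
print("Mean training accuracy: {:.3f}".format(train_acc))
print("Mean testing accuracy:  {:.3f}".format(test_acc))
```

Every example is used for testing exactly once and for training in the remaining folds, which is why the model has to be fitted once per fold. The testing accuracy is the number to report.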

We also had a look at the problem of feature engineering. Features are not predefined for you; choosing and designing them is an integral part of designing a machine-learning pipeline. In fact, it is often the area where you can get the most improvement in accuracy, as better data beats fancier methods. The chapters on computer vision and text-based classification will provide examples for these specific settings.

In this chapter, we wrote all of our own code (except when we used NumPy, of course). This will not be the case for the next few chapters, but we needed to build up intuitions on simple cases to illustrate the basic concepts.

The next chapter looks at how to proceed when your data does not have predefined classes for classification.
