官术网_书友最值得收藏!

Summary

In this chapter, we used several of scikit-learn's methods for building a standard workflow to run and evaluate data mining models. We introduced the Nearest Neighbors algorithm, which is already implemented in scikit-learn as an estimator. Using this class is quite easy; first, we call the fit function on our training data, and second, we use the predict function to predict the class of testing samples.

We then looked at preprocessing by fixing poor feature scaling. This was done using a Transformer object and the MinMaxScaler class. These functions also have a fit method and then a transform, which takes a dataset as an input and returns a transformed dataset as an output.

In the next chapter, we will use these concepts in a larger example, predicting the outcome of sports matches using real-world data.

主站蜘蛛池模板: 东乌珠穆沁旗| 高青县| 通渭县| 池州市| 阳泉市| 枣强县| 页游| 金乡县| 旅游| 大庆市| 平远县| 宾阳县| 上栗县| 城步| 留坝县| 桃江县| 方山县| 南投市| 芦溪县| 老河口市| 吉木萨尔县| 顺昌县| 新蔡县| 仁寿县| 措美县| 虹口区| 平塘县| 施甸县| 宜城市| 晋州市| 庐江县| 巴青县| 罗甸县| 德江县| 余庆县| 全州县| 息烽县| 汶川县| 开化县| 屏山县| 延庆县|