官术网_书友最值得收藏!

Summary

In this chapter, we used several of scikit-learn's methods for building a standard workflow to run and evaluate data mining models. We introduced the Nearest Neighbors algorithm, which is already implemented in scikit-learn as an estimator. Using this class is quite easy; first, we call the fit function on our training data, and second, we use the predict function to predict the class of testing samples.

We then looked at preprocessing by fixing poor feature scaling. This was done using a Transformer object and the MinMaxScaler class. These functions also have a fit method and then a transform, which takes a dataset as an input and returns a transformed dataset as an output.

In the next chapter, we will use these concepts in a larger example, predicting the outcome of sports matches using real-world data.

主站蜘蛛池模板: 民和| 福鼎市| 马尔康县| 延庆县| 石棉县| 井陉县| 晋城| 中阳县| 阿拉善左旗| 武功县| 时尚| 盖州市| 延寿县| 公安县| 景泰县| 芜湖县| 廉江市| 思南县| 丽江市| 大厂| 共和县| 泸溪县| 库伦旗| 无为县| 长白| 高青县| 剑川县| 老河口市| 安陆市| 洛阳市| 大庆市| 泸州市| 闽清县| 沙湾县| 孝昌县| 铜梁县| 泽州县| 东海县| 昆山市| 遂宁市| 海原县|