官术网_书友最值得收藏!

Summary

ML consists of constructing models that are able to convert data into knowledge that can be used to make decisions, some of which are based on complicated mathematical concepts to understand data. Scikit-learn is an open source Python library that is meant to facilitate the process of applying these models to data problems, without much complex math knowledge required.

This chapter explained the key steps of preprocessing your input data, from separating the features from the target, to dealing with messy data and rescaling the values of the data. All these steps should be performed before ping into training a model as they help to improve the training times, as well as the performance of the models.

Next, the different components of the scikit-learn API were explained: the estimator, the predictor, and the transformer. Finally, this chapter covered the difference between supervised and unsupervised learning, and the most popular algorithms of each type of learning were introduced.

With all of this in mind, in the next chapter, we will focus on detailing the process of implementing an unsupervised algorithm for a real-life dataset.

主站蜘蛛池模板: 卢龙县| 施甸县| 西畴县| 通河县| 汽车| 荆门市| 乐亭县| 固镇县| 西贡区| 高唐县| 宁蒗| 京山县| 彭水| 云和县| 凤阳县| 武宣县| 鄂托克前旗| 绵阳市| 紫阳县| 丽水市| 丹东市| 顺昌县| 广州市| 勐海县| 洛浦县| 囊谦县| 岐山县| 阿拉尔市| 锡林郭勒盟| 大港区| 思南县| 上思县| 商水县| 孙吴县| 山阳县| 清新县| 资溪县| 沛县| 军事| 新和县| 铜山县|