官术网_书友最值得收藏!

Introduction

In the previous chapter, we saw how to build plots using the built-in function of pandas, and learned how to estimate the mean, median, and other descriptive statistics about specific consumer or product groups.

In this chapter, we will learn about clustering, a form of unsupervised learning technique, and then begin a discussion of how to calculate the similarity between two data points. Next, we will discuss how to standardize data so that multiple data features can be used without one overwhelming the others. We will also go through how similarity can be calculated by computing the distance between data points. Finally, we will discuss k-means clustering, how to perform it, and how to explore the resulting groups.

主站蜘蛛池模板: 江源县| 泸水县| 泸水县| 隆尧县| 会同县| 台东县| 文登市| 万载县| 德钦县| 汤阴县| 穆棱市| 万荣县| 军事| 万安县| 宜城市| 邢台市| 江川县| 苗栗县| 玉溪市| 瓦房店市| 临潭县| 郯城县| 宣汉县| 赤城县| 永福县| 吉木萨尔县| 云林县| 合水县| 兰西县| 德兴市| 石狮市| 锦州市| 溧水县| 土默特左旗| 修水县| 介休市| 曲松县| 比如县| 阿城市| 论坛| 内黄县|