官术网_书友最值得收藏!

Clustering

Typically, when people talk about unsupervised learning, they talk about cluster analysis or clustering. A cluster analysis algorithm takes a set of data points and tries to categorize them into groups such that similar items belong to the same group, and different items do not. There are many ways where it can be used, for example, in customer segmentation or text categorization.

Customer segmentation is an example of clustering. Given some description of customers, we try to put them into groups such that the customers in one group have similar profiles and behave in a similar way. This information can be used to understand what do the people in these groups want, and this can be used to target them with better advertisements and other promotional messages.

Another example is text categorization. Given a collection of texts, we would like to find common topics among these texts and arrange the texts according to these topics. For example, given a set of complaints in an e-commerce store, we may want to put ones that talk about similar things together, and this should help the users of the system navigate through the complaints easier.

Examples of cluster analysis algorithms are hierarchical clustering, k-means, density-based spatial clustering of applications with noise (DBSCAN), and many others. We will talk about clustering in detail in the first part of Chapter 5, Unsupervised Learning - Clustering and Dimensionality Reduction.

主站蜘蛛池模板: 莱州市| 韶山市| 莱西市| 上思县| 新竹市| 嘉荫县| 三河市| 平罗县| 类乌齐县| 阳泉市| 甘谷县| 嵩明县| 德庆县| 繁昌县| 广安市| 横峰县| 乌什县| 浠水县| 汕头市| 嫩江县| 师宗县| 土默特左旗| 司法| 泸州市| 西吉县| 乌兰察布市| 琼海市| 马鞍山市| 阜康市| 布尔津县| 建阳市| 佛冈县| 高州市| 辽阳县| 崇明县| 莒南县| 湘潭县| 厦门市| 长治县| 墨脱县| 桃江县|