- Machine Learning in Java
- AshishSingh Bhatia, Bostjan Kaluza
- 2021-06-10 19:29:59
Clustering
Clustering is a technique for grouping similar instances into clusters according to some distance measure. The main idea is to put instances that are similar (that is, close to each other) into the same cluster, while keeping dissimilar points (that is, the ones further apart from each other) in different clusters. An example of what clusters might look like is shown in the following diagram:
Clustering algorithms follow two fundamentally different approaches. The first is the hierarchical, or agglomerative, approach: it starts by treating each point as its own cluster and then iteratively merges the most similar clusters. It stops when the number of clusters reaches a predefined value, or when the clusters that would be merged next are spread over too large a region.
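The agglomerative procedure can be sketched in a few lines of plain Java. This is a minimal illustration rather than a library implementation: the class name `AgglomerativeSketch`, the choice of single linkage (the distance between the closest members of two clusters), and the `maxMergeDistance` threshold used to approximate "spread over too large a region" are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of agglomerative clustering with single linkage.
class AgglomerativeSketch {

    // Euclidean distance between two points of equal dimension.
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Single linkage (an assumption): smallest distance between any two members.
    static double linkage(List<double[]> c1, List<double[]> c2) {
        double best = Double.POSITIVE_INFINITY;
        for (double[] p : c1) {
            for (double[] q : c2) {
                best = Math.min(best, distance(p, q));
            }
        }
        return best;
    }

    // Merges the two closest clusters until targetClusters remain, or until
    // the closest pair is further apart than maxMergeDistance.
    static List<List<double[]>> cluster(double[][] points, int targetClusters,
                                        double maxMergeDistance) {
        // Every point starts as its own cluster.
        List<List<double[]>> clusters = new ArrayList<>();
        for (double[] p : points) {
            List<double[]> single = new ArrayList<>();
            single.add(p);
            clusters.add(single);
        }
        while (clusters.size() > targetClusters) {
            int bestI = -1;
            int bestJ = -1;
            double bestDist = Double.POSITIVE_INFINITY;
            for (int i = 0; i < clusters.size(); i++) {
                for (int j = i + 1; j < clusters.size(); j++) {
                    double d = linkage(clusters.get(i), clusters.get(j));
                    if (d < bestDist) {
                        bestDist = d;
                        bestI = i;
                        bestJ = j;
                    }
                }
            }
            // Stop if nothing can be merged or the best merge is too spread out.
            if (bestI < 0 || bestDist > maxMergeDistance) {
                break;
            }
            // bestJ > bestI, so removing bestJ does not shift bestI.
            clusters.get(bestI).addAll(clusters.remove(bestJ));
        }
        return clusters;
    }
}
```

Calling `AgglomerativeSketch.cluster(points, 2, Double.POSITIVE_INFINITY)` applies only the first stopping rule (a fixed number of clusters), while a finite `maxMergeDistance` lets the second rule end the merging earlier. The naive pairwise search recomputes every linkage on each pass, which is acceptable for small examples only.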
The other approach is based on point assignment. First, initial cluster centers (that is, centroids) are estimated, for instance, at random; then each point is assigned to the closest cluster until all of the points are assigned. The best-known algorithm in this group is k-means clustering.
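A point-assignment algorithm of this kind might look as follows in plain Java. The sketch assumes the usual k-means iteration, in which assignment alternates with recomputing each centroid as the mean of its points until no assignment changes; that update loop, the class name `KMeansSketch`, and the `maxIterations` and `seed` parameters are assumptions of the example, not details given in the text. It also assumes `k` is no larger than the number of points.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical sketch of k-means with random initial centroids.
class KMeansSketch {

    // Squared Euclidean distance; sufficient for comparing distances.
    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    // Index of the centroid closest to the given point.
    static int closest(double[] point, double[][] centroids) {
        int best = 0;
        for (int c = 1; c < centroids.length; c++) {
            if (squaredDistance(point, centroids[c]) < squaredDistance(point, centroids[best])) {
                best = c;
            }
        }
        return best;
    }

    // Returns, for each point, the index of the cluster it ends up in.
    static int[] cluster(double[][] points, int k, int maxIterations, long seed) {
        int n = points.length;
        int dim = points[0].length;
        Random random = new Random(seed);

        // Pick k distinct points at random as the initial centroids.
        int[] order = new int[n];
        for (int i = 0; i < n; i++) {
            order[i] = i;
        }
        double[][] centroids = new double[k][];
        for (int i = 0; i < k; i++) {
            int j = i + random.nextInt(n - i);
            int tmp = order[i];
            order[i] = order[j];
            order[j] = tmp;
            centroids[i] = points[order[i]].clone();
        }

        int[] assignment = new int[n];
        Arrays.fill(assignment, -1);
        for (int iter = 0; iter < maxIterations; iter++) {
            // Assignment step: each point goes to its closest centroid.
            boolean changed = false;
            for (int i = 0; i < n; i++) {
                int c = closest(points[i], centroids);
                if (c != assignment[i]) {
                    assignment[i] = c;
                    changed = true;
                }
            }
            if (!changed) {
                break; // assignments are stable
            }
            // Update step: move each centroid to the mean of its points.
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            for (int i = 0; i < n; i++) {
                counts[assignment[i]]++;
                for (int d = 0; d < dim; d++) {
                    sums[assignment[i]][d] += points[i][d];
                }
            }
            for (int c = 0; c < k; c++) {
                if (counts[c] > 0) { // an empty cluster keeps its old centroid
                    for (int d = 0; d < dim; d++) {
                        centroids[c][d] = sums[c][d] / counts[c];
                    }
                }
            }
        }
        return assignment;
    }
}
```

A call such as `KMeansSketch.cluster(points, 2, 100, 42L)` returns one cluster index per input point. Because the starting centroids are random, different seeds can yield different cluster numbering and, on less clearly separated data, different groupings.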
k-means clustering picks its initial cluster centers in one of two ways: either as points that are as far as possible from one another, or by (hierarchically) clustering a sample of the data and picking the point closest to the center of each of the k clusters.
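The first initialization strategy can be approximated with a greedy farthest-first selection, sketched below. This is one possible reading of "as far as possible from one another", not necessarily the procedure the authors have in mind: the class name `FarthestFirstInit` and the decision to start from the first point in the array are assumptions of the example, and `k` is assumed to be no larger than the number of points.

```java
// Hypothetical sketch of farthest-first selection of initial centers.
class FarthestFirstInit {

    // Squared Euclidean distance; sufficient for comparing distances.
    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    // Greedily picks k centers, each as far as possible from those chosen so far.
    static double[][] chooseCenters(double[][] points, int k) {
        int n = points.length;
        double[][] centers = new double[k][];
        // Assumption: the first point serves as the first center.
        centers[0] = points[0].clone();

        // minDist[i] is the squared distance from point i to its nearest chosen center.
        double[] minDist = new double[n];
        for (int i = 0; i < n; i++) {
            minDist[i] = squaredDistance(points[i], centers[0]);
        }

        for (int c = 1; c < k; c++) {
            // The next center is the point furthest from all chosen centers.
            int far = 0;
            for (int i = 1; i < n; i++) {
                if (minDist[i] > minDist[far]) {
                    far = i;
                }
            }
            centers[c] = points[far].clone();
            for (int i = 0; i < n; i++) {
                minDist[i] = Math.min(minDist[i], squaredDistance(points[i], centers[c]));
            }
        }
        return centers;
    }
}
```

The returned array can replace the random starting centroids of a k-means run. Because the selection always chases the most distant point, it tends to pick outliers, which is one reason a sample of the data is sometimes clustered first instead.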