官术网_书友最值得收藏!

Understanding the k-NN algorithm

The k-NN algorithm is arguably one of the simplest machine learning algorithms. The reason for this is that we basically only need to store the training dataset. Then, in order to make a prediction for a new data point, we only need to find the closest data point in the training dataset-its nearest neighbor.

In a nutshell, the k-NN algorithm argues that a data point probably belongs to the same class as its neighbors. Think about it: if our neighbor is a Reds fan, we're probably Reds fans, too; otherwise we would have moved away a long time ago. The same can be said for the Blues.

Of course, some neighborhoods might be a little more complicated. In this case, we would not just consider our closest neighbor (where k=1), but instead our k nearest neighbors. To stick with our example as mentioned earlier, if we were Reds fans, we probably wouldn't move into a neighborhood where the majority of people are Blues fans.

That's all there is to it.

主站蜘蛛池模板: 习水县| 通许县| 嵊州市| 靖远县| 彭泽县| 灯塔市| 来宾市| 金沙县| 蕉岭县| 台东市| 上高县| 阳泉市| 甘洛县| 新竹县| 泗水县| 乳源| 南涧| 叙永县| 高碑店市| 高雄县| 万年县| 马尔康县| 阿拉尔市| 公安县| 洪泽县| 丹东市| 怀安县| 石楼县| 射阳县| 若羌县| 神农架林区| 蓬安县| 济源市| 海口市| 潢川县| 荆门市| 浑源县| 乐安县| 浮梁县| 玉屏| 五莲县|