The curse of dimensionality

The curse of dimensionality refers to a situation where we have a large number of features, often hundreds or thousands, which leads to an extremely large space in which the data is sparse and, consequently, to distance anomalies. For instance, in high dimensions, almost all pairs of points are nearly equally distant from each other; that is, almost all pairwise distances are close to the average distance. Another manifestation of the curse is that any two randomly chosen vectors are almost orthogonal, which means that almost all of the angles between them are close to 90 degrees. In practice, this makes distance measurements far less informative, because they can no longer reliably separate near neighbors from distant points.
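Both effects can be observed with a small simulation. The following is a minimal sketch using only the Python standard library; the function names, the sample sizes, and the choice of uniform and Gaussian random data are assumptions made for illustration, not something prescribed by the text.

```python
import math
import random
from itertools import combinations


def relative_distance_spread(n_points, n_dims, seed=0):
    """Standard deviation of pairwise distances divided by their mean.

    Points are drawn uniformly from the unit hypercube. A value close
    to zero means almost all pairs are about equally far apart.
    """
    rng = random.Random(seed)
    points = [[rng.random() for _ in range(n_dims)] for _ in range(n_points)]
    dists = [math.dist(p, q) for p, q in combinations(points, 2)]
    mean = sum(dists) / len(dists)
    variance = sum((d - mean) ** 2 for d in dists) / len(dists)
    return math.sqrt(variance) / mean


def mean_abs_cosine(n_points, n_dims, seed=0):
    """Average absolute cosine of the angle between random vectors.

    Vectors have independent standard normal components. A value close
    to zero means the vectors are almost orthogonal to each other.
    """
    rng = random.Random(seed)
    vectors = [[rng.gauss(0.0, 1.0) for _ in range(n_dims)]
               for _ in range(n_points)]
    cosines = []
    for u, v in combinations(vectors, 2):
        dot = sum(a * b for a, b in zip(u, v))
        cosines.append(abs(dot) / (math.hypot(*u) * math.hypot(*v)))
    return sum(cosines) / len(cosines)


if __name__ == "__main__":
    for dims in (2, 10, 100, 1000):
        spread = relative_distance_spread(60, dims)
        cosine = mean_abs_cosine(60, dims)
        print(f"dims={dims:5d}  spread={spread:.3f}  mean|cos|={cosine:.3f}")
```

Under these assumptions, both printed quantities should shrink as the number of dimensions grows: the distances bunch up around their mean, and the angles drift toward 90 degrees.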

A cure for the curse of dimensionality might be found in dimensionality reduction techniques, which reduce the number of features; for instance, we can run a feature selection algorithm, such as ReliefF, which keeps a subset of the original features, or a feature extraction algorithm, such as PCA, which builds a smaller set of new features from combinations of the original ones.
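To make the feature extraction option concrete, here is a minimal PCA sketch built on NumPy's singular value decomposition. The function name `pca_reduce`, the synthetic data, and the choice of three components are assumptions for illustration; a production workflow would more likely rely on an established library implementation.

```python
import numpy as np


def pca_reduce(X, n_components):
    """Project the rows of X onto the top principal components.

    Returns the reduced data, the component directions (one per row),
    and the fraction of total variance explained by each kept component.
    """
    X = np.asarray(X, dtype=float)
    centered = X - X.mean(axis=0)  # PCA requires mean-centered features
    # Rows of Vt are the principal directions, ordered by singular value.
    _, singular_values, Vt = np.linalg.svd(centered, full_matrices=False)
    components = Vt[:n_components]
    explained = singular_values ** 2 / np.sum(singular_values ** 2)
    return centered @ components.T, components, explained[:n_components]


# Synthetic example (assumed data): 200 samples with 50 features that
# actually vary along only 3 hidden directions, plus a little noise.
rng = np.random.default_rng(0)
latent = rng.standard_normal((200, 3))
mixing = rng.standard_normal((3, 50))
X = latent @ mixing + 0.01 * rng.standard_normal((200, 50))

Z, components, ratio = pca_reduce(X, n_components=3)
print(Z.shape)      # shape of the reduced data
print(ratio.sum())  # share of variance kept by the 3 components
```

Because the synthetic data was constructed to lie close to a three-dimensional subspace, three components should retain nearly all of the variance here. Real data rarely behaves this neatly, so the explained-variance ratio is worth inspecting before the number of components is fixed.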
