官术网_书友最值得收藏!

Dimensionality reduction

Dimensionality reduction is used to reduce the dimensionality of a dataset. It is really helpful in cases where the problem becomes intractable, when the number of variables increases. By using the term dimensionality, we are referring to the features. One of the basic reduction techniques is feature engineering.

Generally, we have many dimensionality reduction algorithms:

  • Low variance filter: Dropping variables that have low variance, compared to others.
  • High correlation filter: This identifies the variables with high correlation, by using pearson or polychoric, and selects one of them using the Variance Inflation Factor (VIF).
  • Backward feature elimination: This is done by computing the sum of square of error (SSE) after eliminating each variable n times.
  • Linear Discriminant Analysis (LDA): This reduces the number of dimensions, n, from the original to the number of classes?—?1 number of features.
  • Principal Component Analysis (PCA): This is a statistical procedure that transforms variables into a new set of variables (principle components).
主站蜘蛛池模板: 明溪县| 宁安市| 阜城县| 赣州市| 乐陵市| 武汉市| 临海市| 抚顺县| 乐亭县| 林周县| 沈丘县| 永仁县| 涟水县| 宾川县| 广平县| 绍兴市| 普兰县| 翼城县| 寿宁县| 彭阳县| 定南县| 龙南县| 建水县| 于田县| 伽师县| 千阳县| 比如县| 缙云县| 南康市| 上饶市| 三亚市| 瓦房店市| 宜君县| 衡山县| 剑阁县| 大荔县| 桑日县| 陆良县| 青浦区| 北票市| 扬州市|