- MATLAB for Machine Learning
- Giuseppe Ciaburro
- 284字
- 2021-07-02 19:37:36
Dimensionality reduction
Dimensionality reduction is the process of converting a set of data with many variables into data with lesser dimensions but ensuring similar information. It can help improve model accuracy and performance, improve interpretability, and prevent overfitting. The Statistics and Machine Learning Toolbox includes many algorithms and functions for reducing the dimensionality of our datasets. It can be divided into feature selection and feature extraction. Feature selection approaches try to find a subset of the original variables. Feature extraction reduces the dimensionality in the data by transforming data into new features.
As already mentioned, feature selection finds only the subset of measured features (predictor variables) that give the best predictive performance in modeling the data. The Statistics and Machine Learning Toolbox includes many feature selection methods, as follows:
- Stepwise regression: Adds or removes features until there is no improvement in prediction accuracy. Especially suited for linear regression or generalized linear regression algorithms.
- Sequential feature selection: Equivalent to stepwise regression, this can be applied with any supervised learning algorithm.
- Selecting features for classifying high-dimensional data.
- Boosted and bagged decision trees: Calculate the variable's importance from out-of-bag errors.
- Regularization: Remove redundant features by reducing their weights to zero.
Otherwise, feature extraction transforms existing features into new features (predictor variables) where less-descriptive features can be ignored.
The Statistics and Machine Learning Toolbox includes many feature extraction methods, as follows:
- PCA: This can be applied to summarize data in fewer dimensions by projection onto a unique orthogonal basis
- Non-negative matrix factorization: This can be applied when model terms must represent non-negative quantities
- Factor analysis: This can be applied to build explanatory models of data correlations
The following are step-wise regression example charts:

- Python程序設計教程(第2版)
- Learning ROS for Robotics Programming(Second Edition)
- The Android Game Developer's Handbook
- Java編程指南:基礎知識、類庫應用及案例設計
- OpenNI Cookbook
- CouchDB and PHP Web Development Beginner’s Guide
- Mastering Apache Maven 3
- Symfony2 Essentials
- 學習正則表達式
- C編程技巧:117個問題解決方案示例
- Learning Dynamics NAV Patterns
- PHP典型模塊與項目實戰大全
- Node.js Web Development
- HTML5+jQuery Mobile移動應用開發
- Java Web開發任務教程