Chapter 1. Unsupervised Machine Learning

In this chapter, you will learn how to apply unsupervised learning techniques to identify patterns and structure within datasets.

Unsupervised learning techniques are a valuable set of tools for exploratory analysis. They bring out patterns and structure within datasets, yielding information that may be useful in itself or serve as a guide to further analysis. A solid set of unsupervised learning tools helps you break unfamiliar or complex datasets down into actionable information.

We'll begin by reviewing Principal Component Analysis (PCA), a fundamental data manipulation technique with a range of dimensionality reduction applications. Next, we will discuss k-means clustering, a widely used and approachable unsupervised learning technique. Then, we will discuss Kohonen's Self-Organizing Map (SOM), a method of topological clustering that enables the projection of complex datasets into two dimensions.

Throughout the chapter, we will discuss how to apply these techniques effectively to make high-dimensional datasets readily accessible. We will use the UCI Handwritten Digits dataset to demonstrate technical applications of each algorithm. In the course of discussing and applying each technique, we will review practical applications and methodological questions, particularly how to calibrate and validate each technique and which performance measures are valid. In summary, we will cover the following topics, in order:

  • Principal component analysis
  • k-means clustering
  • Self-organizing maps
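
As a quick orientation before the first technique, the following is a minimal sketch of loading the handwritten digits data mentioned above. It assumes scikit-learn is installed and uses its bundled load_digits helper, which ships a copy of the test portion of the UCI optical digits data; the variable names X and y are illustrative choices, not taken from the text.

```python
from sklearn.datasets import load_digits

# Load scikit-learn's bundled copy of the UCI handwritten digits data
# (assumption: this helper stands in for the dataset used in the chapter).
digits = load_digits()

X = digits.data    # one row per image, 8x8 pixels flattened to 64 features
y = digits.target  # digit label for each image (kept only for later validation)

print("samples, features:", X.shape)
print("image grid shape:", digits.images.shape[1:])
print("distinct labels:", sorted(set(y.tolist())))
```

Because every image is described by 64 pixel intensities, the dataset is a convenient example of the high-dimensional data that the techniques in this chapter aim to make accessible. The labels play no part in fitting an unsupervised model, but they can be useful afterwards for checking what structure a technique has found.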