官术网_书友最值得收藏!

Classifying common audio

In the previous sections, we have understood the strategy to perform modeling on a structured dataset and also on unstructured text data.

In this section, we will be learning about performing a classification exercise where the input is raw audio.

The strategy we will be adopting is that we will be extracting features from the input audio, where each audio signal is represented as a vector of a fixed number of features.

There are multiple ways of extracting features from an audio—however, for this exercise, we will be extracting the Mel Frequency Cepstral Coefficients (MFCC) corresponding to the audio file.

Once we extract the features, we shall perform the classification exercise in a way that is very similar to how we built a model for MNIST dataset classification—where we had hidden layers connecting the input and output layers.

In the following section, we will be performing classification on top of an audio dataset where there are ten possible classes of output.

主站蜘蛛池模板: 遂川县| 徐汇区| 明光市| 葵青区| 牙克石市| 黄山市| 汽车| 电白县| 绥阳县| 大足县| 石狮市| 颍上县| 申扎县| 镇赉县| 梧州市| 石景山区| 蕲春县| 东城区| 龙口市| 改则县| 鸡西市| 明光市| 上饶市| 襄汾县| 长丰县| 滁州市| 通化县| 清丰县| 平舆县| 乐陵市| 许昌市| 高淳县| 辽中县| 临泉县| 汉沽区| 措勤县| 孟州市| 安平县| 临朐县| 苏尼特左旗| 旬阳县|