官术网_书友最值得收藏!

Classifying common audio

In the previous sections, we have understood the strategy to perform modeling on a structured dataset and also on unstructured text data.

In this section, we will be learning about performing a classification exercise where the input is raw audio.

The strategy we will be adopting is that we will be extracting features from the input audio, where each audio signal is represented as a vector of a fixed number of features.

There are multiple ways of extracting features from an audio—however, for this exercise, we will be extracting the Mel Frequency Cepstral Coefficients (MFCC) corresponding to the audio file.

Once we extract the features, we shall perform the classification exercise in a way that is very similar to how we built a model for MNIST dataset classification—where we had hidden layers connecting the input and output layers.

In the following section, we will be performing classification on top of an audio dataset where there are ten possible classes of output.

主站蜘蛛池模板: 潼南县| 汉川市| 花莲县| 朝阳市| 壶关县| 昌江| 玉树县| 枣阳市| 平和县| 齐齐哈尔市| 北碚区| 水富县| 肇东市| 湖北省| 安泽县| 甘洛县| 武平县| 师宗县| 开原市| 全椒县| 大竹县| 伊吾县| 邳州市| 祁阳县| SHOW| 锡林郭勒盟| 开平市| 家居| 龙胜| 顺义区| 怀柔区| 安图县| 铜陵市| 化州市| 抚州市| 漯河市| 宜春市| 阳高县| 赣州市| 辽阳市| 金坛市|