官术网_书友最值得收藏!

Classifying common audio

In the previous sections, we have understood the strategy to perform modeling on a structured dataset and also on unstructured text data.

In this section, we will be learning about performing a classification exercise where the input is raw audio.

The strategy we will be adopting is that we will be extracting features from the input audio, where each audio signal is represented as a vector of a fixed number of features.

There are multiple ways of extracting features from an audio—however, for this exercise, we will be extracting the Mel Frequency Cepstral Coefficients (MFCC) corresponding to the audio file.

Once we extract the features, we shall perform the classification exercise in a way that is very similar to how we built a model for MNIST dataset classification—where we had hidden layers connecting the input and output layers.

In the following section, we will be performing classification on top of an audio dataset where there are ten possible classes of output.

主站蜘蛛池模板: 涡阳县| 广元市| 龙陵县| 仙桃市| 万宁市| 威宁| 会同县| 南皮县| 雷州市| 永顺县| 平凉市| 柘荣县| 水富县| 德令哈市| 明溪县| 无锡市| 巫溪县| 渝北区| 章丘市| 嵩明县| 龙山县| 舒城县| 京山县| 牙克石市| 南陵县| 乾安县| 河间市| 闵行区| 大悟县| 七台河市| 穆棱市| 尚义县| 安化县| 清镇市| 桐乡市| 仲巴县| 宜阳县| 襄垣县| 梁山县| 化德县| 宜宾县|