
Classifying common audio

In the previous sections, we looked at how to build models on a structured dataset and on unstructured text data.

In this section, we will learn how to perform classification when the input is raw audio.

Our strategy is to extract features from the input audio so that each audio signal is represented as a vector with a fixed number of features.

There are multiple ways of extracting features from audio; for this exercise, we will extract the Mel Frequency Cepstral Coefficients (MFCCs) of each audio file.
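To make the idea of a fixed-length feature vector concrete, here is a minimal sketch of MFCC extraction written in plain NumPy. It is not the implementation used in the exercise; the function names (`mel_filterbank`, `mfcc_vector`), the frame size, the hop length, and the number of filters and coefficients are all illustrative assumptions. Averaging the coefficients over time is one simple way, assumed here, of turning clips of different lengths into vectors of the same size.

```python
import numpy as np

def hz_to_mel(hz):
    # Common formula for converting frequency in Hz to the mel scale
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    # Triangular filters whose centers are evenly spaced on the mel scale
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def mfcc_vector(signal, sr, n_mfcc=13, n_mels=40, n_fft=512, hop=256):
    # Split the signal into overlapping, windowed frames
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # Power spectrum of each frame: shape (n_frames, n_fft // 2 + 1)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Mel-band energies, then log compression (small floor avoids log(0))
    log_mel = np.log(power @ mel_filterbank(n_mels, n_fft, sr).T + 1e-10)
    # Unnormalized DCT-II over the mel bands gives the cepstral coefficients
    k = np.arange(n_mfcc)[:, None]
    n = np.arange(n_mels)[None, :]
    dct = np.cos(np.pi * k * (2 * n + 1) / (2 * n_mels))
    mfcc = log_mel @ dct.T  # shape (n_frames, n_mfcc)
    # Assumption: average over time to get one fixed-length vector per clip
    return mfcc.mean(axis=0)

# Synthetic stand-ins for a one-second and a two-second clip
rng = np.random.default_rng(0)
sr = 22050
short_clip = rng.standard_normal(sr)
long_clip = rng.standard_normal(2 * sr)
print(mfcc_vector(short_clip, sr).shape, mfcc_vector(long_clip, sr).shape)
```

Both clips yield a vector of `n_mfcc` values regardless of their duration, which is what allows the features to be fed into a network with a fixed input size.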

Once we have extracted the features, we will perform classification in much the same way as we built a model for the MNIST dataset, with hidden layers connecting the input and output layers.
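The shape of such a network can be sketched as a forward pass in plain NumPy. This is only an illustration under assumed settings, not the model built in the exercise: the helper names (`init_mlp`, `predict_proba`), the single hidden layer of 64 units, the 13 input features, and the random, untrained weights are all hypothetical. The ten output units correspond to ten possible classes.

```python
import numpy as np

def init_mlp(n_features, n_hidden, n_classes, seed=0):
    # Randomly initialized weights; a real model would learn these by training
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.standard_normal((n_features, n_hidden)) * np.sqrt(2.0 / n_features),
        "b1": np.zeros(n_hidden),
        "W2": rng.standard_normal((n_hidden, n_classes)) * np.sqrt(1.0 / n_hidden),
        "b2": np.zeros(n_classes),
    }

def predict_proba(params, X):
    # Hidden layer with ReLU activation
    hidden = np.maximum(0.0, X @ params["W1"] + params["b1"])
    logits = hidden @ params["W2"] + params["b2"]
    # Numerically stable softmax over the classes
    logits = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)

# Five synthetic feature vectors standing in for extracted audio features
params = init_mlp(n_features=13, n_hidden=64, n_classes=10)
X = np.random.default_rng(1).standard_normal((5, 13))
probs = predict_proba(params, X)
labels = probs.argmax(axis=1)  # predicted class index for each clip
print(probs.shape, labels)
```

Each row of `probs` sums to 1, and the predicted class is the index of the largest probability. With untrained weights the predictions are arbitrary; the sketch only shows how a fixed-length feature vector flows through a hidden layer to a ten-way output.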

In the following section, we will classify an audio dataset that has ten possible output classes.
