- Neural Networks with Keras Cookbook
- V Kishore Ayyadevara
- 2021-07-02 12:46:33
Classifying common audio
In the previous sections, we learned how to build models on structured datasets and on unstructured text data.
In this section, we will perform a classification exercise where the input is raw audio.
Our strategy is to extract features from the input audio so that each audio signal is represented as a vector with a fixed number of features.
There are multiple ways of extracting features from audio; for this exercise, we will extract the Mel Frequency Cepstral Coefficients (MFCC) corresponding to each audio file.
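To make the idea of a fixed-length feature vector concrete, here is a minimal NumPy-only sketch of MFCC extraction. It is an illustration under stated assumptions, not the recipe's own code: the function names, the 16 kHz sample rate, the frame and hop sizes, the 26 mel filters, the 13 coefficients, and the choice to average coefficients over time are all assumptions made for this example. In practice, an audio library would usually do this work.

```python
import numpy as np

def hz_to_mel(hz):
    # Common mel-scale formula
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose centers are evenly spaced on the mel scale
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):
            fbank[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):
            fbank[i - 1, k] = (right - k) / max(right - center, 1)
    return fbank

def mfcc_vector(signal, sample_rate=16000, frame_len=400, hop=160,
                n_fft=512, n_filters=26, n_mfcc=13):
    # All default values here are assumptions chosen for illustration.
    # 1. Slice the signal into overlapping frames and apply a Hamming window
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    # 2. Power spectrum of each frame
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # 3. Mel filterbank energies, then log compression
    mel_energy = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_mel = np.log(mel_energy + 1e-10)
    # 4. DCT-II over the filterbank axis gives the cepstral coefficients
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_mfcc)[:, None] * (2 * n[None, :] + 1) / (2 * n_filters))
    coeffs = log_mel @ basis.T  # shape: (n_frames, n_mfcc)
    # 5. Average over time so clips of any length map to n_mfcc features
    return coeffs.mean(axis=0)

# Synthetic one-second 440 Hz tone standing in for a real audio clip
t = np.arange(16000) / 16000.0
tone = np.sin(2 * np.pi * 440.0 * t)
features = mfcc_vector(tone)
print(features.shape)
```

Averaging over frames is only one way to obtain a fixed-length vector; it discards temporal ordering, which may or may not matter for a given dataset.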
Once we have extracted the features, we will perform classification in much the same way as we built a model for MNIST classification, with hidden layers connecting the input and output layers.
In the following section, we will classify an audio dataset that has ten possible output classes.
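As a rough sketch of the kind of model described above, the following Keras snippet builds a small fully connected network that maps a fixed-length feature vector to ten class probabilities. The input size of 40 features, the hidden layer widths, and the optimizer and loss choices are assumptions made for illustration; only the ten output classes come from the text.

```python
import numpy as np
from tensorflow import keras

N_FEATURES = 40  # assumed length of the per-clip feature vector
N_CLASSES = 10   # ten possible output classes, as stated in the text

# MNIST-style dense network: input -> hidden layers -> softmax output
model = keras.Sequential([
    keras.Input(shape=(N_FEATURES,)),
    keras.layers.Dense(128, activation="relu"),  # hidden layer (assumed width)
    keras.layers.Dense(64, activation="relu"),   # hidden layer (assumed width)
    keras.layers.Dense(N_CLASSES, activation="softmax"),
])

# Integer class labels pair with sparse categorical cross-entropy
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Dummy batch of two feature vectors, just to show the output shape
probs = model.predict(np.zeros((2, N_FEATURES), dtype="float32"), verbose=0)
print(probs.shape)
```

Each row of `probs` holds one probability per class, so the predicted class for a clip would be the index of the largest value in its row.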