- Neural Networks with Keras Cookbook
- V Kishore Ayyadevara
- 2021-07-02 12:46:33
Classifying common audio
In the previous sections, we built models on structured datasets and on unstructured text data.
In this section, we will perform a classification exercise where the input is raw audio.
Our strategy is to extract features from the input audio so that each audio signal is represented as a vector with a fixed number of features.
There are multiple ways of extracting features from audio; for this exercise, we will extract the Mel Frequency Cepstral Coefficients (MFCCs) of each audio file.
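One way to turn a variable-length clip into a fixed-length vector is to compute its MFCC matrix and average each coefficient over time. The following is a minimal sketch under stated assumptions, not necessarily the exact procedure used later in the book: it assumes the librosa library for loading audio and computing MFCCs, and the helper names `pool_mfcc` and `extract_features`, as well as the default of 40 coefficients, are hypothetical choices made for illustration.

```python
import numpy as np


def pool_mfcc(mfcc):
    """Collapse an (n_mfcc, n_frames) MFCC matrix into a single vector of
    length n_mfcc by averaging each coefficient over all time frames."""
    mfcc = np.asarray(mfcc, dtype=np.float32)
    if mfcc.ndim != 2 or mfcc.shape[1] == 0:
        raise ValueError("expected a non-empty (n_mfcc, n_frames) matrix")
    return mfcc.mean(axis=1)


def extract_features(path, n_mfcc=40):
    """Load an audio file and return a fixed-length MFCC feature vector.

    Assumption: librosa is installed. n_mfcc=40 is an illustrative default.
    """
    # Imported lazily so that pool_mfcc can be used without librosa.
    import librosa

    # sr=None keeps the file's native sampling rate instead of resampling.
    signal, sample_rate = librosa.load(path, sr=None)
    # librosa returns an array of shape (n_mfcc, n_frames).
    mfcc = librosa.feature.mfcc(y=signal, sr=sample_rate, n_mfcc=n_mfcc)
    return pool_mfcc(mfcc)
```

Because the averaging runs over the time axis, a one-second clip and a four-second clip both yield a vector of length `n_mfcc`, which is what allows them to be fed to the same network. Averaging discards temporal ordering, so it is a simple baseline rather than the only option.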
Once we have extracted the features, we will perform the classification in much the same way as we built the model for MNIST classification, where hidden layers connect the input layer to the output layer.
In the following section, we will classify an audio dataset that has ten possible output classes.
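The MNIST-style network described above can be sketched in Keras as follows. This is only an illustration under assumptions: the input length of 40 features, the single hidden layer of 256 units, the optimizer, and the function name `build_classifier` are hypothetical choices, not values taken from the text. Only the ten output classes come from the description above.

```python
from tensorflow import keras


def build_classifier(n_features=40, n_classes=10, hidden_units=256):
    """Build a small fully connected classifier for fixed-length audio features.

    Assumptions: n_features and hidden_units are illustrative defaults.
    """
    model = keras.Sequential([
        # Each audio clip arrives as one fixed-length feature vector.
        keras.Input(shape=(n_features,)),
        # Hidden layer connecting the input to the output layer.
        keras.layers.Dense(hidden_units, activation="relu"),
        # One softmax probability per output class.
        keras.layers.Dense(n_classes, activation="softmax"),
    ])
    # sparse_categorical_crossentropy expects integer class labels (0..n_classes-1).
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

With features stacked into an array `X` of shape `(n_samples, n_features)` and integer labels `y`, training would then be a call along the lines of `build_classifier().fit(X, y, epochs=..., validation_split=...)`, with the actual settings chosen for the dataset at hand.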