- Neural Networks with Keras Cookbook
- V Kishore Ayyadevara
- 169字
- 2021-07-02 12:46:33
Classifying common audio
In the previous sections, we have understood the strategy to perform modeling on a structured dataset and also on unstructured text data.
In this section, we will be learning about performing a classification exercise where the input is raw audio.
The strategy we will be adopting is that we will be extracting features from the input audio, where each audio signal is represented as a vector of a fixed number of features.
There are multiple ways of extracting features from an audio—however, for this exercise, we will be extracting the Mel Frequency Cepstral Coefficients (MFCC) corresponding to the audio file.
Once we extract the features, we shall perform the classification exercise in a way that is very similar to how we built a model for MNIST dataset classification—where we had hidden layers connecting the input and output layers.
In the following section, we will be performing classification on top of an audio dataset where there are ten possible classes of output.
- Boost.Asio C++ Network Programming(Second Edition)
- Visual C++程序設計教程
- 數據庫系統教程(第2版)
- 自己動手實現Lua:虛擬機、編譯器和標準庫
- Ext JS Data-driven Application Design
- 數據庫系統原理及MySQL應用教程
- Mastering LibGDX Game Development
- PostgreSQL 11從入門到精通(視頻教學版)
- Troubleshooting PostgreSQL
- Arduino家居安全系統構建實戰
- 蘋果的產品設計之道:創建優秀產品、服務和用戶體驗的七個原則
- JavaScript動態網頁編程
- SQL Server 2016 從入門到實戰(視頻教學版)
- C編程技巧:117個問題解決方案示例
- Web編程基礎:HTML5、CSS3、JavaScript(第2版)