The main goal of this project is to present a performance comparison across different input features. We will train the same model, based on a convolutional neural network (CNN), with each feature set and survey how the choice of features affects performance.
We will train a separate network for each of the following features and present a comparison:
- Mel spectrogram: Mel-scaled power spectrogram
- MFCC: Mel-Frequency Cepstral Coefficients
- Chroma STFT: Chromagram from a waveform or power spectrogram
- Chroma CQT: Constant-Q chromagram
- Spectral Contrast: Per-band contrast between spectral peaks and valleys
All of the proposed features are spectral; to extract them we will use the Python library librosa.
[Todo] Detail each transformation.
Our CNN model is based on this reference, and it is as follows:
For each stereo audio signal, we will pre-process the left and right channels independently, extracting the features from each. These two feature sets will be the inputs to two independent CNNs, as described in the previous figure; we will then concatenate their outputs and estimate the category through a softmax regression.
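The two-branch pipeline above can be sketched with the Keras functional API. This is an assumption about the framework, and the layer sizes, branch depth, and class count are placeholders, not the architecture from the cited reference:

```python
# Sketch: one small CNN per stereo channel, outputs concatenated,
# followed by a softmax classification head. All sizes are illustrative.
import tensorflow as tf
from tensorflow.keras import layers, Model

def channel_branch(input_shape):
    """Build one per-channel CNN branch over a (n_bins, n_frames, 1) feature map."""
    inp = layers.Input(shape=input_shape)
    x = layers.Conv2D(32, (3, 3), activation="relu", padding="same")(inp)
    x = layers.MaxPooling2D((2, 2))(x)
    x = layers.Conv2D(64, (3, 3), activation="relu", padding="same")(x)
    x = layers.GlobalAveragePooling2D()(x)
    return inp, x

def build_model(input_shape, n_classes):
    """Two independent branches (left/right), concatenated, softmax output."""
    left_in, left_out = channel_branch(input_shape)
    right_in, right_out = channel_branch(input_shape)
    merged = layers.Concatenate()([left_out, right_out])
    out = layers.Dense(n_classes, activation="softmax")(merged)
    return Model(inputs=[left_in, right_in], outputs=out)

# e.g. a 128-bin Mel spectrogram with 44 frames per channel, 10 classes:
model = build_model((128, 44, 1), n_classes=10)
model.summary()
```

The branches share structure but not weights; weight sharing between channels would be an alternative design worth comparing.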