The purpose of these notebooks is to teach myself audio processing from scratch. They are resources that summarize and demonstrate processing, data augmentation, and modeling techniques for audio. They draw heavily on the references cited below, and the toy example is a PyTorch implementation of Seth Adams's video series.
- Audio Processing Techniques: Review and Summary.
- Toy example: Instrument classification.
- Data Visualization (FFT, filter banks, MFCC).
- Pre-processing.
- CNN-modeling.
- RNN-modeling.
Audio Preprocessing
- https://haythamfayek.com/2016/04/21/speech-processing-for-machine-learning.html
- https://www.youtube.com/watch?v=Z7YM-HAz-IY&list=PLhA3b2k8R3t2Ng1WW_7MiXeh1pfQJQi_P&index=1
Data Augmentation
Further reading