Skip to content

dilne/Rainforest-Connection-Species-Audio-Detection

Repository files navigation

🦜Rainforest Connection Species Audio Detection🐸

Notebook

Open In Colab

Summary

A modified ResNet18 model is used to predict the species from 24 types. After trimming the audio samples from the dataset to the correct length to match the species' call and implementing a band pass filter to remove the frequencies outiside the range of the call, the highest validation accuracy achieved was 0.72, but there are certainly ways to improve this model, including introducing other data preprocessing techniques.

The dataset can be found from the official Kaggle page.

Future Steps for Preprocessing the Dataset

  • Pitch shift: Pitch shifting all the audio to have an equal centre frequency could improve generalisation
  • Set dB level: Making the data the same dB level (maybe having equal RMS?), could encourage the model to infer mainly from frequency data

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks