Skip to content

Contrastive language-audio pretraining for bioacoustics

License

Notifications You must be signed in to change notification settings

david-rx/BioLingual

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BioLingual

Contrastive language-audio pretraining for bioacoustics

Paper: Transferable Models for Bioacoustics with Human Language Supervision arxiv

Use the model on HuggingFace! davidrrobinson/BioLingual

BioLingual is a language-audio model for bioacoustics, useful for zero-shot audio classification and sound detection, text-to-audio search, or for fine-tuning on new bioacoustic tasks.

AnimalSpeak is a large-scale language-audio dataset used to train BioLingual, created by captioning bioacoustic archives including Xeno-canto and iNaturalist.

BioLingual.png

To recreate the BEANS benchmarking results from the paper:

pip install -r requirements.txt
cd beans

Follow instructions in beans/README.MD to download the datasets

python run_benchmark.py

The AnimalSpeak dataset is released at https://huggingface.co/datasets/davidrrobinson/AnimalSpeak

Acknowledgements

We express our gratitude to the authors of CLAP and beans, which much of this repository is based on, for making their code open-source.

About

Contrastive language-audio pretraining for bioacoustics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published