Skip to content

Wologman/Anqa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Anqa

Anqa is a data standard for time-frequency annotated wildlife sound files. An example can be found here

The goal for this project is to encourage regional institutions to produce and share strongly labelled regional datasets, to a common standard, enabling better regional models.

Principles

  • Tabular annotation format
  • Metadata first - One row per audio file, including lat, long in web-mercator coordinates and a date-time stamp in ISO 8601. The metadata should follow any files subsequently derived from the source files.
  • Labels are to be in a separate file, with one row per label, with a many to one relationship with the metadata file, matching by relative file name.
  • e-bird labels for birds, defaulting to inaturalist codes where no e-bird label is available
  • Every animal sound must get a time-frequency box. Where the species can not be identified, fall back to a higher taxonomic order. For example insects should use 47158.
  • A naming schema matching the above codes to what ever local scheme is to be used, plus the scientific name
  • An 'unknown' label for any wildlife sound that can not be identified.
  • Modularity - It should be possible to merge any two datasets programatically, whilst keeping the above properties
  • Open Source CC-BY licence, where no licence already exists for a given row-item in the metadata

Whilst open-sourcing the training data, regional institutions should also be encouraged to make careful use of the metadata to create and hold back independent test sets for model calibration.

By creating models that also predict time-frequency boxes in the same format, we enable model calibration, scaling and continuous improvement of the datasets through human-in-loop reviewing.

Motivation

This work has come out of development of the Kaytoo model for the Department of Conservation (New Zealand). The source data for that project came in multiple formats, some could not be used at all, whilst the rest were initially converted to the format used by Xeno-Canto and BirdCLEF. That format labels presence/absence of species for arbitrary length sound crops. The model predictions for BirdCLEF models are likewise multi-label presence-absence for an arbitrary time length (5 seconds)

The Xeno-Canto/BirdCLEF format has a number of shortcomings:

  • Working with model predictions is hard to visualise and inconvenient for deriving meaningful statistical insights for monitoring.

  • It was inefficient trying to deliver any continuous improvement to the training dataset by reviewing the short crops. It is much faster the reviewer to look at a fixed length (1-minute) soundscape with a suitable visualisation tool, and confirm or edit multiple annotations. If short crops are needed for model training they can easily be extracted programatically.

  • The training data is incomparable to model predictions, making it hard to perform any meaningful form of model calibration from data in it's short-crop form.

  • It is hard to build strong models from training on the inherently weak-labelling in the Xeno-Canto data. A large proportion of this data contains false negatives, whilst the training routine has no way to ensure sub-sampling contains sound the expected classes, leading to false positives during training.

A better value-proposition is for regional institutions to use their own experts to create strongly labelled datasets. Then for the rest of time a world-leading model is just a code-fork away.

Proposed Columns

This is still work-in progress, but for now the columns in use are derived from those used in the Raven .selections.txt tables, as well as the BirdCLEF metadata files. Additional columns have been added, for example to identify what reviewing (if any) has taken place, or if detection models were used to assist the labelling.

About

Standard for Wildlife Audio Data Annotation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors