Sound Identification and Labeling Intelligence for Creatures

RedbirdTaiwan/silic
SILIC

Goal

The goal of SILIC is to build an autonomous wildlife sound identification system that helps monitor the population status and trends of terrestrial vocal animals in Taiwan, using data from Passive Acoustic Monitoring (PAM).

Objectives

  • Objective 1: Extract robust species, sound class, time, and frequency information from varied and complex soundscape recordings.
  • Objective 2: The model should be trainable on as small a dataset as possible, using training audio that can be acquired easily and quickly.
  • Objective 3: The model should cover most terrestrial vocal wildlife species in Taiwan, especially those that are hard to detect with survey methods other than PAM.
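
To make the first item in the list above concrete, here is a minimal sketch of how one detected bounding box could be converted into time and frequency information. It assumes that boxes arrive in the normalized (x_center, y_center, width, height) form used by YOLO label files, that the spectrogram image has a linear frequency axis with the highest frequency at the top, and that each image covers one fixed-length clip. The `Detection` class, the `box_to_detection` helper, and every parameter value are hypothetical names chosen for illustration; they are not taken from SILIC's code.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """One labeled sound event (hypothetical record layout)."""
    species: str
    sound_class: str
    time_start: float  # seconds from the start of the recording
    time_end: float
    freq_low: float    # Hz
    freq_high: float   # Hz
    score: float       # detector confidence, 0..1


def box_to_detection(box, species, sound_class, score,
                     clip_offset_s, clip_len_s, fmin_hz, fmax_hz):
    """Map a normalized (xc, yc, w, h) box to time and frequency bounds.

    Assumes a linear frequency axis and that y = 0 is the top of the
    image, i.e. the highest frequency.
    """
    xc, yc, w, h = box
    time_start = clip_offset_s + (xc - w / 2) * clip_len_s
    time_end = clip_offset_s + (xc + w / 2) * clip_len_s
    span = fmax_hz - fmin_hz
    freq_high = fmax_hz - (yc - h / 2) * span  # top edge of the box
    freq_low = fmax_hz - (yc + h / 2) * span   # bottom edge of the box
    return Detection(species, sound_class, time_start, time_end,
                     freq_low, freq_high, score)


# Illustrative values only: a box centered in a 3 s clip that starts
# 10 s into the recording, on a 0 to 16 kHz image.
example = box_to_detection((0.5, 0.5, 0.2, 0.5), "example species", "song",
                           0.9, clip_offset_s=10.0, clip_len_s=3.0,
                           fmin_hz=0.0, fmax_hz=16000.0)
print(example)
```

If the spectrogram uses a non-linear frequency scale, the vertical mapping would need to be replaced by the inverse of that scale.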

Model

SILIC is written in Python and uses the yolov5 package (Glenn Jocher et al., 2020) to construct an object detection model. The pydub (Robert, 2011), nnAudio (Cheuk et al., 2020), and matplotlib (Hunter, 2007) libraries are additionally used for audio signal processing and Time–Frequency Representation (TFR).
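
As an illustration of what a time-frequency representation is, the following sketch computes a small magnitude spectrogram with a plain short-time Fourier transform written against the standard library only. This is a substitute technique for demonstration: it does not use nnAudio, and the function name `stft_magnitude`, the frame length, the hop size, and the test tone are all assumptions made for the example rather than SILIC's actual settings.

```python
import cmath
import math


def stft_magnitude(samples, frame_len=64, hop=32):
    """Return a list of frames; each frame holds the magnitudes of
    frequency bins 0 .. frame_len // 2 (naive DFT, for illustration)."""
    # Periodic Hann window to reduce spectral leakage between bins.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        magnitudes = []
        for k in range(frame_len // 2 + 1):
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            magnitudes.append(abs(acc))
        frames.append(magnitudes)
    return frames


# A synthetic 1 kHz tone sampled at 8 kHz (assumed values).
sample_rate = 8000
tone_hz = 1000
samples = [math.sin(2 * math.pi * tone_hz * n / sample_rate)
           for n in range(512)]

spec = stft_magnitude(samples)
# Strongest bin of the first frame, converted back to Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 64
print(len(spec), "frames,", len(spec[0]), "bins, peak at", peak_hz, "Hz")
```

Rendering such a matrix as an image (time on the horizontal axis, frequency on the vertical axis) is what lets an object detector treat each vocalization as a box with a time extent and a frequency extent.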

Datasets

Install

Tutorials

  • Model Weights:
    • ./model/exp12, including 27 sound classes of 16 species, updated in Apr. 2021
    • ./model/exp14, including 74 sound classes of 52 species, updated in Jul. 2021
    • ./model/exp18, including 194 sound classes of 147 species, updated in Oct. 2021
    • ./model/exp20, including 213 sound classes of 163 species, updated in Dec. 2021
    • ./model/exp24, including 253 sound classes of 169 species, updated in Aug. 2022
    • ./model/exp27, including 279 sound classes of 183 species, updated in May 2023
    • ./model/exp28, including 284 sound classes of 184 species, updated in Jun. 2023
    • ./model/exp29, including 284 sound classes of 184 species, updated in Jul. 2023
    • ./model/exp30, including 302 sound classes of 202 species, updated in Mar. 2024
    • ./model/exp31, including 364 sound classes of 257 species, updated in Jun. 2024
  • Detection script: ./silic.ipynb
  • Demo video of SILIC inference results on a camera trap video:
    • Demo video
  • UI tool: silic-ui.py
    • SILIC-UI
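
Because the model weights listed above live in numbered `exp` directories, a script may want to pick the most recent one automatically. The sketch below is one possible way to do that, assuming only that the directories sit under a common folder and follow the `exp<number>` naming shown in the list. The helper name `latest_experiment` is hypothetical, and nothing is assumed about the files inside each directory.

```python
import re
from pathlib import Path


def latest_experiment(model_root):
    """Return the exp<number> directory with the highest number,
    or None if model_root contains no such directory."""
    pattern = re.compile(r"^exp(\d+)$")
    candidates = []
    for entry in Path(model_root).iterdir():
        match = pattern.match(entry.name)
        if entry.is_dir() and match:
            # Compare numerically so that exp9 sorts before exp12.
            candidates.append((int(match.group(1)), entry))
    if not candidates:
        return None
    return max(candidates, key=lambda item: item[0])[1]


if __name__ == "__main__":
    root = Path("./model")
    if root.is_dir():
        print(latest_experiment(root))
```

Pinning a specific directory instead is the safer choice when results must stay comparable over time, since each release covers a different set of sound classes.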

Audio Sources

Publication

Application
