The goal of SILIC is to build an autonomous wildlife sound identification system which can help to monitoring the population status and trends of terrestrial vocal animals in Taiwan by using the data of Passive Acoustic Monitorings (PAMs).
- Object 1: Extract robust species, sound class, time and frequency information from various and complex soundscape recordings.
- Object 2: Model can be constructed using a dataset as small as possible, and the training audios can be easily and quickly acquired.
- Object 3: Most species of terrestrial vocal wildlife in Taiwan should be included in model, especially those are hard to be detected with survey methods other than PAM.
SILIC uses Python language and yolov5 package (Glenn Jocher et al., 2020) to construct an object detection model. Additional pydub (Robert, 2011), nnAudio (Cheuk et al., 2020) and matplotlib (Hunter, 2007) libraries were imported for audio signal processing and Time–Frequency Representation (TFR).
- Training and validation: ./dataset/Training_Validation_Dataset.txt
- Test with evaluation results: ./dataset/evaluation_testset.csv
- Python >= 3.10.0
- pip install -r requirements.txt
- Download and install FFmpeg from https://www.ffmpeg.org/download.html
- A guide for Windows 10/11 by Ta-Chih Chen SILIC 環境設定 for Window 10 or 11
- Model Weights:
- ./model/exp12 , including 27 sound classes of 16 species, updated on Apr. 2021
- ./model/exp14 , including 74 sound classes of 52 species, updated on Jul. 2021
- ./model/exp18 , including 194 sound classes of 147 species, updated on Oct. 2021
- ./model/exp20 , including 213 sound classes of 163 species, updated on Dec. 2021
- ./model/exp24 , including 253 sound classes of 169 species, updated on Aug. 2022
- ./model/exp27 , including 279 sound classes of 183 species, updated on May. 2023
- ./model/exp28 , including 284 sound classes of 184 species, updated on Jun. 2023
- ./model/exp29 , including 284 sound classes of 184 species, updated on Jul. 2023
- ./model/exp30 , including 302 sound classes of 202 species, updated on Mar. 2024
- ./model/exp31 , including 364 sound classes of 257 species, updated on Jun. 2024
- Scripts of detection: ./silic.ipynb
- Demo video of inference results of the SILIC on detecting an camera trap video:
- UI tool: silic-ui.py
- Macaulay Library
- xeno-canto
- Asian Soundscape
- Thinning Forest Monitoring
- iNaturalist
- Wu, S. H., Ko, J. C. J., Lin, R. S., Chang-Yang, C. H., & Chang, H. W. (2023). Evaluating community-wide temporal sampling in passive acoustic monitoring: A comprehensive study of avian vocal patterns in subtropical montane forests. F1000Research, 12, 1299. https://doi.org/10.12688/f1000research.141951.2
- Wu, S. H., Ko, J. C. J., Lin, R. S., Tsai, W. L., & Chang, H. W. (2023). An acoustic detection dataset of birds (Aves) in montane forests using a deep learning approach. Biodiversity Data Journal, 11, e97811. https://doi.org/10.3897/BDJ.11.e97811
- Wu, S. H., Chang, H. W., Lin, R. S., & Tuanmu, M. N. (2022). SILIC: A cross database framework for automatically extracting robust biodiversity information from soundscape recordings based on object detection and a tiny training dataset. Ecological Informatics, 68, 101534. https://doi.org/10.1016/j.ecoinf.2021.101534
- 吳昭頤(2024)利用自動錄音系統進行水田蛙類鳴叫物候之研究。國立東華大學自然資源與環境學系碩士論文。
- 陳明芫(2023)以人工智慧物種辨識工具探索大安森林公園的鳥類聲景。國立臺灣大學森林環境暨資源學系碩士論文。
- 謝宜樺(2021)兩種共域夜行性鳥類的鳴聲棲位區隔。國立東華大學自然資源與環境學系碩士論文。
- SILIC - Telegram Bot
- Wu S, Tsai W, Ko J C (2022). Acoustic detections of vertebrates using SILIC in Yushan National Park, Taiwan. Taiwan Endemic Species Research Institute. Sampling event dataset https://doi.org/10.15468/bbr9k7 accessed via GBIF.org on 2022-10-10.