Python code for handling the Clotho dataset.
-
Updated
Nov 24, 2020 - Python
Python code for handling the Clotho dataset.
Using pretrained encoder and language models to generate captions from multimedia inputs.
Audio captioning baseline system for DCASE 2020 challenge.
Reading list for research topics in Sound AI
Code base for WaveTransformer: A novel architecture for automated audio captioning
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
Audio Captioning datasets for PyTorch.
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
Song Describer is a data collection platform for annotating music with textual descriptions.
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
Tools for the evaluation of audio captioning.
Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"
Tracking states of the arts and recent results (bibliography) on sound tasks.
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Code for using with the Clotho dataset
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
6-th task solution of DCASE2020
This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
IRIT-UPS DCASE 2021 AUDIO CAPTIONING SYSTEM
Add a description, image, and links to the audio-captioning topic page so that developers can more easily learn about it.
To associate your repository with the audio-captioning topic, visit your repo's landing page and select "manage topics."