audio-captioning

Star

Here are 27 public repositories matching this topic...

audio-captioning / clotho-dataset

Star

Python code for handling the Clotho dataset.

audio natural-language-processing deep-learning audio-signal-processing captioning audio-captioning clotho-dataset

Updated Nov 24, 2020
Python

TheoCoombes / ClipCap

Star

Using pretrained encoder and language models to generate captions from multimedia inputs.

vqa image-captioning language-model encoder-decoder audio-captioning vision-transformer

Updated Mar 11, 2023
Python

audio-captioning / dcase-2020-baseline

Star

Audio captioning baseline system for DCASE 2020 challenge.

machine-learning deep-neural-networks deep-learning signal-processing audio-signal-processing captioning dcase machine-listening audio-captioning dcase2020

Updated Aug 22, 2023
Python

soham97 / awesome-sound_event_detection

Star

Reading list for research topics in Sound AI

representation-learning audio-processing zero-shot-learning icassp sound-event-detection interspeech acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Aug 8, 2024

an-tran528 / wavetransformer

Star

Code base for WaveTransformer: A novel architecture for automated audio captioning

audio-captioning

Updated Mar 1, 2021
Python

ilaria-manco / muscaps

Star

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

music-information-retrieval mir multimodal-deep-learning audio-captioning

Updated Dec 3, 2024
Jupyter Notebook

Labbeti / aac-datasets

Star

Audio Captioning datasets for PyTorch.

audio deep-learning pytorch dataset caption datasets captioning audio-captioning

Updated Nov 4, 2024
Python

lukewys / dcase_2020_T6

Star

2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6

deep-learning audio-captioning

Updated Aug 3, 2023
Python

ilaria-manco / song-describer

Star

Song Describer is a data collection platform for annotating music with textual descriptions.

annotations data-collection audio-captioning music-dataset

Updated Dec 3, 2024
Python

Labbeti / aac-metrics

Star

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

audio metrics text captioning audio-captioning

Updated Jan 20, 2025
Python

audio-captioning / caption-evaluation-tools

Star

Tools for the evaluation of audio captioning.

captioning machine-translation-metrics audio-captioning

Updated May 23, 2020
Jupyter Notebook

minguinho26 / Prefix_AAC_ICASSP2023

Star

Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"

deep-learning pytorch-implementation audio-captioning icassp2023

Updated Dec 6, 2023
Jupyter Notebook

satvik-dixit / mace

Star

Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems

audio clap evaluation-metrics audio-captioning automated-audio-captioning

Updated Jan 16, 2025
Python

slSeanWU / beats-conformer-bart-audio-captioner

Star

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

transformers pytorch audio-captioning clotho-dataset dcase-challenge

Updated Jan 6, 2024
Jupyter Notebook

ExplainableML / ZerAuCap

Star

[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords

audio zero-shot opt audio-captioning clotho-dataset large-language-models neurips-2023 audiocaps

Updated Nov 30, 2024
Python

blmoistawinde / fense

Star

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

benchmark evaluation-metrics audio-captioning audiocaption

Updated Feb 1, 2023
Python

dr-costas / clotho-baseline-dataset

Star

Code for using with the Clotho dataset

audio dataset zenodo machine-listening audio-captioning

Updated Dec 24, 2019
Python

soham97 / sound_ai_progress

Star

Tracking states of the arts and recent results (bibliography) on sound tasks.

audio-processing sound-event-detection music-classification acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Jan 10, 2023

paniquex / Automated_Audio_Captioning_DCASE2020

Star

6-th task solution of DCASE2020

audio gru attention audio-processing mixup audio-captioning

Updated Jun 22, 2022
Python

zelaki / wsac

Star

This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training

clap audio-captioning dcase2023

Updated Jun 12, 2023
Python

Improve this page

Add a description, image, and links to the audio-captioning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-captioning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audio-captioning

Here are 27 public repositories matching this topic...

audio-captioning / clotho-dataset

TheoCoombes / ClipCap

audio-captioning / dcase-2020-baseline

soham97 / awesome-sound_event_detection

an-tran528 / wavetransformer

ilaria-manco / muscaps

Labbeti / aac-datasets

lukewys / dcase_2020_T6

ilaria-manco / song-describer

Labbeti / aac-metrics

audio-captioning / caption-evaluation-tools

minguinho26 / Prefix_AAC_ICASSP2023

satvik-dixit / mace

slSeanWU / beats-conformer-bart-audio-captioner

ExplainableML / ZerAuCap

blmoistawinde / fense

dr-costas / clotho-baseline-dataset

soham97 / sound_ai_progress

paniquex / Automated_Audio_Captioning_DCASE2020

zelaki / wsac

Improve this page

Add this topic to your repo