This repo provides an easy, PyTorch-based way to evaluate generative video-to-audio (V2A) models.
This section walks you through evaluating a generative V2A model. The following steps are required:
First, install the conda environment:
conda env create -f conda_env_cu12.1.yaml
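The name you need for `conda activate` afterwards lives inside the yaml. A minimal stdlib-only helper (our sketch, not part of the repo; it assumes the file has a top-level `name:` line, which conda env yamls conventionally do) can pull it out without requiring PyYAML:

```python
def conda_env_name(yaml_path: str):
    """Return the value of the top-level "name:" line in a conda env yaml,
    or None if no such line exists. Stdlib-only on purpose, since nothing
    may be installed yet."""
    with open(yaml_path) as fh:
        for line in fh:
            if line.startswith("name:"):
                return line.split(":", 1)[1].strip()
    return None
```

For example, `conda activate "$(python -c 'import sys; ...')"` style usage, or simply open the yaml and read the `name:` field yourself.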
In addition, AudioTools (by Descript) is needed for audio processing and the PaSST model for computing metrics. Install them with:
pip install git+https://github.com/descriptinc/audiotools
pip install git+https://github.com/kkoutini/passt_hear21@0.0.19#egg=hear21passt
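A quick way to confirm the installs resolved, without importing anything heavy, is to probe for the packages via the stdlib. This is a sketch: `audiotools` and `hear21passt` are the importable names the pip installs above register, and `torch` is assumed to come from the conda environment.

```python
import importlib.util

def check_deps(modules):
    """Map each module name to whether it is importable in this environment."""
    return {m: importlib.util.find_spec(m) is not None for m in modules}

if __name__ == "__main__":
    for module, ok in check_deps(["torch", "audiotools", "hear21passt"]).items():
        print(f"{module}: {'ok' if ok else 'MISSING'}")
```

Any `MISSING` line means the corresponding install step above did not take effect in the active environment.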
This evaluation pipeline uses the Synchformer model to assess audio-visual synchronization. Download the Synchformer checkpoints by running:
bash ./checkpoints/download_synchformer_checkpoints.sh
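To confirm the download landed, you can list what sits under the checkpoints directory. A sketch, with assumptions: the `./checkpoints` location is taken from the script's path, while the `*.pt` pattern is a guess at the checkpoint extension (adjust it to whatever the script actually saves).

```python
from pathlib import Path

def find_checkpoints(directory, pattern="*.pt"):
    """Return checkpoint files under `directory` matching `pattern`, sorted."""
    return sorted(Path(directory).glob(pattern))

if __name__ == "__main__":
    ckpts = find_checkpoints("./checkpoints")
    if not ckpts:
        print("No checkpoints found; did the download script run?")
    for ckpt in ckpts:
        print(ckpt)
```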
Finally, use run_evaluations.ipynb to run the pipeline; all the required steps are described in the notebook.