AudioSet VGGish in PyTorch

Introduction

This repository includes:

A script which converts the pretrained VGGish model provided in the AudioSet repository from TensorFlow to PyTorch (along with a basic smoke test).
Sourced from: https://github.com/tensorflow/models/tree/master/research/audioset
The VGGish architecture defined in PyTorch.
Adapted from: https://github.com/harritaylor/torchvggish
The converted weights found in the Releases section.

Please note that converted model does not produce exactly the same results as the original model, but should be close in most cases.

Usage

Download the pretrained weights and PCA parameters from the AudioSet repository and place them in the working directory.
Install any dependencies required by AudioSet (e.g., resampy, numpy, TensorFlow, etc.).
Run "convert_to_pytorch.py" to generate the PyTorch formatted weights for the VGGish model or download the weights from the Releases section.

Example Usage

Please refer to the "example_usage.py" script. The output of the script should be as follows.

Input Shape: (3, 1, 96, 64)
Output Shape: (3, 128)
Computed Embedding Mean and Standard Deviation: 0.13079901 0.23851949
Expected Embedding Mean and Standard Deviation: 0.131 0.238
Computed Post-processed Embedding Mean and Standard Deviation: 123.01041666666667 75.51479501722199
Expected Post-processed Embedding Mean and Standard Deviation: 123.0 75.0

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
audioset		audioset
.gitignore		.gitignore
README.md		README.md
convert_to_pytorch.py		convert_to_pytorch.py
example_usage.py		example_usage.py
vggish.py		vggish.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audioset

audioset

.gitignore

.gitignore

README.md

README.md

convert_to_pytorch.py

convert_to_pytorch.py

example_usage.py

example_usage.py

vggish.py

vggish.py

Repository files navigation

AudioSet VGGish in PyTorch

Introduction

Usage

Example Usage

About

Releases

Packages

Languages

tcvrick/audioset-vggish-tensorflow-to-pytorch

Folders and files

Latest commit

History

Repository files navigation

AudioSet VGGish in PyTorch

Introduction

Usage

Example Usage

About

Resources

Stars

Watchers

Forks

Languages