Variational Autoencoders for Audio Generation

This repository contains Python notebooks implementing Variational Autoencoders (VAEs) for audio generation and related probabilistic machine learning models.

Project Overview

This project explores the application of VAEs to audio generation tasks. We implement and compare several VAE architectures and related probabilistic models, focusing on their ability to generate realistic and diverse audio samples.

We worked both with the reconstruction of Mel-Spectogram and and audio data.

src/VAE_Spectogram.py: Simple Variational Auto Encoder to generate Mel-Spectogram
src/RVAE_Spectogram.py: Recurrent Variational Auto Encoder to generate Mel-Spectogram
src/CNN_Audio.py: Simple CNN-1D Variational Auto Encoder to generate audio data with one channel
src/Residual_CNN_Audio.py: Residual CNN-1D Variational Auto Encoder to generate audio data with one channel
src/RVAE_Audio.py: Recurrent Variational Auto Encoder to generate audio data with one channel

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
src		src
PML_Project.ipynb		PML_Project.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Variational Autoencoders for Audio Generation

Project Overview

Contents

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Variational Autoencoders for Audio Generation

Project Overview

Contents

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages