AmbientGAN

Overview

A Generative Model that parses piano notes from MIDI files and uses them as input to generate new note sequences. Most of the input MIDI files are sourced from the ADL Piano MIDI Dataset.

The adversarial network here follows the default architecture of simultaneously training a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

Additionally, a dot-product self-attention layer is employed and applied to the last layers of both the generator and the discriminator.

Req: Python 3, Pytorch 1.10, CUDA 10.2.

Training Loss

Training time on GPU: 8-10 Minutes for 2000 Epochs.

Results

MIDI tempo used for all outputs ranged between 250000 and 500000. Time sig, key sig and any necessary meta messages or lyrics found in MIDI files are pre-defined, and are not part of the model output.

Samples outputs are here.

TODO

Implement S.A Layer.
Fully Unsupervised Setting.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
ambience		ambience
midi		midi
tokenizer		tokenizer
.gitignore		.gitignore
README.md		README.md
ambientnetwork.py		ambientnetwork.py
csv.lsp		csv.lsp
index.html		index.html
main.py		main.py
pre_process.lsp		pre_process.lsp
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AmbientGAN

About

Languages

Ellon-M/AmbientGAN

Folders and files

Latest commit

History

Repository files navigation

AmbientGAN

About

Topics

Resources

Stars

Watchers

Forks

Languages