This forked repository adds support for modelling audio textures. Please see the original official README from NVIDIA for details on licenses and citations.
Note: This version of StyleGAN2 is not compatible with PyTorch > 1.8. I used PyTorch 1.7 for my experiments.
Links to the datasets used in my experiments:
- TokWotel (Wood and Metal hits separated) - https://drive.google.com/file/d/1xjU868UgJBwnkrFEXlJg1S5u-SUNK6ag/view?usp=sharing
- A pre-processed subset of the Greatest Hits dataset - https://drive.google.com/file/d/1U3QRj3GQTlCcLj4BriSaWd3JYIP5sE4W/view?usp=sharing (original here)
Please use the notebook called pghi-test.ipynb to visualise the spectrogram representations.
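For a quick sanity check outside the notebook, the sketch below plots a plain STFT log-magnitude spectrogram of one clip with librosa. This is not the PGHI representation that pghi-test.ipynb produces, and the file path and STFT parameters are placeholders.

```python
import librosa
import matplotlib.pyplot as plt
import numpy as np

# Plain STFT log-magnitude view of one training clip (not the PGHI pipeline).
# File path and n_fft/hop_length are placeholders.
y, sr = librosa.load("datasets/tokwotel/example.wav", sr=None)
S = np.abs(librosa.stft(y, n_fft=512, hop_length=128))
S_db = np.clip(librosa.amplitude_to_db(S, ref=np.max), -50, 0)  # clip to the [-50, 0] dB range

plt.figure(figsize=(6, 4))
plt.imshow(S_db, origin="lower", aspect="auto", cmap="magma")
plt.colorbar(label="dB")
plt.xlabel("Frames")
plt.ylabel("Frequency bins")
plt.show()
```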
To train new networks, use the commands below. Note that the dataset directory should contain the '*.wav' files directly, with no sub-directory structure. All my experiments used unconditional training; for conditional training you will also need a dataset.json file, as explained in the original NVIDIA README (see the sketch after the training commands below).
The flag --aug=noaug is important: the augmentations (rotation, etc.) used in the computer-vision domain do not work for learning audio spectrograms.
```
python train.py --outdir=training-runs --data=datasets/tokwotel --gpus=1 --aug=noaug --dry-run
python train.py --outdir=training-runs --data=datasets/tokwotel --gpus=1 --aug=noaug

python train.py --outdir=training-runs --data=datasets/vis-data-256-split --gpus=1 --aug=noaug --dry-run
python train.py --outdir=training-runs --data=datasets/vis-data-256-split --gpus=1 --aug=noaug
```
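As noted above, conditional training additionally needs a dataset.json. The sketch below builds one in the {"labels": [[filename, label], ...]} layout used by the original StyleGAN2-ADA code; the wood/metal class-assignment rule is purely illustrative and not part of this repository.

```python
import json
from pathlib import Path

# Hypothetical example: map each .wav file to an integer class label in the
# {"labels": [[filename, label], ...]} format from the original StyleGAN2-ADA
# README. The rule below (0 = wood, 1 = metal) is only an illustration.
dataset_dir = Path("datasets/tokwotel")
labels = []
for wav in sorted(dataset_dir.glob("*.wav")):
    label = 0 if "wood" in wav.stem.lower() else 1  # illustrative rule only
    labels.append([wav.name, label])

with open(dataset_dir / "dataset.json", "w") as f:
    json.dump({"labels": labels}, f)
```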
We use the PGHI (Phase Gradient Heap Integration) method to generate spectrograms. StyleGAN architectures for audio learn spectrogram representations as images, so the log-magnitude spectrograms need to be rescaled from [-50, 0] to [0, 255]. For this, please use the notebook generate-rescaled-final.ipynb.
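The rescaling itself is a simple linear map. A minimal sketch is shown below, assuming `log_spec` is a 2-D NumPy array of log magnitudes already clipped to [-50, 0] (random placeholder data is used here); the notebook performs the equivalent step as part of the full pipeline.

```python
import numpy as np
from PIL import Image

# Map a [-50, 0] log-magnitude spectrogram to an 8-bit [0, 255] image.
log_spec = np.random.uniform(-50, 0, size=(256, 256))  # placeholder data

img = (log_spec + 50.0) / 50.0 * 255.0   # [-50, 0] -> [0, 255]
img = img.astype(np.uint8)
Image.fromarray(img).save("spectrogram.png")  # saved as an 8-bit grayscale image
```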