padster / AudioStyle Public

Notifications You must be signed in to change notification settings
Fork 0
Star 1

UBC 540 Project: Style transfer for Audio

1 star 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
docs		docs
images		images
paper		paper
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
analysis.py		analysis.py
audioUtils.py		audioUtils.py
imageTransfer.py		imageTransfer.py
imgUtils.py		imgUtils.py
losses.py		losses.py
mfcc.py		mfcc.py
readme.txt		readme.txt
run.py		run.py
vggnet.py		vggnet.py
viz.py		viz.py

Repository files navigation

AudioStyle

UBC CPSC 540 Project: Style transfer for Audio

For details, see the paper in this repository: https://github.com/padster/AudioStyle/blob/master/paper/CPSC540_FinalReport.pdf

Code uses the following implementation of the Neural Style algorithm, in Lasagne/Theano: https://github.com/Lasagne/Recipes/blob/master/examples/styletransfer/Art%20Style%20Transfer.ipynb

Audio processing logic (spectrogram & mfcc) used from: https://timsainb.github.io/spectrograms-mfccs-and-inversion-in-python.html

To test yourself: python run.py

Where the available flags are:

--cpu (whether to run theano on the CPU, default is GPU)
--spec (whether to transfer the spectrogram, default is MFCC)
--rowac (whether to include loss for row autocorrelation)
--colac (whether to include loss for column autocorrelation)

About

UBC 540 Project: Style transfer for Audio

Report repository

Releases

No releases published

Packages

No packages published

Languages