Skip to content
FFTNet vocoder implementation
Branch: master
Clone or download
Latest commit e2a1737 Jul 24, 2018
Type Name Latest commit message Commit time
Failed to load latest commit information.
LICENSE.txt Update Jul 24, 2018
TODO.txt update TODO.txt Jul 5, 2018
check-dataloader.ipynb Notebook update Jun 27, 2018
conf_test_train.json Comment progbar since it has no use on snakepit Jul 9, 2018
requirements.txt Optimize FFTNet inference Jul 3, 2018 EMA model averaging Jun 28, 2018

Unofficial Implementation of FFTNet vocode paper.

  • implement the model.
  • implement tests.
  • overfit on a single batch (sanity check).
  • linearize weights for eval time.
  • measure the run-time on GPU and CPU. (1 sec audio takes ~47 secs) If anyone knows additional tricks from the paper, let me know. So far I asked the authors but nobody returned.
  • train on LJSpeech spectrograms.
  • distill model as in Parallel WaveNet paper.
You can’t perform that action at this time.