This repository has been archived by the owner on Jan 11, 2022. It is now read-only.

Adding PyTorch wrapper for NV-Wavenet #7

Merged
merged 2 commits into NVIDIA:master on May 18, 2018

Conversation

RPrenger

No description provided.

@PetrochukM
Contributor

PetrochukM commented May 18, 2018

This is awesome. I've reviewed the README for the pull request, and I have a couple of questions:

  • embedding_prev and embedding_cur: are these used for global conditioning via embeddings, such as speaker embeddings? Why are there separate previous and current embeddings?
  • cond_input is where we would locally condition on the mel-spectrogram, right?
  • Does this PyTorch module allow for training, or just inference? If it does not allow for training, what is the recommended method for training?
  • For the initial 2x1 causal convolution, where can we set the weight matrix?

@rafaelvalle Reposted after: NVIDIA/tacotron2#3 (comment)

@RPrenger
Author

RPrenger commented May 18, 2018

Hi @PetrochukM.

  1. The embedding_prev and embedding_cur are used on the audio, not on the conditioning inputs. Right now, nv-wavenet only works with a one-hot representation of audio, so there's an embedding matrix at the beginning. The reason there's a prev and a curr is that the DeepVoice WaveNet implementation had a causal convolution at the beginning with kernel size = 2. The curr embedding is for the current audio sample, and the prev embedding is for the audio sample before it. I used a similar convention for the dilated causal convolutions later. If your WaveNet just uses one embedding, you can set embedding_prev to all zeros and it'll have no effect on the output (we actually do this with our network).

  2. cond_input is a little more complicated than just the mels or features. The nv-wavenet code only does the auto-regressive part of the inference; all the computation that can be done in a non-auto-regressive way (in parallel) is done beforehand. So all the input preprocessing and upsampling are done beforehand, as are the convolutions applied to the upsampled features (which are potentially different for each layer). The cond_input is therefore a very large (2R x batch_size x num_layers x samples) tensor that must be calculated before inference can run. However, because this calculation can be done in parallel across time, it's much faster than the part of inference nv-wavenet is doing.

  3. Right now this is just a wrapper for the nv-wavenet code, so inference only. We're working on open-sourcing our WaveNet training code, which will include code for translating itself to the nv-wavenet wrapper (this is what we used for nv_wavenet_test.py). But the code to translate to the wrapper isn't complicated. If your WaveNet fits the constraints of nv-wavenet, it's just a matter of feeding your tensors into the NVWaveNet constructor. The nv_wavenet_test.py example code might help (it's very short). Translation was just a matter of saving the tensors and parameters in a dictionary with the right keys.

  4. The initial 2x1 causal convolution is set with the embedding_prev and embedding_curr inputs (see answer 1). Because nv-wavenet only works with one-hot representations of audio, an initial convolution can be written as an embedding. If you're not using a one-hot representation, the nv-wavenet code won't work yet.
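To illustrate the point in answer 1: on one-hot audio an embedding lookup is just a row selection, so zeroing embedding_prev removes its contribution entirely. A minimal plain-Python sketch (toy sizes; the names here are illustrative, not the wrapper's actual API):

```python
# Sketch: the initial "convolution" over one-hot audio is two embedding
# lookups whose results are summed. Zeroing the prev table makes the
# output depend only on the curr sample.

def lookup(table, index):
    """A one-hot vector times a matrix is just a row lookup."""
    return table[index]

A = 4          # toy number of audio quantization levels
D = 3          # toy embedding dimension
embedding_curr = [[float(i * D + j) for j in range(D)] for i in range(A)]
embedding_prev = [[0.0] * D for _ in range(A)]   # all zeros: no effect

x_prev, x_curr = 2, 1   # toy quantized audio samples
out = [p + c for p, c in zip(lookup(embedding_prev, x_prev),
                             lookup(embedding_curr, x_curr))]
# With embedding_prev zeroed, out equals embedding_curr[x_curr].
```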
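The precomputed conditioning tensor from answer 2 can be sketched as follows. All sizes are toy values, and the per-layer convolution is replaced by a dummy scaling; the point is only the (2R x batch_size x num_layers x samples) layout, computed once and in parallel over time:

```python
# Sketch (illustrative, not the real preprocessing code): build the
# per-layer conditioning tensor ahead of the auto-regressive loop.

R = 2              # toy residual-channel count (real networks use e.g. 64)
batch_size = 1
num_layers = 3
samples = 5

# stand-in for upsampled conditioning features, one value per time step
upsampled = [[float(t) for t in range(samples)] for _ in range(batch_size)]

cond_input = [[[[upsampled[b][t] * (layer + 1)   # stand-in for a per-layer conv
                 for t in range(samples)]
                for layer in range(num_layers)]
               for b in range(batch_size)]
              for _ in range(2 * R)]
# Shape: (2R, batch_size, num_layers, samples); every entry is independent
# across time, so this whole tensor can be computed in parallel up front.
```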
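Answer 3's "dictionary with the right keys" idea might look roughly like this. The key names below are hypothetical placeholders; check nv_wavenet_test.py for the actual keys the wrapper expects:

```python
# Sketch: collect trained tensors under known keys before handing them
# to the wrapper. Key names and the helper are hypothetical.

def export_for_nv_wavenet(model_tensors):
    """Gather required tensors into a dict, failing loudly if any are missing."""
    required = ["embedding_prev", "embedding_curr", "conv_out_weight"]
    missing = [k for k in required if k not in model_tensors]
    if missing:
        raise KeyError(f"missing tensors: {missing}")
    return {k: model_tensors[k] for k in required}

toy = {
    "embedding_prev": [[0.0, 0.0]],   # toy stand-ins for real tensors
    "embedding_curr": [[1.0, 2.0]],
    "conv_out_weight": [[0.5]],
}
checkpoint = export_for_nv_wavenet(toy)
# checkpoint would then be fed to the NVWaveNet constructor.
```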
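Answer 4's claim, that a kernel-size-2 convolution on one-hot input is equivalent to two embedding lookups, can be checked with a small plain-Python sketch (toy shapes; names and the tap-to-table mapping are illustrative):

```python
# Sketch: a 2x1 causal conv weight W of shape (out_ch, in_levels, 2),
# split along the kernel axis, yields a "prev" and a "curr" embedding table.

A, out_ch = 3, 2   # toy quantization levels and output channels
# W[o][i][k]: weight from input level i at kernel tap k to output channel o
W = [[[float(o * 10 + i * 2 + k) for k in range(2)] for i in range(A)]
     for o in range(out_ch)]

def conv_on_one_hot(W, x_prev, x_curr):
    # One-hot input selects a single weight per tap, so the convolution
    # reduces to picking the (x_prev, tap 0) and (x_curr, tap 1) weights.
    return [W[o][x_prev][0] + W[o][x_curr][1] for o in range(len(W))]

# The same computation expressed as two embedding tables:
embedding_prev = [[W[o][i][0] for o in range(out_ch)] for i in range(A)]
embedding_curr = [[W[o][i][1] for o in range(out_ch)] for i in range(A)]

x_prev, x_curr = 2, 0
via_conv = conv_on_one_hot(W, x_prev, x_curr)
via_embed = [p + c for p, c in zip(embedding_prev[x_prev],
                                   embedding_curr[x_curr])]
# via_conv and via_embed are identical: the conv IS the pair of embeddings.
```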

@maozhiqiang

maozhiqiang commented May 18, 2018

Hi @RPrenger, when I run python pytorch/nv_wavenet_test.py I get an error:

GPUassert: invalid device function ../nv_wavenet_util.cuh 48

@BrianPharris BrianPharris merged commit cc364ca into NVIDIA:master May 18, 2018
@PetrochukM
Contributor

Hi @RPrenger,

Similarly, I get an error:

$ python3.6 nv-wavenet/pytorch/nv_wavenet_test.py
GPUassert: invalid device function ../nv_wavenet_util.cuh 48

4 participants