Questions about model structure #3

npuichigo · 2018-08-29T05:57:48Z

I used TensorBoard to inspect your model structure and found that the pb model you provided uses just one softmax with 256 outputs (8 bits).

However, the paper uses two separate DNNs to predict the coarse and fine parts of a sample. Is that because your model reuses the matrices O1 and O3 (O2 and O4), or do you only support 8 bits with mu-law compression?
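To make the two alternatives in the question concrete, here is a minimal sketch of both sample representations. It assumes the single-softmax case means standard 8-bit mu-law companding (one class in 0..255), and that the coarse/fine case means splitting a signed 16-bit sample into its high and low 8 bits (two classes in 0..255 each). The function names are hypothetical and are not taken from this repository.

```python
import math

MU = 255  # 8-bit mu-law: 256 quantization levels (assumed setting)

def mulaw_encode(x):
    """Map a sample in [-1, 1] to a single class index in [0, 255]."""
    x = max(-1.0, min(1.0, x))
    y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
    return int((y + 1) / 2 * MU + 0.5)

def mulaw_decode(q):
    """Map a class index in [0, 255] back to a sample in [-1, 1]."""
    y = 2 * q / MU - 1
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)

def split_coarse_fine(s):
    """Split a signed 16-bit sample into two 8-bit classes (coarse, fine)."""
    u = s + 2 ** 15  # shift to the unsigned range [0, 65535]
    return u // 256, u % 256

def join_coarse_fine(coarse, fine):
    """Inverse of split_coarse_fine."""
    return coarse * 256 + fine - 2 ** 15

if __name__ == "__main__":
    print(mulaw_encode(0.0))          # one 256-way class per sample
    print(split_coarse_fine(12345))   # two 256-way classes per sample
```

With mu-law, one 256-way softmax per sample is enough, at the cost of lossy quantization. With the coarse/fine split, two 256-way predictions per sample recover the full 16-bit value exactly.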

MlWoo · 2018-10-07T12:36:41Z

@npuichigo I have also rewritten the training code based on the graph. It works and the audio sounds good, but the waveform is not different from the target except that the synthesized audio as a whole is delayed relative to it.
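One way to quantify a delay like this is to find the lag that maximizes the cross-correlation between the target and the synthesized waveform. The sketch below is an illustration only: `estimate_delay` and `max_lag` are hypothetical names, the correlation is unnormalized, and both signals are assumed to share the same sample rate.

```python
def estimate_delay(target, synthesized, max_lag):
    """Return the lag (in samples) that best aligns synthesized with target.

    A positive result means the synthesized signal lags behind the target.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, t in enumerate(target):
            j = i + lag
            if 0 <= j < len(synthesized):
                score += t * synthesized[j]  # unnormalized cross-correlation
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

if __name__ == "__main__":
    target = [0, 0, 1, 2, 3, 2, 1, 0, 0, 0, 0, 0]
    synthesized = [0, 0, 0] + target[:-3]  # same waveform, shifted by 3 samples
    print(estimate_delay(target, synthesized, max_lag=5))
```

If the estimated lag is constant across utterances, shifting the synthesized audio by that many samples before comparing it with the target should make the two waveforms line up.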
