Questions about model structure #3

Open
npuichigo opened this issue Aug 29, 2018 · 1 comment
Comments

@npuichigo

I used TensorBoard to inspect your model structure and found that the .pb model you provided uses just one softmax with 256 outputs (8 bits).

image

However, the paper uses two separate DNNs to predict the coarse and fine parts of a sample. Is that because your model reuses the matrices O1 and O3 (O2 and O4), or because you only support 8 bits with mu-law compression?

image
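
To make the two options in the question concrete, here is a minimal sketch of both 8-bit representations. It is not taken from this repository; the function names are hypothetical, and it assumes signed 16-bit PCM input for the coarse/fine split and float audio in [-1, 1] for mu-law. The coarse/fine split yields two 256-way targets per sample (two softmaxes), whereas mu-law yields a single 256-way target (one softmax).

```python
import math


def split_coarse_fine(sample):
    """Split one signed 16-bit sample into (coarse, fine) 8-bit classes.

    coarse is the high byte and fine is the low byte of the sample
    after shifting it into the unsigned range [0, 65535].
    """
    unsigned = sample + 32768
    return unsigned // 256, unsigned % 256


def join_coarse_fine(coarse, fine):
    """Inverse of split_coarse_fine: rebuild the signed 16-bit sample."""
    return coarse * 256 + fine - 32768


def mu_law_encode(x, mu=255):
    """Map a float sample in [-1, 1] to a single class in [0, mu]."""
    x = max(-1.0, min(1.0, x))  # clip to the valid range
    # Standard mu-law companding: sign(x) * ln(1 + mu*|x|) / ln(1 + mu)
    compressed = math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)
    # Rescale [-1, 1] to [0, mu] and round to the nearest integer class
    return int(math.floor((compressed + 1.0) / 2.0 * mu + 0.5))


if __name__ == "__main__":
    for s in (-32768, 0, 12345, 32767):
        coarse, fine = split_coarse_fine(s)
        print(s, "->", (coarse, fine), "->", join_coarse_fine(coarse, fine))
    for x in (-1.0, 0.0, 1.0):
        print(x, "->", mu_law_encode(x))
```

The split is lossless for 16-bit audio, while mu-law quantization is lossy, so a graph with a single 256-way softmax is consistent with the mu-law reading but cannot by itself reproduce full 16-bit output.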

@MlWoo

MlWoo commented Oct 7, 2018

@npuichigo I have also rewritten the training code based on the graph. It works and the audio sounds good, but the waveform is not different from the target; the synthesised audio as a whole is delayed relative to the target.
