Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fast speech mel normalization #35

Closed
superhg2012 opened this issue Jun 8, 2020 · 6 comments
Closed

fast speech mel normalization #35

superhg2012 opened this issue Jun 8, 2020 · 6 comments
Assignees
Labels
question ❓ Further information is requested
Projects

Comments

@superhg2012
Copy link

when compute mean and scaler for mel-spectrogram before normization, mean and scaler are computed from all dataset and only the first frame mel?

mel = mel[0].numpy()

@dathudeptrai
Copy link
Collaborator

dathudeptrai commented Jun 8, 2020

@superhg2012 mean, var for all TRAINING SET. shape mel = [1, len, 80] so i use mel[0]

@superhg2012
Copy link
Author

All right, I get it, thanks! By the way replace relu activation in ffn layer of FFT block with mish activation function really works for me.

@dathudeptrai
Copy link
Collaborator

@superhg2012 Fastspeech without duration from teacher model is comming :))) let see

@superhg2012
Copy link
Author

@superhg2012 Fastspeech without duration from teacher model is comming :))) let see

any paper reference?

@dathudeptrai dathudeptrai self-assigned this Jun 8, 2020
@dathudeptrai dathudeptrai added the question ❓ Further information is requested label Jun 8, 2020
@dathudeptrai dathudeptrai added this to Done in FastSpeech Jun 8, 2020
@dathudeptrai
Copy link
Collaborator

@superhg2012 flow-tts, glow-tts

@dathudeptrai
Copy link
Collaborator

@superhg2012 i just added fastspeech samples, can you take a look and see if it's good or not :D. https://dathudeptrai.github.io/TensorflowTTS/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question ❓ Further information is requested
Projects
FastSpeech
  
Done
Development

No branches or pull requests

2 participants