FastSpeech mel normalization #35

superhg2012 · 2020-06-08T06:52:13Z

When computing the mean and scaler for the mel-spectrogram before normalization, are they computed from the whole dataset, or only from the first frame of each mel?

mel = mel[0].numpy()


dathudeptrai · 2020-06-08T06:54:57Z

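@superhg2012 The mean and variance are computed over the entire TRAINING SET. Each mel has shape [1, len, 80], so I use mel[0] to drop the leading dimension and get [len, 80]; it does not select the first frame.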
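To make the shape point concrete, here is a minimal NumPy sketch of per-bin statistics accumulated over every frame of every training utterance. The function names `compute_mel_stats` and `normalize` are hypothetical and are not taken from the repository; the sketch only assumes each mel arrives with shape [1, len, n_mels].

```python
import numpy as np

def compute_mel_stats(mels):
    """Per-bin mean and std over all frames of all utterances.

    `mels` is an iterable of arrays shaped [1, len, n_mels]
    (assumption: a leading dimension of size 1, as in this thread).
    """
    total = None      # running sum per mel bin
    total_sq = None   # running sum of squares per mel bin
    n_frames = 0
    for mel in mels:
        # mel[0] drops the leading dimension: [1, len, n_mels] -> [len, n_mels]
        mel = np.asarray(mel, dtype=np.float64)[0]
        if total is None:
            total = np.zeros(mel.shape[1])
            total_sq = np.zeros(mel.shape[1])
        total += mel.sum(axis=0)
        total_sq += (mel ** 2).sum(axis=0)
        n_frames += mel.shape[0]
    mean = total / n_frames
    var = total_sq / n_frames - mean ** 2
    std = np.sqrt(np.maximum(var, 1e-12))  # guard against tiny negative values
    return mean, std

def normalize(mel, mean, std):
    """Normalize a [len, n_mels] mel with the training-set statistics."""
    return (mel - mean) / std
```

Every frame of each utterance contributes to the sums, so the result has one mean and one standard deviation per mel bin (80 values each for the shape above).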

superhg2012 · 2020-06-08T07:00:22Z

All right, I get it, thanks! By the way, replacing the ReLU activation in the FFN layer of the FFT block with the Mish activation function really works for me.
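For reference, Mish is defined as x * tanh(softplus(x)). The following is a small NumPy sketch of the function itself, not the code used in the model; the name `mish` is chosen here for illustration.

```python
import numpy as np

def mish(x):
    """Mish activation: x * tanh(softplus(x))."""
    x = np.asarray(x, dtype=np.float64)
    # logaddexp(0, x) is a numerically stable log(1 + exp(x))
    softplus = np.logaddexp(0.0, x)
    return x * np.tanh(softplus)
```

Unlike ReLU, Mish is smooth and lets small negative values through instead of clipping them to zero.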

dathudeptrai · 2020-06-08T07:02:08Z

@superhg2012 FastSpeech without durations from a teacher model is coming :))) Let's see.

superhg2012 · 2020-06-08T07:34:50Z

> @superhg2012 FastSpeech without durations from a teacher model is coming :))) Let's see.

Is there a paper reference?

dathudeptrai · 2020-06-09T00:11:12Z

@superhg2012 Flow-TTS and Glow-TTS.

dathudeptrai · 2020-06-10T05:43:21Z

@superhg2012 I just added FastSpeech samples. Can you take a look and see whether they sound good or not? :D https://dathudeptrai.github.io/TensorflowTTS/

dathudeptrai self-assigned this Jun 8, 2020

dathudeptrai added the question ❓ Further information is requested label Jun 8, 2020

dathudeptrai added this to Done in FastSpeech Jun 8, 2020

dathudeptrai closed this as completed Jun 8, 2020
