End-to-end speech synthesis demo #3

r9y9 · 2017-08-01T09:51:04Z

End-to-end speech synthesis should be considered in the design.

r9y9 · 2017-09-07T07:58:21Z

Seems like my Tacotron implementation started working. Learned alignment and predicted spectrogram at step 69000 attached for the record.

r9y9 · 2017-09-12T07:32:29Z

Still trying various network configurations (attention type, input mask, attention memory mask, zero-padding for embedding, etc.). The alignment, spectrogram, and wav file generated by greedy decoding at global step 69000 are attached.

Text: Generative adversarial network or variational auto-encoder.
test.wav.zip
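
For readers unfamiliar with the "attention memory mask" mentioned above, here is a minimal sketch of the general idea: attention scores for zero-padded encoder steps are set to negative infinity before the softmax, so padding receives zero attention weight. This is not taken from tacotron_pytorch; the function name `masked_softmax` and its parameters are hypothetical, and plain Python lists stand in for tensors.

```python
import math


def masked_softmax(scores, memory_length):
    """Softmax over attention scores that ignores zero-padded encoder steps.

    scores: list of floats, one per (padded) encoder time step.
    memory_length: number of real (non-padded) steps at the front of the list.
    Both names are hypothetical and used only for illustration.
    """
    if not 1 <= memory_length <= len(scores):
        raise ValueError("memory_length must be in [1, len(scores)]")
    # Padded positions get -inf so that exp() maps them to exactly 0.
    masked = [s if i < memory_length else -math.inf
              for i, s in enumerate(scores)]
    peak = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical example: 3 real encoder steps followed by 2 padding steps.
# The padding steps have the largest raw scores but must get zero weight.
weights = masked_softmax([0.5, 1.0, 0.2, 3.0, 3.0], memory_length=3)
```

Without the mask, the two padded steps in this example would dominate the attention distribution, which is one reason masking can matter when batching utterances of different lengths.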

r9y9 · 2017-09-15T04:52:37Z

A few more generated speech samples: http://nbviewer.jupyter.org/gist/r9y9/4182248424b39cb7352b93124cbbea98

r9y9 · 2017-09-15T12:33:17Z

Code: https://github.com/r9y9/tacotron_pytorch

r9y9 · 2017-10-09T12:25:16Z

I regard this as a documentation issue. Added a link to the docs (a6e46a6).

r9y9 added this to the 0.2.0 release milestone Aug 1, 2017

r9y9 mentioned this issue Aug 3, 2017: Demonstration (tutorial) notebooks #5 (Closed, 7 tasks)

r9y9 modified the milestones: 0.1.0, 0.2.0 Aug 22, 2017

r9y9 mentioned this issue Sep 20, 2017: Improved support for labels #33 (Closed)

r9y9 closed this as completed Oct 9, 2017
