TTS recipe #67

jimbozhang · 2020-09-01T06:52:39Z

I'm not sure if it useful to let lhotse support TTS training. If needed, I can make a TTS recipe using the LJSpeech corpus.
@danpovey @pzelasko

danpovey · 2020-09-01T07:55:44Z

That sounds like a great idea! Yes, I intend for it to support all kinds of speech-related tasks.

…

On Tue, Sep 1, 2020 at 2:52 PM Junbo Zhang ***@***.***> wrote: I'm not sure if it useful to let lhotse support TTS training. If needed, I can make a TTS recipe using the LJSpeech <https://keithito.com/LJ-Speech-Dataset/> corpus. @danpovey <https://github.com/danpovey> @pzelasko <https://github.com/pzelasko> — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#67>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAZFLO6M6P5F2PDHCW2PS2TSDSK4JANCNFSM4QRLERMQ> .

pzelasko · 2020-09-01T12:27:56Z

Yes, it will be most useful. Thanks!

jimbozhang · 2020-09-09T13:58:47Z

Just started working on this. A new recipe ljspeech will be added for end-to-end (e.g. Tacotron2) training. I'll make a PR next week.

pzelasko · 2020-09-09T14:08:09Z

Sounds great! A heads up - I made a couple of general changes in the meanwhile, e.g. the preferred manifest format is now JSON because it’s much faster to read/write.

…

On Sep 9, 2020, at 09:59, Junbo Zhang ***@***.***> wrote: Just started working on this. A new recipe ljspeech will be added for end-to-end (e.g. Tacotron2) training. I'll make a PR next week. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#67 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADZRKQBOF4CUW4D3KADCTNDSE6C2VANCNFSM4QRLERMQ>.

jimbozhang · 2020-09-09T14:28:41Z

Sounds great! A heads up - I made a couple of general changes in the meanwhile, e.g. the preferred manifest format is now JSON because it’s much faster to read/write.

I had a little worry: if the dataset is extremely huge, the JSON would be very large. To avoid out-of-memory, the manifest has to be splitted. I think splitting YAML is easier than JSON, because if you want to split a JSON, you have to load it into memory first.

danpovey · 2020-09-09T14:50:17Z

We had also discussed JSONL format (JSON, one element per line). It will take quite a big dataset to hit that memory limit though.

…

On Wed, Sep 9, 2020 at 10:28 PM Junbo Zhang ***@***.***> wrote: Sounds great! A heads up - I made a couple of general changes in the meanwhile, e.g. the preferred manifest format is now JSON because it’s much faster to read/write. I had a little worry: if the dataset is extremely huge, the JSON would be very large. To avoid out-of-memory, the manifest has to be splitted. I think splitting YAML is easier than JSON, because if you want to split a JSON, you have to load it into memory first. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#67 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAZFLO4BVH64RAQJQWYARODSE6GKVANCNFSM4QRLERMQ> .

pzelasko · 2020-09-09T15:00:39Z

Hmm, you're raising a good point. We can add JSONL quite easily - we're already supporting two formats anyway...

I wonder if it makes sense to use a binary format for such super-large datasets. E.g. in https://github.com/huggingface/nlp/ they use Apache Arrow to traverse mmap-ed files, allowing them to iterate very large datasets with almost zero memory footprint (likely at the cost of storage size).

danpovey · 2020-09-09T15:25:58Z

Lets solve that only once it becomes a problem.

…

On Wed, Sep 9, 2020 at 11:00 PM Piotr Żelasko ***@***.***> wrote: Hmm, you're raising a good point. We can add JSONL quite easily - we're already supporting two formats anyway... I wonder if it makes sense to use a binary format for such super-large datasets. E.g. in https://github.com/huggingface/nlp/ they use Apache Arrow to traverse mmap-ed files, allowing them to iterate very large datasets with almost zero memory footprint (likely at the cost of storage size). — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#67 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAZFLOZACYMLDBKYPYFV7ETSE6KCVANCNFSM4QRLERMQ> .

jimbozhang mentioned this issue Sep 20, 2020

a TTS recipe using the LJ-Speech dataset #81

Merged

jimbozhang mentioned this issue Sep 29, 2020

Full Librispeech recipe #89

Closed

pzelasko closed this as completed Oct 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TTS recipe #67

TTS recipe #67

jimbozhang commented Sep 1, 2020

danpovey commented Sep 1, 2020 via email

pzelasko commented Sep 1, 2020

jimbozhang commented Sep 9, 2020

pzelasko commented Sep 9, 2020 via email

jimbozhang commented Sep 9, 2020

danpovey commented Sep 9, 2020 via email

pzelasko commented Sep 9, 2020

danpovey commented Sep 9, 2020 via email

TTS recipe #67

TTS recipe #67

Comments

jimbozhang commented Sep 1, 2020

danpovey commented Sep 1, 2020 via email

pzelasko commented Sep 1, 2020

jimbozhang commented Sep 9, 2020

pzelasko commented Sep 9, 2020 via email

jimbozhang commented Sep 9, 2020

danpovey commented Sep 9, 2020 via email

pzelasko commented Sep 9, 2020

danpovey commented Sep 9, 2020 via email