New Language #23

gaziway · 2017-10-23T15:00:11Z

First of all, thank you for releasing the code.
I would like to know how difficult it would be to train on a speaker's data in a new language such as Turkish. As far as I saw, the generation step needs some kind of pronunciation dictionary. But what about the pre-processing steps, Merlin and the other tools? Are they language-agnostic? Thank you in advance.

adampolyak · 2017-10-29T10:00:17Z

Yes, it is possible.

Preparing new data requires two steps:

1. Extract phonemes. This is possible with https://github.com/bootphon/phonemizer. Its documentation suggests that Turkish is supported.
2. Extract acoustic features. These features are language-agnostic. You can extract them using this script; just update the relevant paths to the tools directory downloaded in this repo.
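As a rough illustration of the first step, here is a minimal sketch of calling phonemizer from Python. It assumes the phonemizer package and an espeak backend are installed, and that espeak exposes Turkish under the language code "tr". The helper names `phonemize_text` and `split_phonemes` and the choice of separators are hypothetical; they are not part of this repo.

```python
from typing import List


def phonemize_text(text: str, language: str = "tr") -> str:
    """Return phonemes for `text`: phones separated by spaces, words by ' | '."""
    # Imported lazily: needs the phonemizer package and an espeak backend.
    from phonemizer import phonemize
    from phonemizer.separator import Separator

    return phonemize(
        text,
        language=language,  # assumption: "tr" selects Turkish in espeak
        backend="espeak",
        separator=Separator(phone=" ", word=" | ", syllable=""),
        strip=True,
    )


def split_phonemes(phonemized: str) -> List[List[str]]:
    """Split a phonemized string into one list of phones per word."""
    return [word.split() for word in phonemized.split("|") if word.strip()]
```

`split_phonemes(phonemize_text("merhaba dünya"))` would then give one list of phone symbols per word, which can be flattened or encoded as needed.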

gaziway · 2017-10-31T20:56:05Z

I am trying to process your answer, for which I thank you.
Let me summarize what I understand, so that you can correct me if needed.
Preparing new data should be done by extract_feats.py, which accepts folders of txt and wav files as input.
Hence the next natural question is how one should combine the steps you proposed with extract_feats.py.
One alternative is:

1. Replace the content of the original text files with the phoneme codes produced by the phonemizer tool.
2. For extracting acoustic features, try to combine the code from your second point with extract_feats.py.

adampolyak · 2017-11-12T13:33:07Z

You can try to run extract_feats.py as usual and then simply update the generated npz:

import numpy as np

save_dict = dict(np.load(npz_path))
# phoneme_features is a placeholder: put the phonemizer output for this utterance here
save_dict['text_features'] = np.array(phoneme_features)
np.savez_compressed(new_npz_path, **save_dict)
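The same update can be wrapped in a small helper. This is only a sketch: the thread does not say how `text_features` must be encoded, so the integer-id vocabulary built by `build_vocab` is a hypothetical encoding, and both function names are made up for illustration. Only the `text_features` key comes from the snippet above.

```python
import numpy as np


def build_vocab(utterances):
    """Assign an integer id to each distinct phoneme symbol (hypothetical encoding)."""
    symbols = sorted({p for utt in utterances for p in utt})
    return {sym: idx for idx, sym in enumerate(symbols)}


def replace_text_features(npz_path, new_npz_path, phoneme_ids):
    """Copy an npz archive, overwriting only its 'text_features' entry."""
    with np.load(npz_path) as data:
        save_dict = dict(data)  # read every stored array into memory
    save_dict["text_features"] = np.asarray(phoneme_ids, dtype=np.int64)
    np.savez_compressed(new_npz_path, **save_dict)
```

All other arrays written by extract_feats.py are carried over unchanged, so only the text side of each utterance is replaced.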

alex73 · 2019-01-09T14:03:09Z

I have my own voice dataset in the LJSpeech layout, with metadata.csv and wavs/*.wav files.
I am also able to convert the text into phonemes via phonemizer.
I ran this script and it created bap/*.bap, lf0/*.lf0 and mgc/*.mgc files for each of my wav files.

But what should my next step be for training?

adampolyak closed this as completed Nov 27, 2017

adampolyak mentioned this issue Mar 22, 2018

From English into Chinese? #38

Closed
