
How to train a new WaveGlow model for a different language? #189

Closed

EuphoriaCelestial opened this issue Apr 9, 2020 · 17 comments

Comments
@EuphoriaCelestial

As the title says, I would like to know how to train a new model using another dataset, which has the same structure as the LJ Speech dataset. What modifications need to be made for a different language?

@lqniunjunlper

Just prepare your language-specific wav files for training.
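For reference, the repo's train/test filelists are plain text files with one wav path per line. A minimal sketch (the helper name and the 5% split fraction are my own, not from the repo) that builds such filelists from a folder of recordings:

```python
import random
from pathlib import Path

def make_filelists(wav_dir, train_list="train_files.txt",
                   test_list="test_files.txt", test_fraction=0.05, seed=0):
    """Split the .wav files under wav_dir into train/test filelists,
    one path per line, which is the format WaveGlow's data loader reads."""
    wavs = sorted(str(p) for p in Path(wav_dir).glob("*.wav"))
    random.Random(seed).shuffle(wavs)  # deterministic shuffle before splitting
    n_test = max(1, int(len(wavs) * test_fraction))
    Path(test_list).write_text("\n".join(wavs[:n_test]) + "\n")
    Path(train_list).write_text("\n".join(wavs[n_test:]) + "\n")
    return len(wavs) - n_test, n_test
```

Point `"training_files"` and `"test_files"` in config.json at the two output files.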

@Adizbek

Adizbek commented Nov 11, 2020

@EuphoriaCelestial have you succeeded in training your own model for your own language?

@EuphoriaCelestial
Author

> @EuphoriaCelestial have you succeeded in training your own model for your own language?

Yes, I have successfully trained my model, but there are some errors in the output audio; I am still working on it.

@Adizbek

Adizbek commented Nov 12, 2020

@EuphoriaCelestial thanks for the response. Can you tell me how you achieved this? Is there any wiki or documentation to follow?

@EuphoriaCelestial
Author

> @EuphoriaCelestial thanks for the response. Can you tell me how you achieved this? Is there any wiki or documentation to follow?

Actually, I just followed the steps in the README file; it's pretty simple if your audio matches the default sample rate, bit rate, ...
I had trained a Tacotron 2 model before, so my dataset was already pre-processed.
You can try training your own model with the LJ Speech dataset first to understand the workflow, then try with your own dataset.
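As a quick sanity check on that "matches the default sample rate" point, here is a stdlib-only sketch (the helper name and defaults are my own; the stock config.json expects 22050 Hz, 16-bit mono PCM) to flag mismatched files before training:

```python
import wave

def check_wav(path, expected_rate=22050, expected_width=2):
    """Return a list of mismatches between a PCM wav file and the
    expected format; an empty list means the file matches."""
    problems = []
    with wave.open(path, "rb") as w:
        if w.getframerate() != expected_rate:
            problems.append(f"sample rate {w.getframerate()} Hz != {expected_rate} Hz")
        if w.getsampwidth() != expected_width:
            problems.append(f"{8 * w.getsampwidth()}-bit samples != {8 * expected_width}-bit")
        if w.getnchannels() != 1:
            problems.append(f"{w.getnchannels()} channels != mono")
    return problems
```

Run it over every path in your filelists and resample anything it reports before starting a run.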

@Adizbek

Adizbek commented Nov 15, 2020

> Actually, I just followed the steps in the README file; it's pretty simple if your audio matches the default sample rate, bit rate, ...
> I had trained a Tacotron 2 model before, so my dataset was already pre-processed.
> You can try training your own model with the LJ Speech dataset first to understand the workflow, then try with your own dataset.

Finally, I installed the project successfully with all its dependencies, and I've synthesized a voice with a pre-trained model.

But I found out that training a model is pretty time-consuming. I have a 9th-gen Core i7, 16 GB of RAM, and a GPU with 6 GB of VRAM; I understand that is rather weak hardware for training. What do you recommend for training a model? Any free cloud solutions?

Can you tell how long it takes to train a model, for instance, on your hardware?

Thanks

@EuphoriaCelestial
Author

> But I found out that training a model is pretty time-consuming. I have a 9th-gen Core i7, 16 GB of RAM, and a GPU with 6 GB of VRAM; I understand that is rather weak hardware for training. What do you recommend for training a model? Any free cloud solutions?
>
> Can you tell how long it takes to train a model, for instance, on your hardware?

Yeah, it will take a long time. I have an RTX 2080 Ti with 11 GB of VRAM and it takes a few days. I have not tried any cloud solutions yet, so I can't give any advice.

@Ctibor67

Unable to run train.py:

```
File "train.py", line 188, in <module>
    train(num_gpus, args.rank, args.group_name, **train_config)
File "train.py", line 90, in train
    optimizer)
File "train.py", line 45, in load_checkpoint
    optimizer.load_state_dict(checkpoint_dict['optimizer'])
File "C:\Python37\lib\site-packages\torch\optim\optimizer.py", line 124, in load_state_dict
    raise ValueError("loaded state dict contains a parameter group "
ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group
```

Can you help me, please?

@CookiePPP

CookiePPP commented Nov 30, 2020

Just comment out that line (the `optimizer.load_state_dict` call); you can add it back the next time you generate a checkpoint.
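A sketch of that workaround, assuming `load_checkpoint` in train.py looks roughly like the traceback suggests and that the checkpoint stores a state dict under `'model'` (the exact body in your copy may differ):

```python
import torch

def load_checkpoint(checkpoint_path, model, optimizer):
    """Load a WaveGlow training checkpoint, skipping the optimizer state.

    Restoring optimizer state from a checkpoint whose parameter groups
    don't match the current optimizer raises the ValueError above, so
    that call is commented out; restore it once you are resuming from
    your own checkpoints."""
    checkpoint_dict = torch.load(checkpoint_path, map_location="cpu")
    iteration = checkpoint_dict["iteration"]
    # optimizer.load_state_dict(checkpoint_dict["optimizer"])  # <- the failing line
    model.load_state_dict(checkpoint_dict["model"])
    return model, optimizer, iteration
```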

@Adizbek

Adizbek commented Nov 30, 2020

@EuphoriaCelestial, I've installed it successfully. How many hours of audio (at minimum) are needed to generate correctly pronounced audio?

@EuphoriaCelestial
Author

> @EuphoriaCelestial, I've installed it successfully. How many hours of audio (at minimum) are needed to generate correctly pronounced audio?

I don't know the minimum; I haven't tested that because it would take a lot of time and effort.
I have 19 hours of train + test audio in total.

@Ctibor67

Ctibor67 commented Dec 2, 2020

And one more question: I changed

```
"checkpoint_path": "checkpoints/waveglow_256channels_universal_v5.pt"
```

to

```
"checkpoint_path": "checkpoints/waveglow_80000.pt"
```

and started

```
python train.py -c config.json
```

but the iteration starts again from 1. How do I start the iteration from 80000?

@EuphoriaCelestial
Author

> ... but the iteration starts again from 1. How do I start the iteration from 80000?

Why is your checkpoint saved with .pt?
Maybe you exported a model from the checkpoint, in which case it will start from iteration 0; that's normal.
My checkpoints during training have no extension.
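For what it's worth, a sketch of the distinction (the helper name is my own): train.py resumes from the `'iteration'` value stored inside a training checkpoint dict, while a model exported for inference has no such field, so training starts over from 0:

```python
import torch

def resume_point(checkpoint_path):
    """Return the iteration a run would resume from, assuming train.py
    reads an 'iteration' key out of the checkpoint dict; a file exported
    for inference lacks that key, so training restarts at 0."""
    ckpt = torch.load(checkpoint_path, map_location="cpu")
    if isinstance(ckpt, dict) and "iteration" in ckpt:
        return ckpt["iteration"]
    return 0
```

Inspecting your waveglow_80000.pt this way would show whether it still carries the training state.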

@Ctibor67

Ctibor67 commented Dec 4, 2020

I thought it should have the .pt suffix, so I renamed it. How else do I create a .pt file from a checkpoint?
How do I proceed then? I tried copying my waveglow_150000 into the Tacotron2 folder, ran inference.ipynb, and swapped waveglow_256channels_universal_v5.pt for my waveglow_150000, but instead of speech there is only a crackle.

@EuphoriaCelestial
Author

> I thought it should have the .pt suffix, so I renamed it. How else do I create a .pt file from a checkpoint?

The inference code can load the checkpoint directly, so I don't have to export a .pt file from the checkpoint.

> How do I proceed then? I tried copying my waveglow_150000 into the Tacotron2 folder, ran inference.ipynb, and swapped waveglow_256channels_universal_v5.pt for my waveglow_150000, but instead of speech there is only a crackle.

It should work if you rename it to waveglow_150000.pt; that's weird.

@Ctibor67

Ctibor67 commented Dec 5, 2020

When I train WaveGlow, this message appears with each new epoch:

```
Epoch: 7
C:\myprojects\tacotron2\waveglow\mel2samp.py:57: UserWarning: The given NumPy array is not writeable, and PyTorch does not support non-writeable tensors. This means you can write to the underlying (supposedly non-writeable) NumPy array using the tensor. You may want to copy the array to protect its data or make it writeable before converting it to a tensor. This type of warning will be suppressed for the rest of this program. (Triggered internally at ..\torch\csrc\utils\tensor_numpy.cpp:141.)
  return torch.from_numpy(data).float(), sampling_rate
```

Is this OK, or is this error causing WaveGlow not to work?
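That UserWarning is generally harmless (PyTorch is just noting the array is read-only), but if you want to silence it, the usual fix is to copy the array before wrapping it. A sketch of the affected return statement in mel2samp.py, rewritten with a copy (the function name here is illustrative):

```python
import numpy as np
import torch

def audio_to_tensor(data, sampling_rate):
    """Variant of the mel2samp.py return statement that copies the
    (possibly non-writeable) NumPy array first, silencing the warning."""
    return torch.from_numpy(np.copy(data)).float(), sampling_rate
```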

@EuphoriaCelestial
Author

I've never encountered this problem before; maybe you should create a new issue so the maintainers can help.
