How to use pretrain.model for continuing training? #8

Closed

youyou098888 opened this issue Nov 10, 2021 · 8 comments

Comments

@youyou098888

I want to add some Chinese audio to the training data.

Can I use your pretrain.model and continue training with my data,

or do I have to download all of the VoxCeleb1 data plus my data and train from the beginning?

Thank you for your reply.

@TaoRuijie
Owner

TaoRuijie commented Nov 10, 2021

I think both are OK.

You can use my pretrained model and continue training on the Chinese audio; it will be faster than training from random initialization. I guess you just want to fine-tune, which means the number of Chinese utterances is much smaller than VoxCeleb2, so you need to start with a small learning rate.

You can also download VoxCeleb2 and train on everything together. However, if your Chinese dataset is much smaller than VoxCeleb2, I do not suggest doing that.
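For illustration, a minimal PyTorch sketch of the fine-tuning setup described above (the model here is a generic stand-in, not this repo's actual training code; `pretrain.model` is the checkpoint name used in this thread):

```python
# Hypothetical sketch: fine-tune from a pretrained checkpoint with a reduced
# learning rate instead of training from random initialization.
import torch
import torch.nn as nn

# Generic stand-in model; in practice this would be the same architecture
# that produced pretrain.model.
model = nn.Sequential(nn.Linear(80, 512), nn.ReLU(), nn.Linear(512, 192))

# Start from the pretrained weights rather than random initialization.
state = torch.load("pretrain.model", map_location="cpu")
model.load_state_dict(state, strict=False)  # strict=False ignores keys that do not match

# Use a learning rate well below the from-scratch value (e.g. 1e-4 instead of
# 1e-3) so fine-tuning on the smaller Chinese set does not wipe out the
# pretrained weights.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
```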

@youyou098888
Author

The Chinese audio is almost the same size as VoxCeleb2.
In that case, can I use the pretrained model?

@TaoRuijie
Owner

I think you can use the pretrained model, reduce the initial learning rate, and then train only on your data.

You can run experiments to compare. Here is my understanding:

  1. Train from scratch on your Chinese data only.
  2. Train from the pretrained model, then train on your Chinese data only.
  3. Train from scratch on your data and VoxCeleb2 together.

I guess 2 and 3 might give similar results, which might be better than 1.
2 trains much faster than 1; 3 needs quite a long time.

That is my understanding; you can run experiments to verify it.

@youyou098888
Author

Thank you so much, that is very clear.
Another question: are the MUSAN and RIR datasets required in all three experiments?

@TaoRuijie
Owner

Those are used for augmentation.

You can add them or not in all of the experiments: adding them can make the results better,

and removing them makes training faster.
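For reference, a minimal sketch of what MUSAN/RIR augmentation does (a hypothetical helper, not the repo's actual data loader): additive noise at a random SNR plus convolution with a room impulse response, switchable with a flag.

```python
# Hypothetical sketch: optional MUSAN noise + RIR reverberation augmentation
# for a mono waveform (numpy array). File lists are assumed to be prepared
# elsewhere.
import random
import numpy as np
import scipy.signal
import soundfile as sf

def augment(waveform, musan_files, rir_files, use_aug=True):
    """Randomly add a MUSAN noise clip and convolve with a room impulse
    response. Set use_aug=False to skip augmentation (faster training)."""
    if not use_aug:
        return waveform

    # Additive noise at a random SNR between 5 and 20 dB.
    noise, _ = sf.read(random.choice(musan_files))
    noise = np.resize(noise, waveform.shape)
    snr_db = random.uniform(5, 20)
    noise_power = np.mean(noise ** 2) + 1e-8
    signal_power = np.mean(waveform ** 2) + 1e-8
    scale = np.sqrt(signal_power / (noise_power * 10 ** (snr_db / 10)))
    waveform = waveform + scale * noise

    # Reverberation: convolve with a normalized random room impulse response.
    rir, _ = sf.read(random.choice(rir_files))
    rir = rir / (np.sqrt(np.sum(rir ** 2)) + 1e-8)
    waveform = scipy.signal.convolve(waveform, rir, mode="full")[: len(waveform)]
    return waveform
```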

@youyou098888
Author

Got it~

@wwyl2000

Thanks for the information about continued training.
I have a question about it: after training a general model on X speakers, we want to adapt or fine-tune it for N new speakers. We have a dataset for those N speakers (X >>> N, and none of the N speakers is included in X). The trained model has X classes. When fine-tuning with the N new speakers, how should the number of classes be set, and what is the relationship between the X speakers and the N new speakers? During fine-tuning, should the number of classes be N (overwriting the original X speakers) or X + N (appending the N new speakers to the original X)?

For a given N, say N = 10, what value of X would be suitable for acceptable performance? Does the size of X influence the model size much?

Thanks!
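For readers weighing these options, here is a minimal PyTorch sketch of the "N classes, new head" variant (hypothetical names, not the author's answer or this repo's API): the classification head is only used during training, so its width can be set to N independently of X, while the pretrained encoder weights are kept.

```python
# Hypothetical sketch of the "re-initialize the head with N classes" option.
# SpeakerEncoder is a stand-in for the pretrained embedding extractor; only
# its weights are taken from the X-speaker checkpoint, and the old X-way
# classifier is discarded.
import torch
import torch.nn as nn

class SpeakerEncoder(nn.Module):
    def __init__(self, embed_dim=192):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(80, 512), nn.ReLU(), nn.Linear(512, embed_dim))

    def forward(self, x):
        return self.net(x)

N = 10  # number of new speakers to fine-tune on
encoder = SpeakerEncoder()

# Keep only the checkpoint weights whose names and shapes match the encoder;
# the X-class classifier weights in the checkpoint are simply ignored.
state = torch.load("pretrain.model", map_location="cpu")
own = encoder.state_dict()
encoder.load_state_dict(
    {k: v for k, v in state.items() if k in own and v.shape == own[k].shape},
    strict=False,
)

# Fresh N-way classification head: its size depends on N, not on X, and it is
# only needed during training (verification scores the embeddings themselves).
classifier = nn.Linear(192, N)
```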
