
Does fine-tuned model use the same vocabulary as pre-trained model? #5

Closed

zhangguanqun opened this issue Dec 22, 2020 · 5 comments

@zhangguanqun

The pre-trained model uses a learned joint vocabulary with 64867 tokens, built with 32k BPE merge operations; that is the file vocab.bpe.32000.
Does the fine-tuned model (e.g. en2de) released on icloud use the same vocabulary file as the pre-trained model?

@linzehui
Owner

yes

@zhangguanqun
Author

Appreciated, thanks.

@zhangguanqun
Author

If I want to fine-tune this model to support new languages, should the new tokens be added to the existing file?
That is, would the new file have a vocabulary size larger than 64867?
If so, the embedding parameters of the checkpoint released in this project (e.g. pretrain_checkpoint_last_RAS.pt) would have to be expanded before fine-tuning.
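For reference, this is how I would check the embedding shapes stored in the released checkpoint (just a sketch; the parameter names follow fairseq's usual conventions and are an assumption on my side, not confirmed here):

```python
# Sketch: inspect the vocabulary dimension of the released checkpoint.
# Parameter names follow fairseq conventions and are an assumption here.
import torch

ckpt = torch.load("pretrain_checkpoint_last_RAS.pt", map_location="cpu")
state = ckpt["model"]

for name, tensor in state.items():
    if "embed_tokens" in name or "output_projection" in name:
        # The first dimension should match the dictionary size built from
        # vocab.bpe.32000 (plus fairseq's special tokens).
        print(name, tuple(tensor.shape))
```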

@PANXiao1994
Collaborator

If I want to fine-tune this model to support new languages, should the new tokens be added to the existing file?
That is, would the new file have a vocabulary size larger than 64867?
If so, the embedding parameters of the checkpoint released in this project (e.g. pretrain_checkpoint_last_RAS.pt) would have to be expanded before fine-tuning.

Yes, you are right. If you want to expand the set of supported languages, you need to merge the newly added tokens into the existing vocabulary and then randomly initialize the embedding vectors of the new tokens.
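Something like the following sketch, assuming a fairseq-style checkpoint where the token embeddings live under names containing embed_tokens (the parameter names, the new vocabulary size, and the output file name are placeholders to adapt, not values taken from this project):

```python
# Sketch: grow the embedding matrices of a fairseq-style checkpoint so the
# pre-trained rows are kept and the rows for newly merged tokens are
# randomly initialized. Names and sizes below are illustrative assumptions.
import torch

OLD_CKPT = "pretrain_checkpoint_last_RAS.pt"
NEW_CKPT = "pretrain_checkpoint_last_RAS_expanded.pt"
NEW_VOCAB_SIZE = 70000  # hypothetical size after merging new-language tokens

ckpt = torch.load(OLD_CKPT, map_location="cpu")
state = ckpt["model"]

for name in list(state.keys()):
    if "embed_tokens" in name or "output_projection" in name:
        old = state[name]
        old_vocab, dim = old.shape
        if NEW_VOCAB_SIZE <= old_vocab:
            continue
        # Keep the pre-trained rows; randomly initialize the appended rows
        # (roughly the scale fairseq uses for token embeddings: N(0, dim^-0.5)).
        new = old.new_empty(NEW_VOCAB_SIZE, dim)
        torch.nn.init.normal_(new, mean=0.0, std=dim ** -0.5)
        new[:old_vocab] = old
        state[name] = new

torch.save(ckpt, NEW_CKPT)
```

The expanded dictionary file should list the old tokens first, in their original order, so that the copied embedding rows keep their indices.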

@zhangguanqun
Author

Appreciated again, thanks.
