
Training from scratch in another language #11

Closed
peregilk opened this issue Nov 19, 2019 · 1 comment

Comments

@peregilk

I want to train ALBERT from scratch in a non-English language. I have access to a corpus of 1-2 billion words. Would that be sufficient?

Would training on a single Cloud TPU v3 with 128 GB of RAM be feasible? Can you give an estimated training time for the base, large, and xlarge models?

@kamalkraj
Owner

@peregilk
I haven't run pretraining myself, so I don't know how much time it would take.
This codebase only supports GPU and CPU. Use the original authors' implementation for TPU-based training, then convert the weights to TF 2.0 for further fine-tuning tasks.
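
For the conversion step, here is a minimal sketch of reading the variables out of a TF1-style ALBERT checkpoint with `tf.train.load_checkpoint`, which is the usual starting point before mapping them onto a TF 2.0 model. The checkpoint path is a placeholder, and this is only an illustration of the idea, not this repo's actual conversion script:

```python
# Minimal sketch: inspect the variables in a TF1-style ALBERT checkpoint
# before mapping them into a TF 2.0 model. The checkpoint path below is a
# placeholder, not a file shipped with this repo.
import tensorflow as tf

def read_tf1_checkpoint(checkpoint_path):
    """Return a dict of {variable_name: numpy_array} from a TF1 checkpoint."""
    reader = tf.train.load_checkpoint(checkpoint_path)
    return {
        name: reader.get_tensor(name)
        for name in reader.get_variable_to_shape_map()
    }

if __name__ == "__main__":
    weights = read_tf1_checkpoint("albert_base/model.ckpt-best")  # placeholder
    for name, array in sorted(weights.items()):
        print(name, array.shape)
```

Printing the variable names and shapes this way is how you work out the name-to-layer mapping that any TF1-to-TF2 weight converter needs.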
