jbarrow/bert_from_scratch

bert_from_scratch

This repository contains the scripts and code needed to train BERT from scratch on a Google Cloud TPU.

Installation

TODO: Dockerize.

Running

bash ./scripts/prepare_training_data.sh [BUCKET NAME] [TRAIN FILE NAME]
bash ./scripts/train_from_scratch.sh [BUCKET NAME] [TPU NAME]
bash ./scripts/convert.sh [BUCKET NAME]
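
The three commands above are meant to be run in order, so it can help to chain them in one place. The following is a minimal sketch, not part of the repository: `run_pipeline` and the `RUNNER` variable are hypothetical names introduced here for illustration, and the argument values in the usage note are placeholders. It assumes you run it from the repository root and that each script takes exactly the arguments shown above. Whether the bucket name should carry a `gs://` prefix is not stated in this README, so check the scripts before running.

```shell
# Hypothetical helper (not part of the repository): runs the three README
# steps in order and stops at the first step that fails.

# RUNNER defaults to bash. Set RUNNER=echo for a dry run that only prints
# each script path and its arguments without executing anything.
RUNNER="${RUNNER:-bash}"

run_pipeline() {
  # Arguments mirror the README placeholders:
  #   $1 = [BUCKET NAME], $2 = [TRAIN FILE NAME], $3 = [TPU NAME]
  local bucket="$1" train_file="$2" tpu="$3"

  # Each step runs only if the previous one succeeded.
  "$RUNNER" ./scripts/prepare_training_data.sh "$bucket" "$train_file" &&
  "$RUNNER" ./scripts/train_from_scratch.sh "$bucket" "$tpu" &&
  "$RUNNER" ./scripts/convert.sh "$bucket"
}
```

After sourcing the snippet, a dry run such as `RUNNER=echo; run_pipeline my-bucket corpus.txt my-tpu` prints the three commands so you can check the arguments before starting a real run.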

About

Training BERT from scratch on a Cloud TPU
