This directory contains all the scripts and code you need to train BERT from scratch on a TPU on Google Cloud.
TODO: Dockerize.
bash ./scripts/prepare_training_data.sh [BUCKET NAME] [TRAIN FILE NAME]
bash ./scripts/train_from_scratch.sh [BUCKET NAME] [TPU NAME]
bash ./scripts/convert.sh [BUCKET NAME]