Skip to content

v0.0.1: Training on AWS Trainium

Compare
Choose a tag to compare
@michaelbenayoun michaelbenayoun released this 13 Mar 14:06
· 552 commits to main since this release

The following architectures can be trained on AWS Trainium instances (trn1.2xlarge and trn1.32xlarge) :

  • ALBERT
  • BERT
  • DistilBERT
  • RoBERTa
  • XLM-RoBERTa
  • CamemBERT
  • Electra
  • GPT-2
  • GPT-Neo
  • MarianMT
  • T5
  • BART
  • ViT

Training examples for many tasks are provided here.