Skip to content

V0.10.0 DeepSpeed integration revamp and TPU speedup

Compare
Choose a tag to compare
@sgugger sgugger released this 15 Jun 18:07
· 1106 commits to main since this release

This release adds two major new features: the DeepSpeed integration has been revamped to match the one in Transformers Trainer, with multiple new options unlocked, and the TPU integration has been sped up.

This version also officially stops supporting Python 3.6 and requires Python 3.7+

DeepSpeed integration revamp

Users can now specify a DeepSpeed config file when they want to use DeepSpeed, which unlocks many new options. More details in the new documentation.

TPU speedup

If you're using TPUs we have sped up the dataloaders and models quite a bit, on top of a few bug fixes.

  • Revamp TPU internals to be more efficient + enable mixed precision types by @muellerzr in #441

What's new?