Skip to content

New Torch 2.1 Version

Latest
Compare
Choose a tag to compare
@JonasGeiping JonasGeiping released this 13 Jun 16:46
· 27 commits to main since this release

This release is the new version for torch 2.1. The code is nicer to read, has fewer dependencies (no more flash attention installations), data can now be easily streamed, and training is faster.

The new checkpoints are about 2% better on GLUE with the same budget.