v0.0.12

@jalammar released this 05 Jan 07:44
· 189 commits to main since this release
c3b1528
  • Larger GPT2 models can now process long sequences on the GPU without running out of memory.
  • Neuron activations: you can now specify which layers to capture activations from, instead of capturing them from every layer.

Thanks to contributor @nostalgebraist