Skip to content

v0.4.1 beta (<100s)

Latest

Choose a tag to compare

@tysam-code tysam-code released this 22 Mar 00:09
a9f158f

This is a big one. New attention block, new architecture scales, one-parameter scaling to 1.5B, and so much more.

(twitter thread will be updated here when possible).