Releases · tysam-code/hlb-gpt

22 Mar 00:09

tysam-code

0.4.1

a9f158f

v0.4.1 beta (<100s) Latest

Latest

This is a big one. New attention block, new architecture scales, one-parameter scaling to 1.5B, and so much more.

(twitter thread will be updated here when possible).

Assets 2

26 Mar 13:10

tysam-code

0.3.0

c8485a0

v0.3.0 beta (~136-140s)

Hiya there! In this release, we upgrade the MLP a bit to include the SiGLU activation function (over the default non-linearly-gated GELU function), convert the network over to pure bfloat16 (from a mixed precision dynamic), and perform various optimizations to bring our training time down another 18-22 seconds or so (woop woop!) For more info, check out the twitter thread detailing some of the tweaks for this patch (https://twitter.com/hi_tysam/status/1639975149951672321)! <3 :D :)))) <3 🎆 🎇 🎇 🎆

Assets 2

22 Mar 02:39

tysam-code

0.2.0

4535aa0

v0.2.0 beta

Hi there! In this release, we add sequence length scheduling and make a few other tweaks! For more info on the sequence length scheduling (and the relevant supporting changes), please see the release tweet at https://twitter.com/hi_tysam/status/1637691454012153856?cxt=HHwWgICzgevsn7otAAAA

Assets 2

20 Mar 00:34

tysam-code

0.1.0

0ce502e

beta v0.1.0

Greetings. In this release (originally from 3/12/23), we add a few features that cuts the training time nearly in half. This tag also includes a hotfix to restore backwards compatibility for people with torch versions less than 2.0.

For a more detailed summary of this release, please check out https://twitter.com/hi_tysam/status/1635123488674697218?cxt=HHwWhMDSpcqJkLEtAAAA

Assets 2

06 Mar 02:18

tysam-code

0.0.0

d7ea4d2

baseline 0.0.0

Hi hi hiya there! <3 :D Feel free to check out the README.md on this tag, it has the best summary of this release that I could probably give (also, so much typing and proofreading today, as always on release days I suppose, I am beat! :'D)

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Releases: tysam-code/hlb-gpt

v0.4.1 beta (<100s)

Uh oh!

v0.3.0 beta (~136-140s)

Uh oh!

v0.2.0 beta

Uh oh!

beta v0.1.0

Uh oh!

baseline 0.0.0

Uh oh!