feat: nanoGPT example #255

alexander-camuto · 2023-05-23T09:40:55Z

Here we add nanoGPT as an example to the repo. We leverage divide-by rather than multiply-by rescaling to accomodate large model depth and chained linear layers.

to run nanoGPT mock test:

cargo test --release --verbose tests::large_mock_::large_tests_1_expects

to run the full test:

cargo test --release --verbose tests::large_kzg_prove_and_verify_::large_tests_1_expects

thanks to @biancaganescu for helping make this happen :)

alexander-camuto and others added 7 commits May 22, 2023 15:24

feat: dividing rescaling

398f385

patch iff

0809644

self attention tests

4773686

Update rust.yml

512360a

patch rescaled tests

99858cd

cleanup

f7021fb

Update rust.yml

bc8fa05

alexander-camuto changed the title ~~refactor: divide-by rescaling~~ feat: nanoGPT example May 23, 2023

alexander-camuto added 4 commits May 23, 2023 14:24

full model

c77f3da

Update rust.yml

ef11787

Update rust.yml

b1f8e16

Update rust.yml

a8402bc

alexander-camuto marked this pull request as ready for review May 23, 2023 13:38

alexander-camuto added 2 commits May 23, 2023 15:00

reduce logrows

88bd364

cleanup self attention

4d4dfc3

alexander-camuto merged commit 66fa25b into main May 23, 2023
19 checks passed

alexander-camuto deleted the ac/switch-rescaling-strategy branch June 16, 2023 22:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: nanoGPT example #255

feat: nanoGPT example #255

alexander-camuto commented May 23, 2023 •

edited

feat: nanoGPT example #255

feat: nanoGPT example #255

Conversation

alexander-camuto commented May 23, 2023 • edited

alexander-camuto commented May 23, 2023 •

edited