From 4b3ff759e40ec91db5e5a0ffda396cb45ae8a51d Mon Sep 17 00:00:00 2001
From: Adithya Ganesh
Date: Sun, 17 Sep 2023 22:10:23 -0700
Subject: [PATCH] impl

---
 docs/index.html | 10 ++
 docs/index.xml | 12 +-
 .../2019-07-13-polya-burnside/index.html | 2 +
 docs/posts/index.html | 18 +++
 docs/posts/index.xml | 12 +-
 docs/posts/transformers/index.html | 152 ++++++++++++------
 docs/sitemap.xml | 9 +-
 site/public/index.html | 10 ++
 site/public/index.xml | 12 +-
 .../2019-07-13-polya-burnside/index.html | 2 +
 site/public/posts/index.html | 18 +++
 site/public/posts/index.xml | 12 +-
 site/public/posts/transformers/index.html | 152 ++++++++++++------
 site/public/sitemap.xml | 9 +-
 14 files changed, 318 insertions(+), 112 deletions(-)

diff --git a/docs/index.html b/docs/index.html
index 4fbf703..94c1f2e 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -73,6 +73,16 @@

Adi Ganesh

+

+ + GPT in words and code + +

+ + +

Notes on transformers / LLMs from the ground up.

+ +

Pólya-Burnside enumeration in combinatorics
diff --git a/docs/index.xml b/docs/index.xml
index cf5fc39..79297ba 100644
--- a/docs/index.xml
+++ b/docs/index.xml
@@ -6,7 +6,17 @@
 Recent content on Adi Ganesh
 Hugo -- gohugo.io
 en-us
- Sat, 13 Jul 2019 00:00:00 +0000
+ Sun, 20 Aug 2023 00:00:00 +0000
+
+ GPT in words and code
+ https://acganesh.github.io/posts/transformers/
+ Sun, 20 Aug 2023 00:00:00 +0000
+
+ https://acganesh.github.io/posts/transformers/
+ I find that the best way to understand how machine learning papers work is to write the code for a model forward pass. If you can load the weights from a pre-trained model and get the same outputs from a single model inference, you can be pretty confident that you’ve re-implemented all of the details from a model. The advantages of doing this are:
+Does not require any training, which can be time-consuming and expensive.
+
+
 Pólya-Burnside enumeration in combinatorics
 https://acganesh.github.io/posts/2019-07-13-polya-burnside/
diff --git a/docs/posts/2019-07-13-polya-burnside/index.html b/docs/posts/2019-07-13-polya-burnside/index.html
index 8bc010e..0da4f1b 100644
--- a/docs/posts/2019-07-13-polya-burnside/index.html
+++ b/docs/posts/2019-07-13-polya-burnside/index.html
@@ -324,6 +324,8 @@