EugenHotaj/zig_gpt2

GPT-2 inference engine written in Zig. Generation time: ~28ms per token.

Features:

  • No third-party dependencies besides BLAS (Accelerate or OpenBLAS).
  • No memory allocations at runtime.
  • Can run NanoGPT.

How to Run:

Download the GPT-2 checkpoint from OpenAI:

python3 download_weights.py

Build the Zig binary and run it with a prompt to generate completions:

zig build -Doptimize=ReleaseFast
./zig-out/bin/zig_gpt2 "Marcus Aurelius said"

How to Test:

Generate test data by forwarding random tensors through PyTorch ops:

python3 generate_test_data.py

Run the tests, which verify that the Zig ops produce the same outputs as PyTorch:

zig build test

TODO

Implementation:

  • ✅ Implement basic ops: Embedding, Linear, LayerNorm, GELU, Softmax, CausalSelfAttention.
  • ✅ Implement transformer modules: MLP, Transformer block.
  • ✅ Implement the full GPT model.
  • ✅ Implement sampling from the model.
  • ✅ Implement BPE encoding/decoding.
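
For reference, two of the ops above can be sketched in NumPy (these are the standard formulas, not this repo's Zig code): the tanh approximation of GELU that GPT-2 uses, and a numerically stable softmax.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU, as used by GPT-2.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def softmax(x):
    # Numerically stable softmax over the last axis: subtracting the max
    # before exponentiating avoids overflow without changing the result.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)
```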

Efficiency:

  • ✅ Replace custom linear algebra kernels with BLAS.
  • ✅ Stream output as each new token is generated.
  • ✅ Create a central set of memory buffers and reuse them for each layer, so there are no allocations at runtime.
  • ✅ Add KV cache.
  • Parallelize the softmax and GELU operations.
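
The KV-cache idea above can be sketched in NumPy (names and shapes are illustrative, not this repo's Zig implementation): at each step only the new token's K/V vectors are computed, while earlier ones are read back from preallocated buffers, which also fits the no-allocations-at-runtime design.

```python
import numpy as np

n_head, head_dim, max_seq = 12, 64, 1024

# Preallocated cache buffers, filled in place as tokens are generated.
k_cache = np.zeros((n_head, max_seq, head_dim), dtype=np.float32)
v_cache = np.zeros((n_head, max_seq, head_dim), dtype=np.float32)

def attend(q, k_new, v_new, pos):
    """Append this token's K/V to the cache, then attend over positions 0..pos."""
    k_cache[:, pos] = k_new
    v_cache[:, pos] = v_new
    k = k_cache[:, : pos + 1]                       # (n_head, pos+1, head_dim)
    v = v_cache[:, : pos + 1]
    scores = np.einsum("hd,htd->ht", q, k) / np.sqrt(head_dim)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)              # softmax over cached positions
    return np.einsum("ht,htd->hd", w, v)            # (n_head, head_dim)
```

Causality is implicit: only positions up to `pos` are in the cache, so no explicit mask is needed during generation.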
