A from-scratch implementation of a GPT-style transformer that lets you peek inside during inference and training.
- Runs entirely on CPU
- No network/API calls and no ML frameworks
- Pure Go; OpenBLAS can optionally be linked in for faster matrix products
$ go run . -mode train -model models/names -data ./data/names -text -v 200 \
-dmodel 32 -ctx 8 -blocks 2 -attn 2 -mlp 2 \
-iters 1000 -lr 0.01 -ub 64
This trains a character-level transformer to generate names:
Model:
- 32-dimensional embedding vectors
- context size of 8 tokens
- 2 blocks
- 2 attention heads per block
- ~19k parameters
Training:
- training data read from ./data/names
- validation set of 200 examples
- 1000 iterations (Adam, learning rate 0.01)
- batch size 64
The training run above takes 2 seconds on my Zen 5 CPU.
$ go run . -mode prompt -model ./models/names -text -prompt 'adam' -n 50
Sample output:
adam
allaunex
bandero
briestyn
nelun
kad
feren
dondlyn
$ go run . -mode peek -model ./models/names -prompt 'adam'
$ go run . -mode peek -attention -model ./models/names -prompt 'briestyn'
$ go test