llama.go

A pure Go implementation of the LLaMA model for inference and educational purposes. Supports LLaMA 1, 2, and 3 architectures.

This repository demonstrates how to run LLaMA inference using only Go's standard library, making it ideal for learning and understanding transformer internals without heavy dependencies.

Features

HF-aligned Architecture – Matches HuggingFace reference implementation with clean, structured codebase matching official model layouts
Concurrent MHA – Multi-head attention parallelized across goroutines for 2-4x speedup on multi-core systems
Int8 Quantization – Post-training quantization reduces model size by 4x while maintaining inference quality
Zero Dependencies – 100% pure Go standard library, no external packages or CGO
Educational – Line-by-line readable transformer implementation with inline documentation

Usage

go run cmd/llama/main.go ./cmd/llama/stories15M.bin

The examples use small models trained by Andrej Karpathy for demonstration.

Related Work

If you're interested in LLaMA implementations in other languages:

llama.np – NumPy-based implementation
llama.cu – CUDA-accelerated implementation

Acknowledgments

Inspired by llama2.c and go-llama2. Licensed under their respective terms.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github		.github
cmd		cmd
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
llama.go		llama.go
llamaq8.go		llamaq8.go
tokenizer.bin		tokenizer.bin
tokenizer.go		tokenizer.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

llama.go

Features

Usage

Related Work

Acknowledgments

License

About

Uh oh!

Releases

Uh oh!

Contributors 2

Uh oh!

Languages

License

gitctrlx/llama.go

Folders and files

Latest commit

History

Repository files navigation

llama.go

Features

Usage

Related Work

Acknowledgments

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Contributors 2

Uh oh!

Languages