llama.np

A pure NumPy implementation of the LLaMA model for inference and educational purposes. Supports LLaMA 1, 2, and 3 architectures (LLaMA 4 is not supported).

This repository demonstrates how to run LLaMA inference using only NumPy, making it ideal for learning and understanding transformer internals without heavy dependencies.
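To give a flavor of what "only NumPy" means in practice, here is a minimal sketch of two building blocks used by LLaMA-style models: RMS normalization and single-head causal attention. It is an illustration under simplifying assumptions (one head, no rotary embeddings, no KV cache), not the code in llama.py; the function names `rms_norm`, `softmax`, and `causal_attention` are hypothetical.

```python
import numpy as np


def rms_norm(x, weight, eps=1e-5):
    """Divide each vector by its root mean square, then apply a learned gain."""
    rms = np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)
    return x / rms * weight


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)


def causal_attention(q, k, v):
    """Single-head scaled dot-product attention with a causal mask.

    q, k, v all have shape (seq_len, head_dim).
    """
    seq_len, head_dim = q.shape
    scores = q @ k.T / np.sqrt(head_dim)
    # Mask entries above the diagonal so token i cannot attend to tokens j > i.
    mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
    scores = np.where(mask, -np.inf, scores)
    return softmax(scores, axis=-1) @ v


# Toy input: 4 tokens with 8 features each (sizes chosen only for illustration).
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))
normed = rms_norm(x, np.ones(8))
out = causal_attention(normed, normed, normed)
print(out.shape)  # (4, 8)
```

Because of the causal mask, the first output row depends only on the first token, which is a quick sanity check when experimenting with the sketch.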

Usage

python llama.py "I have a dream"

The example runs a small demonstration model trained by Andrej Karpathy, shipped in the repository as stories15M.npz.
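The repository stores the demonstration weights as stories15M.npz, a NumPy archive. As a rough sketch of how such an archive can be inspected with `np.load`, the snippet below builds a tiny stand-in file and lists its arrays. The helper `summarize_npz` and the array names `tok_embedding` and `final_norm` are assumptions made for illustration; they are not necessarily the names used in stories15M.npz.

```python
import os
import tempfile

import numpy as np


def summarize_npz(path):
    """Return {array_name: (shape, dtype)} for every array in an .npz archive."""
    with np.load(path) as archive:
        return {
            name: (archive[name].shape, str(archive[name].dtype))
            for name in archive.files
        }


# Build a tiny stand-in checkpoint; the array names here are hypothetical.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.npz")
    np.savez(
        demo_path,
        tok_embedding=np.zeros((32, 8), dtype=np.float32),
        final_norm=np.ones(8, dtype=np.float32),
    )
    summary = summarize_npz(demo_path)

for name, (shape, dtype) in summary.items():
    print(name, shape, dtype)
```

Pointing `summarize_npz` at a real checkpoint path should print the actual parameter names and shapes, which is a convenient first step before reading the model code.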

Acknowledgments

Inspired by llama3.np and Hugging Face Transformers, which are licensed under their respective terms.

License

MIT
