sloth is a small educational LLM written entirely from scratch in pure C, with no non-Unix dependencies.
The goal of the project is not raw speed, scale, or production readiness. The goal is to understand how a transformer-based language model actually works, end to end, by implementing the core mechanisms manually and documenting them heavily in the source code (not necessarily in the most efficient way).
If you want to follow a minimal but fairly complete implementation of a GPT-style model (including embeddings, positional encoding, self-attention, feedforward layers, and backpropagation), this project is for you.
The code is intentionally verbose and heavily commented so you can follow the logic function by function.
sloth is meant to be:
- educational
- readable
- hackable
- dependency-light
- Unix-friendly
This is not a highly optimized training system, and it is not trying to compete with existing frameworks. It is a project for learning, experimentation, and understanding the internals of LLMs by building them yourself.
That said, if you want to squeeze as much speed as possible out of this implementation, compile it with:

```sh
gcc -O3 -march=native llm.c -o llm -lm
```

(Note that `-lm` goes after the source file, since the linker resolves libraries left to right.)

This README.md has been written by ChatGPT. Apologies.
