This repository is my learning lab for building, training, and understanding large language models (LLMs) from the ground up.
Everything here is coded manually — no shortcuts, no black boxes — to deeply understand how transformers, tokenizers, optimizers, and distributed training truly work.
I’m implementing and experimenting with different LLM architectures, attention mechanisms, and modern model designs that keep appearing in research and open-source projects. The goal is simple — to understand how these models actually work by building them myself, one at a time.
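As a taste of the kind of component built from scratch here, below is a minimal sketch of scaled dot-product attention in NumPy. It is illustrative only: the function name, shapes, and the toy usage are placeholders, not files in this repo.

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, mask=None):
    """q, k, v: (seq_len, d_k) arrays; returns a (seq_len, d_k) attention output."""
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)            # similarity between queries and keys
    if mask is not None:
        scores = np.where(mask, scores, -1e9)  # block masked (e.g. future) positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the key dimension
    return weights @ v                         # weighted sum of value vectors

# Toy usage: 4 tokens, 8-dimensional vectors, causal (lower-triangular) mask.
q = k = v = np.random.randn(4, 8)
causal = np.tril(np.ones((4, 4), dtype=bool))
out = scaled_dot_product_attention(q, k, v, mask=causal)
```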