A series of character-level language models for name generation, implemented in PyTorch following Andrej Karpathy's "Building makemore" tutorial series.
Re-implemented from first principles rather than copied, to solidify my understanding of how autoregressive models work, from bigrams up to MLPs.
- `Bigram.ipynb` — Bigram character-level model (counting + neural net approach)
- `MLP.ipynb` — MLP with character embeddings, trained on name data
- `MLP3.ipynb` — Extended MLP experiments (WIP)
- `names.txt` — Training dataset (32K names)
- `makemore.py` — Refactored model classes, training loop, and sampling utilities
- `test_makemore.py` — Tests for the models and training pipeline
- Bigram language model via explicit counting and normalization
- Bigram model as a single-layer neural network with softmax
- Character embeddings learned jointly with the model
- MLP architecture: embedding → hidden layer (tanh) → output softmax
- Train/validation/test split for proper evaluation
- Mini-batch SGD with learning rate tuning
- Name generation via autoregressive sampling from the trained model
- Refactored `MakemoreModel` class with configurable architecture
- Training loop with loss tracking, learning rate scheduling, and early stopping
- Top-k and temperature-based sampling for controllable generation
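The counting-and-normalization approach behind the bigram model can be sketched in plain Python. This is a minimal illustration, not the notebook's actual code (the notebooks work with PyTorch count tensors); the function names are hypothetical:

```python
import random
from collections import defaultdict

def train_bigram(words):
    """Count character-to-character transitions; '.' marks name boundaries."""
    counts = defaultdict(lambda: defaultdict(int))
    for w in words:
        chars = ['.'] + list(w) + ['.']
        for a, b in zip(chars, chars[1:]):
            counts[a][b] += 1
    # Normalize each row of counts into a probability distribution.
    return {a: {b: n / sum(row.values()) for b, n in row.items()}
            for a, row in counts.items()}

def sample_name(probs, seed=0):
    """Autoregressively sample characters until the end token '.' is drawn."""
    rng = random.Random(seed)
    out, ch = [], '.'
    while True:
        choices, weights = zip(*probs[ch].items())
        ch = rng.choices(choices, weights=weights)[0]
        if ch == '.':
            return ''.join(out)
        out.append(ch)

probs = train_bigram(["emma", "olivia", "ava"])
print(sample_name(probs))
```

The neural-net version of the same model replaces the count table with a learned weight matrix whose softmax rows converge to these same conditional distributions.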
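The temperature and top-k sampling mentioned above can be shown with a small standalone function. The plain-Python implementation and function name are illustrative (the repo's version would operate on PyTorch logits):

```python
import math
import random

def sample_with_temp_topk(logits, temperature=1.0, k=None, seed=0):
    """Sample an index from raw logits with temperature and optional top-k.

    temperature < 1 sharpens the distribution (less diverse, higher quality);
    temperature > 1 flattens it. top-k masks all but the k largest logits.
    """
    rng = random.Random(seed)
    scaled = [l / temperature for l in logits]
    if k is not None:
        cutoff = sorted(scaled, reverse=True)[k - 1]
        # Ties at the cutoff are kept, so slightly more than k may survive.
        scaled = [l if l >= cutoff else float('-inf') for l in scaled]
    # Softmax with max-subtraction for numerical stability.
    m = max(scaled)
    exps = [math.exp(l - m) for l in scaled]
    total = sum(exps)
    return rng.choices(range(len(exps)), weights=[e / total for e in exps])[0]
```

With `k=1` this degenerates to greedy argmax decoding, and very low temperatures approach the same behavior.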
- How bigram models bridge counting and gradient-based approaches
- Why embeddings let the model share statistical strength across similar characters
- The role of the hidden layer in capturing character interactions beyond bigrams
- How mini-batching stabilizes gradient estimates and speeds up training
- Why train/val/test splits matter even for generative models
- How temperature and top-k sampling trade diversity vs. quality in generation
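The gradient-averaging effect of mini-batching can be demonstrated on a toy problem. This is a hypothetical helper fitting a line in plain Python, not code from the repo, which trains the models with PyTorch:

```python
import random

def minibatch_sgd(xs, ys, lr=0.05, batch_size=8, epochs=200, seed=0):
    """Fit y = w*x + b by mini-batch SGD on mean squared error.

    Averaging gradients over a batch reduces their variance compared with
    single-example updates, while staying cheaper than full-batch passes.
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    idx = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(idx)
        for start in range(0, len(idx), batch_size):
            batch = idx[start:start + batch_size]
            # Gradient of 0.5 * mean((w*x + b - y)^2) over the batch.
            gw = sum((w * xs[i] + b - ys[i]) * xs[i] for i in batch) / len(batch)
            gb = sum((w * xs[i] + b - ys[i]) for i in batch) / len(batch)
            w -= lr * gw
            b -= lr * gb
    return w, b
```

The same shuffle-then-slice pattern is what a tensor-based training loop does with index tensors, just vectorized across the batch dimension.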
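The split itself is simple to sketch; the function name and the common 80/10/10 fractions below are illustrative defaults, not necessarily what the notebooks use:

```python
import random

def split_dataset(words, frac_train=0.8, frac_val=0.1, seed=42):
    """Shuffle and split into train/val/test.

    Validation tunes hyperparameters (hidden size, learning rate);
    the test set is touched once, at the end, for an unbiased loss estimate.
    """
    words = list(words)
    random.Random(seed).shuffle(words)
    n1 = int(frac_train * len(words))
    n2 = int((frac_train + frac_val) * len(words))
    return words[:n1], words[n1:n2], words[n2:]
```

Even a generative model can memorize its training names; the gap between train and validation loss is what reveals that overfitting.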
- Add BatchNorm layer for more stable training
- Implement Wavenet-style residual connections
- Add a learning rate finder utility
Based on Andrej Karpathy's makemore tutorial and his original implementation. All educational credit to him.