This project is part of the Google #TPUSprint program and showcases a JAX/Flax/NNX implementation of Masked Diffusion Language Models. This codebase borrows structure and snippets from the PyTorch implementation tiny-diffusion.
Why diffusion language models? Instead of generating tokens one at a time autoregressively, the model learns to recover masked tokens by denoising masked blocks in parallel.
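The masking and parallel-denoising idea can be sketched as follows. This is a minimal illustration, not the project's actual code: the `MASK_ID` value, the greedy argmax fill, and the stand-in random logits are all assumptions, and numpy is used in place of JAX for brevity.

```python
import numpy as np

MASK_ID = 0  # assumed mask-token id; the real vocabulary is project-specific

def mask_tokens(tokens, mask_prob, rng):
    """Forward process: independently replace each token with MASK_ID."""
    mask = rng.random(tokens.shape) < mask_prob
    return np.where(mask, MASK_ID, tokens), mask

def denoise_step(tokens, mask, logits):
    """Reverse step: fill every masked position in parallel with the
    model's argmax prediction (greedy sketch; real samplers may iterate)."""
    preds = logits.argmax(axis=-1)
    return np.where(mask, preds, tokens)

rng = np.random.default_rng(0)
tokens = np.array([5, 7, 3, 9, 2])
noisy, mask = mask_tokens(tokens, mask_prob=0.5, rng=rng)

# Stand-in for model output; a trained model would produce these logits.
vocab_size = 10
logits = rng.standard_normal((tokens.shape[0], vocab_size))
denoised = denoise_step(noisy, mask, logits)

# Unmasked positions pass through unchanged.
assert (denoised[~mask] == tokens[~mask]).all()
```

The key point is that `denoise_step` fills all masked positions in a single vectorized operation, rather than predicting them left to right.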
```shell
uv sync
uv run main.py --train
```

The trained model will be saved to `weights/diffusion_checkpoint.pkl`.
```shell
uv run main.py --prompt "Once upon a time"
```

Uses the checkpoint from the last training step if present.
Open the Colab notebook.