Quiet STAR

An implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf) including a full test suite to guarantee correctness.

This implementation is optimized so that it performs a minimal amount of computation when generating thoughts. The tricks used to reduce the computation were alluded to in the paper. However, it still needs to be configured to use Flash Attention, so a batch of size 1 takes a little over 1 second on a 4090. (Recall that one batch requires, among other things, generating multiple thoughts of length N at all locations in the input sequence, so all things considered, it is still fairly fast.)

See TODO.md for planned improvements.

Usage

To set up the environment:

poetry config keyring.enabled false
poetry install
source .venv/bin/activate

Then to fine tune a Qwen 0.5B model (default config requires an nvidia GPU like a 3090 or 4090 with 24GB of RAM):

python run_train_qwen.py

By default, the dataset used is just a small part of the dataset used in the paper.

There is code in this repository to train a model using MLX on Apple silicon, but it is not currently set up to use a pretrained HuggingFace model.

Development

To begin:

pre-commit install

Before committing:

pytest

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
quiet_star		quiet_star
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
debug_bfloat.ipynb		debug_bfloat.ipynb
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run_train_gpt.py		run_train_gpt.py
run_train_mlx.py		run_train_mlx.py
run_train_qwen.py		run_train_qwen.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

quiet_star

quiet_star

tests

tests

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

LICENSE

LICENSE

README.md

README.md

TODO.md

TODO.md

debug_bfloat.ipynb

debug_bfloat.ipynb

poetry.lock

poetry.lock

pyproject.toml

pyproject.toml

run_train_gpt.py

run_train_gpt.py

run_train_mlx.py

run_train_mlx.py

run_train_qwen.py

run_train_qwen.py

Repository files navigation

Quiet STAR

Usage

Development

About

Releases

Packages

Languages

License

expz/quiet-star

Folders and files

Latest commit

History

Repository files navigation

Quiet STAR

Usage

Development

About

Resources

License

Stars

Watchers

Forks

Languages