This is currently a work in progress; some parts are still quite rough.
This is an attempt to port Karpathy's llm.c to safe Rust. By safe, I mean that no unsafe code is introduced in this repository itself (dependencies are not considered).
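For what it's worth, this guarantee can be enforced at compile time with a crate-level lint. Below is a minimal sketch; whether the crate root actually carries this attribute is an assumption on my part:

// src/main.rs: turn any use of `unsafe` in this crate into a hard compile error.
#![forbid(unsafe_code)]

fn main() {
    // Uncommenting the line below would now fail to compile:
    // let v = unsafe { std::mem::zeroed::<i32>() };
    println!("this crate compiles only if it contains no unsafe code");
}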
For now, each training step of this Rust version (built with opt-level=3) takes about as long as llm.c compiled with -O3. Note that the Rust version contains more parallelism than the C version, so this is not an entirely fair comparison.
First, follow the CPU quick-start steps in llm.c. Note that the Python scripts are under scripts/ and are the same as in llm.c:
pip install -r requirements.txt
python scripts/prepro_tinyshakespeare.py
python scripts/train_gpt2.py
Then just build and run with cargo:
cargo run --release
Rayon is used for parallelism, and most loops have been parallelized. The number of threads is rayon's default (one per logical CPU core).
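To illustrate the kind of loop-level parallelism involved, here is a minimal sketch of a rayon-parallel matmul forward pass in the spirit of llm.c's OpenMP version. The function name, signature, and memory layout are illustrative assumptions, not the repository's actual code:

use rayon::prelude::*;

// Illustrative layout: out is (B*T, OC), inp is (B*T, C),
// weight is (OC, C) row-major, bias has length OC.
fn matmul_forward(out: &mut [f32], inp: &[f32], weight: &[f32], bias: &[f32], c: usize, oc: usize) {
    // Each (b, t) output row is independent, so rows can be computed in parallel.
    out.par_chunks_mut(oc)
        .zip(inp.par_chunks(c))
        .for_each(|(out_row, in_row)| {
            for o in 0..oc {
                let w_row = &weight[o * c..(o + 1) * c];
                let mut acc = bias[o];
                for (x, w) in in_row.iter().zip(w_row) {
                    acc += x * w;
                }
                out_row[o] = acc;
            }
        });
}

Because each output row is written by exactly one rayon task, this needs no locking and stays within safe Rust.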
To compare with the C version, change the optimization level in the llm.c Makefile to -O3, then build and run:
make train_gpt2
OMP_NUM_THREADS=12 ./train_gpt2
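For a fairer comparison at the same thread count, the Rust side can be pinned to the same number of threads; rayon's global thread pool honors the RAYON_NUM_THREADS environment variable:

RAYON_NUM_THREADS=12 cargo run --release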
I haven't yet found a Rust equivalent of -Ofast.
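One reason is that, as of this writing, Rust exposes fast-math operations only through unstable, unsafe intrinsics, which would break this repository's no-unsafe goal. In the meantime, codegen can still be tuned for the host CPU with a standard rustc flag (this is a general option, not something the repository necessarily sets):

RUSTFLAGS="-C target-cpu=native" cargo run --release

Setting lto = true and codegen-units = 1 under [profile.release] in Cargo.toml can also squeeze out a bit more performance.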
No serious benchmarking has been done yet, but on a MacBook M2 Max the per-step performance of the Rust version is similar to the C version: each training step takes ~4.4 s to finish.
There is more parallelism in this Rust version than in llm.c, so its performance should be (at least slightly) better. That said, some parallelization strategies are not optimal, and some parts are not parallelized at all yet, so there is room for further improvement.
Additionally, wgpu could perhaps be used to accelerate training on the GPU.
I would like to express my sincere gratitude to ChatGPT for its assistance and support in this project.