Skip to content

Inference Llama 2 in one file of pure C

License

Notifications You must be signed in to change notification settings

poudels14/llama2_rs

 
 

Repository files navigation

llama2 in Rust!

This is a Rust port of https://github.com/karpathy/llama2.c.

Usage

# Clone the repo
git clone https://github.com/poudels14/llama2_rs

# Download the model
wget -P models https://huggingface.co/karpathy/tinyllamas/resolve/main/stories15M.bin

# Run the inference

cargo run --release -- -t tokenizer.bin models/stories15M.bin 0.9

# For help, run:
cargo run --release -- --help

License

MIT

About

Inference Llama 2 in one file of pure C

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages

  • Python 56.4%
  • Rust 22.6%
  • C 19.6%
  • Makefile 1.4%