Add GPT-NeoX/Pythia support + benchmark results #4

tomaarsen · 2023-10-03T14:57:48Z

Hello!

Pull Request overview

Add support for all GPT-NeoX/Pythia models.
Add benchmark results for Pythia
Add benchmarking script for Pythia

Details

As simple as

from attention_sinks import AutoModel

model = AutoModel.from_pretrained("EleutherAI/pythia-6.9b", device_map="auto")

Benchmarks

python benchmark/perplexity.py --model_name_or_path EleutherAI/pythia-6.9b-deduped --experiment attention_sinks --output_dir benchmark/outputs_pythia_6.9b
python benchmark/perplexity.py --model_name_or_path EleutherAI/pythia-6.9b-deduped --experiment transformers --output_dir benchmark/outputs_pythia_6.9b
python benchmark/perplexity.py --model_name_or_path EleutherAI/pythia-6.9b-deduped --experiment windowed --output_dir benchmark/outputs_pythia_6.9b

python benchmark/plot_perplexity.py --features perplexity vram --title "Log perplexity & VRAM usage of Pythia 6.9B as a function of input lengths" --output_dir benchmark/outputs_pythia_6.9b --log_perplexity_limit 4

Tom Aarsen

tomaarsen added 2 commits October 3, 2023 16:56

Add GPT-NeoX/Pythia support + benchmark results

9b9a641

Fix table in README

c081faf

tomaarsen merged commit 4be3831 into main Oct 3, 2023

tomaarsen deleted the model/pythia branch October 3, 2023 14:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GPT-NeoX/Pythia support + benchmark results #4

Add GPT-NeoX/Pythia support + benchmark results #4

tomaarsen commented Oct 3, 2023

Add GPT-NeoX/Pythia support + benchmark results #4

Add GPT-NeoX/Pythia support + benchmark results #4

Conversation

tomaarsen commented Oct 3, 2023

Pull Request overview

Details

Benchmarks