Skip to content

sergioahp/gpt2-experiments

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GPT-2 experiments

This repo is a fork of llm.c

Features

  • HellaSwag benchmarking and model sampling while using torch.compile
  • A shorter time for downloading fineweb10B thanks to the use of aria2
  • Visualization with Tensorboard
  • Checkpointing that allows to resume training

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors