Skip to content

Pre-training language models with limited data

Notifications You must be signed in to change notification settings

misraya/baby_lm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pre-training language models with limited data

  • Dataset exploration
  • Tokenizer analysis
  • Baseline training
  • Tuning with task rewards

About

Pre-training language models with limited data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages