Skip to content

Pull requests: karpathy/llm.c

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Remove redundant CPU computation in encoder bwd
#512 opened Jun 1, 2024 by gordicaleksa Loading…
adding wsd schedule with (1-sqrt) decay
#508 opened Jun 1, 2024 by eliebak Loading…
Add DockerFile
#501 opened May 30, 2024 by banyan-god Loading…
Realtime training visualization using wandb
#489 opened May 29, 2024 by chinthysl Loading…
MFU for other GPUs
#486 opened May 28, 2024 by ngc92 Loading…
Trigger CI template
#483 opened May 28, 2024 by rosslwheeler Loading…
train_gpt2.c: Add gpt2_write_to_checkpoint method
#467 opened May 26, 2024 by faxe1008 Loading…
.gitignore: ignore more for windows devs
#466 opened May 26, 2024 by nietras Loading…
test_gpt2.c: detailed timings
#465 opened May 26, 2024 by nietras Loading…
Fix MSVC compilation error with openMP
#460 opened May 25, 2024 by avflyer Loading…
Add cmake and fix OMP
#452 opened May 23, 2024 by avflyer Loading…
Use proper GeLU on CPU
#441 opened May 21, 2024 by jart Loading…
only save missing bits to reconstruct fp32 master weights
#432 opened May 19, 2024 by ngc92 Loading…
weight reordering: attempt 1
#427 opened May 17, 2024 by ngc92 Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.