Pull requests: huggingface/nanotron
Move MoE Implementation into src/, add Load Balancing Losses
#192 opened Jun 6, 2024 by haeggee
Add utility to preview samples used for training. See https://github.com/huggingface/nanotron/issues/184.
#190 opened Jun 4, 2024 by kylematoba
Supporting datatrove tokenized documents with Nanosets
#189 opened May 31, 2024 by TJ-Solergibert
Adapt topology-agnostic optimizer shard loading to MoE (fixes #106)
#107 opened Mar 15, 2024 by nopperl