generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
Loading…
5 tasks
Ensure Chat Template Safe Prompt Truncation
#3646
opened Jun 25, 2025 by
pramodith
Loading…
4 of 5 tasks
🔍 Add guidance on choosing
max_length
value and include visualizati…
#3630
opened Jun 22, 2025 by
qgallouedec
Loading…
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
🎀 New defaults:
gradient_checkpointing=True
#3510
opened May 29, 2025 by
qgallouedec
Loading…
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508
opened May 29, 2025 by
shaischaudhry
Loading…
3 of 5 tasks
[GRPO] Pad per minibatch instead of per generation batch
#3495
opened May 26, 2025 by
edbeeching
•
Draft
3 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.