generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable direct loading of LoRA adapters in vLLM to streamline GRPO/Online Dpo training
#3133
opened Mar 22, 2025 by
maoulee
Loading…
💎
Gemma 3
VLM SFT example script for single-image and multi-image
#3131
opened Mar 21, 2025 by
sergiopaniego
Loading…
5 tasks
improvement(utils.py): simplify repeating completion string
#3122
opened Mar 20, 2025 by
tpoisonooo
Loading…
feat: Add Interleaved Trainer implementation
#3107
opened Mar 18, 2025 by
ucalyptus2
Loading…
3 tasks done
Co-Locating vLLM Instances with Training Processes Via External Launcher
#3105
opened Mar 18, 2025 by
toslali-ibm
Loading…
2 of 5 tasks
Update sft trainer to include better packing
#3100
opened Mar 17, 2025 by
Ishan-Kumar2
Loading…
4 tasks done
Speeding up dataset packing code to use itertools.chain() instead of sum()
#3095
opened Mar 16, 2025 by
max-kaufmann
Loading…
[GRPO] add vlm training capabilities to the trainer
#3072
opened Mar 13, 2025 by
CompN3rd
Loading…
3 of 5 tasks
Fix: Multi gpu hang for ORPO and CPO Trainer
#3069
opened Mar 13, 2025 by
NanoCode012
Loading…
1 of 5 tasks
Fixing GRPO
reward_func
being a model with DeepSpeed ZeRO-3
#2984
opened Feb 28, 2025 by
jamesbraza
Loading…
Feature: Add SGLang as inference backend for generation in GRPO
#2981
opened Feb 28, 2025 by
jhinpan
Loading…
5 tasks done
Provide more accurate error messages to make the program more robust.
😴 stale
No update from the author, will be closed soon
#2932
opened Feb 22, 2025 by
dignfei
Loading…
4 tasks
Add the metrics completion_length_max and completion_length_min
#2930
opened Feb 22, 2025 by
dignfei
Loading…
4 tasks
Remove CUDA synchronization in mean_token_accuracy
😴 stale
No update from the author, will be closed soon
#2902
opened Feb 19, 2025 by
cyyever
Loading…
1 task done
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-18.