Pull requests: huggingface/trl

Pull requests list

add vllm support for token ids as input
#3280 opened Apr 11, 2025 by wybryan
Reward takes completion ids
#3272 opened Apr 9, 2025 by qgallouedec Draft
5 tasks
🦙 Llama 4
#3267 opened Apr 9, 2025 by qgallouedec Draft
5 tasks
[SFT] support for ring_attn in SFTTrainer
#3262 opened Apr 8, 2025 by kashif
5 tasks
Add a raw generate API to the vLLM server
#3227 opened Apr 3, 2025 by wilrop
5 tasks
Support iterable datasets in GRPO
#3226 opened Apr 3, 2025 by wilrop
5 tasks
update weight update process group
#3211 opened Apr 2, 2025 by ji-huazhong Draft
5 tasks
Adding sampling parameters for vllm generation
#3210 opened Apr 2, 2025 by shaipranesh2
GRPO: Scalable training with one LLM/node
#3186 opened Mar 31, 2025 by jglaser
3 of 5 tasks
Extend BCO Trainer dataset format support
#3134 opened Mar 22, 2025 by reihig-ut
1 of 5 tasks
feat: Add Interleaved Trainer implementation
#3107 opened Mar 18, 2025 by ucalyptus2
3 tasks done
Update sft trainer to include better packing
#3100 opened Mar 17, 2025 by Ishan-Kumar2
4 tasks done
[GRPO] add vlm training capabilities to the trainer
#3072 opened Mar 13, 2025 by CompN3rd
3 of 5 tasks
[WIP] PEFT 🤝 Liger DPO
#3065 opened Mar 12, 2025 by SalmanMohammadi Draft
5 tasks
[WIP] Iterative training scripts for SPIN and SPPO
#3011 opened Mar 5, 2025 by jkx19 Draft
3 of 5 tasks
Feature: Add SGLang as inference backend for generation in GRPO
#2981 opened Feb 28, 2025 by jhinpan
5 tasks done