Pull requests: huggingface/trl

Pull requests list

[Research] Layer Skip SFT
#3111 opened Mar 19, 2025 by ariG23498
feat: Add Interleaved Trainer implementation
#3107 opened Mar 18, 2025 by ucalyptus2
3 tasks done
Update sft trainer to include better packing
#3100 opened Mar 17, 2025 by Ishan-Kumar2
4 tasks done
add cli dict parsing for grpo_config
#3082 opened Mar 14, 2025 by Tavish9 Draft
2 of 5 tasks
[GRPO] add vlm training capabilities to the trainer
#3072 opened Mar 13, 2025 by CompN3rd
3 of 5 tasks
Fix: Multi gpu hang for ORPO and CPO Trainer
#3069 opened Mar 13, 2025 by NanoCode012
1 of 5 tasks
[WIP] PEFT 🤝 Liger DPO
#3065 opened Mar 12, 2025 by SalmanMohammadi Draft
5 tasks
Static cache GRPO
#3023 opened Mar 7, 2025 by qgallouedec Draft
5 tasks
[WIP] Iterative training scripts for SPIN and SPPO
#3011 opened Mar 5, 2025 by jkx19 Draft
3 of 5 tasks
Fast packing and truncation
#3009 opened Mar 4, 2025 by mariosasko
3 of 5 tasks
Feature: Add SGLang as inference backend for generation in GRPO
#2981 opened Feb 28, 2025 by jhinpan
5 tasks done
Support ReMax Algorithm
#2955 opened Feb 25, 2025 by liziniu
3 tasks done
[Models] Activation checkpointing from TorchTune
#2954 opened Feb 25, 2025 by kashif
Agents
#2936 opened Feb 23, 2025 by August-murr
Provide more accurate error messages to make the program more robust. [😴 stale: No update from the author, will be closed soon]
#2932 opened Feb 22, 2025 by dignfei
4 tasks
Add the metrics completion_length_max and completion_length_min
#2930 opened Feb 22, 2025 by dignfei
4 tasks
Supporting multi-vLLM inference for GRPO
#2929 opened Feb 22, 2025 by ghrua
2 of 5 tasks
Liger GRPO support
#2926 opened Feb 21, 2025 by SalmanMohammadi Draft
4 tasks
Remove CUDA synchronization in mean_token_accuracy [😴 stale: No update from the author, will be closed soon]
#2902 opened Feb 19, 2025 by cyyever
1 task done