Skip to content

Issues: OpenRLHF/OpenRLHF

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

PRM loss 疑问
#510 opened Nov 7, 2024 by EthanChen1234
Support for PPO for PRM? enhancement New feature or request
#498 opened Nov 1, 2024 by ljb121002
[RFC] Support SGLang generation in RLHF enhancement New feature or request
#487 opened Oct 28, 2024 by hijkzzz
知识蒸馏结果复现
#456 opened Oct 7, 2024 by jinchenyu
会不会支持异步生成训练 enhancement New feature or request
#353 opened Jul 11, 2024 by syx11237744
可以增加支持SimPO吗
#311 opened May 29, 2024 by victorShawFan
ProTip! What’s not been updated in a month: updated:<2024-10-10.