Issues: OpenRLHF/OpenRLHF
Issues list
assert state_dict_keys.issubset( [rank0]: AssertionError: mismatch keys
#501 opened Nov 5, 2024 by anoxia-1
Can openrlhf support using soft label during prm training process?
enhancement (New feature or request)
#488 opened Oct 28, 2024 by banksy23
[RFC] Support SGLang generation in RLHF
enhancement (New feature or request)
#487 opened Oct 28, 2024 by hijkzzz
Evaluate the PPO Process: Compatibility issues between DeepSpeed checkpoints and Transformers models
#426 opened Aug 20, 2024 by Ricardokevins
A worker died or was killed while executing a task by an unexpected system error.
#360 opened Jul 15, 2024 by lusongshuo-mt