Releases · OpenLLMAI/OpenRLHF
Release v0.2.0
Changes
- Supported vLLM 0.3.1 @wuxibin89
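As a usage sketch (not part of the release notes): vLLM acceleration applies to the Ray-based PPO path. The script path and flag names below (`examples/train_ppo_ray.py`, `--vllm_num_engines`, `--vllm_tensor_parallel_size`) are assumptions based on the repository's example scripts, and the model paths are placeholders.

```bash
# Hedged sketch: launch Ray PPO with vLLM generation engines.
# Script path, flags, and model names are assumptions/placeholders;
# verify against the examples/ directory of your checkout.
python examples/train_ppo_ray.py \
  --pretrain meta-llama/Llama-2-7b-hf \
  --reward_pretrain meta-llama/Llama-2-7b-hf \
  --vllm_num_engines 2 \
  --vllm_tensor_parallel_size 1
```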
Release v0.1.10
Changes
- Fixed `save_models` for `named_buffer` @wuxibin89
- Fixed a vLLM generation hang (requires vLLM < 0.2.7) @hijkzzz
Release v0.1.9
Changes
- Supported `input_template` #203 @rbao2018
- Supported KTO #201 @Dylancer1998 (see the sketch after this list)
- Upgraded HuggingFace Transformers to 4.37.1
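A hedged sketch of using the two v0.1.9 features together: KTO training with a custom input template. The script name (`examples/train_kto.py`), dataset, and template format are assumptions, not taken from these notes.

```bash
# Hedged sketch: KTO training with a custom --input_template.
# Script path, dataset, and template format are assumptions/placeholders;
# $'...' makes the shell pass a real newline in the template.
python examples/train_kto.py \
  --pretrain meta-llama/Llama-2-7b-hf \
  --dataset Anthropic/hh-rlhf \
  --input_template $'Human: {}\nAssistant: '
```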
Release v0.1.8
Changes
- Upgraded transformers to version 4.37
- Fixed gradient checkpointing configuration in Ray RLHF @wuxibin89
- Fixed loss coefficient for PPO-ptx @hijkzzz
Release v0.1.7
Changes
- Fixed LLaMA RoPE initialization bug for ZeRO3 @wuxibin89
- Fixed a DPO training script bug @hijkzzz
Release v0.1.6
Changes
- Fixed DeepSpeed configs to improve PPO training stability @hijkzzz
Release v0.1.5
Release v0.1.4
Changes
- Fixed reward model training when using the HuggingFace ZeRO3 initialization API (for models with 70 billion+ parameters) @wuxibin89
- Added support for the Mixtral 8x7b balancing loss (`--balancing_loss_coef`) @hijkzzz (see the sketch after this list)
- Fixed an issue with `vllm_engine` when `tp=1` @wuxibin89
- Fixed ZeRO2 model saving bugs @hijkzzz
- Added the `--grad_accum_dtype` argument to reduce CPUAdam memory usage @hijkzzz
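A hedged sketch combining the two new flags from this release. Only the flag names come from the notes above; the training entrypoint, model path, and values are placeholders.

```bash
# Hedged sketch: --balancing_loss_coef enables the Mixtral MoE router
# balancing loss; --grad_accum_dtype selects a lower-precision gradient
# accumulation dtype to reduce CPUAdam (ZeRO offload) memory.
# Entrypoint, model path, and values are placeholders.
python examples/train_sft.py \
  --pretrain mistralai/Mixtral-8x7B-v0.1 \
  --balancing_loss_coef 0.01 \
  --grad_accum_dtype fp16
```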
Release v0.1.3
Changes
- Fixed HuggingFace reward model saving @wuxibin89
- Improved `mask_mean` for the loss function @hijkzzz
- Fixed `num_actions` and `action_mask` @ZiyiLiubird
- Optimized PPO performance of example scripts (set `micro_batch_size=4`) @hijkzzz
Release v0.1.2
Changes
- Fixed reward model hidden size and `value_head` initialization @wuxibin89
- Fixed model saving bugs @hijkzzz