Improves the GRPO script to be more configurable. #840

Merged
copybara-service[bot] merged 1 commit into main from lance-updates on Dec 5, 2025

@wang2yn84 (Collaborator) commented on Dec 4, 2025

This PR adds more configuration options to the GRPO script and makes it Pathways-compatible.

Reference

Colab Notebook

Checklist

  • I have added all the necessary unit tests for my change.
  • I have verified that my change does not break existing code and all unit tests pass.
  • I have added all appropriate doc-strings/documentation.
  • My PR is based on the latest changes of the main branch (if unsure, rebase the code).
  • I have signed the Contributor License Agreement.
  • I have followed Contribution Guidelines.

copybara-service[bot] merged commit d6a2cc4 into main on Dec 5, 2025
8 checks passed