Skip to content

use np instead of jnp for reward fn and GRPO group adv#891

Merged
copybara-service[bot] merged 1 commit intomainfrom
test_844938035
Dec 17, 2025
Merged

use np instead of jnp for reward fn and GRPO group adv#891
copybara-service[bot] merged 1 commit intomainfrom
test_844938035

Conversation

@copybara-service
Copy link

use np instead of jnp for reward fn and GRPO group adv

PiperOrigin-RevId: 845477002
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant