Skip to content

feat: textarena refinements#24

Merged
lkevinzc merged 3 commits intomainfrom
textarena-rl-dev
Jun 19, 2025
Merged

feat: textarena refinements#24
lkevinzc merged 3 commits intomainfrom
textarena-rl-dev

Conversation

@vermouthdky
Copy link
Collaborator

modify textarena envs, especially reward shaping, to be compatible with rl training

@vermouthdky vermouthdky requested a review from lkevinzc June 18, 2025 12:25
@lkevinzc lkevinzc changed the title Textarena rl dev feat: textarena refinements Jun 18, 2025
@lkevinzc
Copy link
Contributor

@vermouthdky could you please help to check this PR (we have resolved many conflicts, not sure if any breaks the original code).

here are results of this pr's code:
image

once you have results ready on the modified rewards, you could compare and decide which one is better

@lkevinzc lkevinzc merged commit 90e6301 into main Jun 19, 2025
@lkevinzc lkevinzc deleted the textarena-rl-dev branch June 19, 2025 14:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants