feat: textarena refinements by vermouthdky · Pull Request #24 · axon-rl/gem

vermouthdky · 2025-06-18T12:25:34Z

Modify the textarena envs, especially their reward shaping, to be compatible with RL training.

lkevinzc · 2025-06-19T02:30:37Z

@vermouthdky could you please help check this PR? We resolved many conflicts and are not sure whether any of the resolutions break the original code.

Here are the results from this PR's code:

Once you have results ready for the modified rewards, you can compare the two and decide which is better.

vermouthdky requested a review from lkevinzc June 18, 2025 12:25

working envs in textarena rl

a15532e

lkevinzc force-pushed the textarena-rl-dev branch from b376867 to a15532e Compare June 18, 2025 12:42

minor refactor

09346f0

lkevinzc changed the title ~~Textarena rl dev~~ feat: textarena refinements Jun 18, 2025

Merge branch 'main' into textarena-rl-dev

c4f7141

lkevinzc merged commit 90e6301 into main Jun 19, 2025

lkevinzc deleted the textarena-rl-dev branch June 19, 2025 14:12
