v0.1.23
What's Changed
- add kakurasu env by @olliestanley in #460
- Update README.md - Star History by @zafstojano in #463
- add survo env by @olliestanley in #461
- fix color_cubes answer strings, update gallery with latest envs by @olliestanley in #464
- Fix/verl example by @joesharratt1229 in #465
- fix: Rounding issues in score_answer and add unit tests by @Adefioye in #462
- add minimal verifiers example by @olliestanley in #472
- tutorial(training): Add a minimal example with
trlby @zafstojano in #473 - Update README.md by @zafstojano in #475
- corrected countdown issue by @joesharratt1229 in #479
- better usage demo in readme by @olliestanley in #477
- Update README.md (RLSwarm GenRL) by @Miserlou in #480
- Feat/unsloth example by @joesharratt1229 in #482
- Update README.md by @zafstojano in #483
Full Changelog: v0.1.22...v0.1.23