v1.0.0rc5

Pre-release

bruno-f-cruz released this 24 May 00:08

5a87865

What's Changed

Ensure patch reward available depletes on reward delivery by @bruno-f-cruz in #544
Generalize environment sampling and implement sequential sampling by @bruno-f-cruz in #547
Refactor available reward to not deplete implicitly by @bruno-f-cruz in #548
Add support for automatically inferring dataset version by @bruno-f-cruz in #550
Ensure previous task logic schema (>=0.6) is backwards deserializable by @bruno-f-cruz in #551
Add deterministic reversals curriculum by @tiffanyona in #545

Full Changelog: v1.0.0rc4...v1.0.0rc5

Contributors

bruno-f-cruz and tiffanyona
