v1.0.0rc5
Pre-release
Pre-release
What's Changed
- Ensure patch reward available depletes on reward delivery by @bruno-f-cruz in #544
- Generalize environment sampling and implement sequential sampling by @bruno-f-cruz in #547
- Refactor available reward to not deplete implicitly by @bruno-f-cruz in #548
- Add support for automatically inferring dataset version by @bruno-f-cruz in #550
- Ensure previous task logic schema (>=0.6) is backwards deserializable by @bruno-f-cruz in #551
- Add deterministic reversals curriculum by @tiffanyona in #545
Full Changelog: v1.0.0rc4...v1.0.0rc5