v0.7.0
Fixes
- Restore R1 penalty gradients for discriminator regularization
- Fix gradient accumulation to zero grads only at accumulation boundaries
- Add tokenizer fallback for batch_encode_plus compatibility
Testing
- Stabilize dataset tests by ensuring tokenizer call returns tensor encodings