Some hacky fork of TRL.
Forked to enable Whisper GRPO training for various purposes. Not suitable for production use.
Code probably won't work beside Whisper models. Also probably won't work for whisper-large-v3 (Due to new sampling_rate).
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Some hacky fork of TRL.
Forked to enable Whisper GRPO training for various purposes. Not suitable for production use.
Code probably won't work beside Whisper models. Also probably won't work for whisper-large-v3 (Due to new sampling_rate).