GitHub - Neulus/trl: Train transformer language & seq2seq(Whisper only) models with reinforcement learning.

Some hacky fork of TRL.

Forked to enable Whisper GRPO training for various purposes. Not suitable for production use.

Code probably won't work beside Whisper models. Also probably won't work for whisper-large-v3 (Due to new sampling_rate).

Name		Name	Last commit message	Last commit date
Latest commit History 1,241 Commits
.github		.github
commands		commands
docker		docker
docs/source		docs/source
examples		examples
scripts		scripts
tests		tests
trl		trl
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback