Distill
5 repositories
verl: Volcano Engine Reinforcement Learning for LLMs
Train transformer language models with reinforcement learning.