Skip to content

Latest commit

 

History

History
15 lines (13 loc) · 680 Bytes

index.rst

File metadata and controls

15 lines (13 loc) · 680 Bytes

Welcome to trlX's documentation!

trlX is a library for training large language models with reinforcement learning. Training can be done with two RL algorithms: PPO (Schulman et al. 2017) for online training and ILQL (Snell et al. 2022) for offline training. For distributed training two backends are supported: Huggingface 🤗 Accelerate and NVIDIA NeMo.

installation api examples configs trainers pipelines data