verl-agent is an extension of veRL designed for training LLM/VLM agents via reinforcement learning (RL). It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
reinforcement-learning agent-framework large-language-models llm-training llm-agents deepseek-r1 grpo gigpo
Updated Jul 7, 2025 - Python