An extensible agentic RL framework for training multi-turn agents across tool use, memory, and expert routing.
This repository is organized as a reusable training stack for:
- multi-turn rollouts
- task-level rewards
- trajectory conversion
- benchmark inspection
- HPC-scale training workflows
The project is meant to be a clean base for building new agentic RL methods, not a collection of one-off training scripts.
Implemented methods:
- `loop_agent`: planner / executor / verifier style tool-use RL
- `memory_agent`: chunk-wise memory compression for long-context QA
- `expert_router`: routing across retrieval and external expert models under cost-aware preferences
Core packages:
- `agentic_rl.multi_turn`: shared trajectory expansion and GRPO reward normalization
- `agentic_rl.core`: shared runtime utilities such as LLM engine adapters
- `agentic_rl.methods.registry`: typed method registry and method metadata
- `agentic_rl.cli`: unified inspection entrypoint for methods and benchmarks
Examples:

```bash
agentic-rl list-methods
agentic-rl show-method loop_agent
agentic-rl benchmarks
```

The repository includes a minimal HPC training layer:
- `requirements-train.txt` for environment setup
- `configs/hpc.env.example` for cluster paths and runtime variables
- `configs/models/*.sh` and `configs/methods/*.sh` for model/method launch configs
- `scripts/launch_train.sh` as the shared Ray + training entrypoint
- `scripts/*.sbatch` templates for debug and formal jobs
- `scripts/preflight_check.py` for dataset/checkpoint/path validation
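A typical submission flow might look like the sketch below; the `configs/hpc.env` destination, the bare `preflight_check.py` invocation, and the `debug.sbatch` template name are illustrative assumptions, not documented interfaces.

```bash
# Illustrative submission flow; adapt paths and template names to your cluster.
cp configs/hpc.env.example configs/hpc.env   # fill in cluster paths and runtime variables

# Validate datasets, checkpoints, and paths before queueing a job.
python scripts/preflight_check.py

# Submit a debug job from one of the sbatch templates (name is illustrative).
sbatch scripts/debug.sbatch
```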
See `docs/hpc_training.md` before submitting jobs.
Notes:
- Install `slime` separately or through your preferred environment setup.
- `expert_router` expects external services for retrieval and expert models.
- For `func_call` mode, set `AGENTIC_RL_TAU2_ROOT` to an external TAU2 checkout or asset directory, as in the example below.
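For instance (the path is a placeholder):

```bash
# Point the framework at an external TAU2 checkout or asset directory
# before running func_call mode; replace the placeholder path.
export AGENTIC_RL_TAU2_ROOT=/path/to/tau2
```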
This repository includes work developed with reference to upstream open-source projects.
See THIRD_PARTY_NOTICES.md for redistribution and attribution details.