The primary purpose of the Forge ecosystem is to separate infrastructure concerns from model concerns, making RL experimentation easier. Forge delivers this by providing clear RL abstractions and one scalable implementation of those abstractions. When you need fine-grained control over placement, fault handling (e.g., redirecting training loads during a run), or communication patterns, the primitives are there. When you don't, you can focus purely on your RL algorithm.
Key features:
- Usability for rapid research (isolating the RL loop from infrastructure)
- Hackability for power users (all parts of the RL loop can be easily modified without interacting with infrastructure)
- Scalability (ability to shift between asynchronous and synchronous training, and to scale across thousands of GPUs)
⚠️ Early Development Warning: Forge is currently in an experimental stage. You should expect bugs, incomplete features, and APIs that may change in future versions. The project welcomes bug fixes, but to make sure things are well coordinated, please discuss any significant change before starting the work. It's recommended that you signal your intention to contribute in the issue tracker, either by filing a new issue or by claiming an existing one.
View Forge's hosted documentation (coming soon)
You can also find our notebook tutorials (coming soon)
Forge requires the latest PyTorch nightly with Monarch, vLLM, and torchtitan. For convenience, we have pre-packaged these dependencies as wheels in assets/wheels. (Note that the basic install script uses DNF, but it could easily be extended to other Linux distributions.)
Forge requires the GitHub CLI (gh) to download a compatible vLLM package. See here for gh install instructions. Before continuing, please log in to gh with your GitHub account via gh auth login; you may use either HTTPS or SSH as the authentication protocol.
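Before running the installer, it can help to confirm that gh is installed and authenticated (a quick sanity check using standard gh CLI commands, not anything Forge-specific):

```bash
# Confirm the GitHub CLI is installed and on PATH
gh --version

# Confirm you are authenticated; prints the active account and protocol
gh auth status
```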
conda create -n forge python=3.10
conda activate forge
./scripts/install.sh
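After the script finishes, a quick import check can confirm that the pre-packaged wheels landed in the new environment (a minimal sketch; the torchtitan import name is assumed to match its package name):

```bash
# Verify the core dependencies are importable in the new environment
python - <<'EOF'
import torch
import vllm        # wheel shipped in assets/wheels
import torchtitan  # import name assumed to match the package name
print("torch", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
EOF
```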
Optional: by default, the installation uses conda packages. If you want to install system packages on the target machine instead of conda, pass the --use-sudo flag to the installation script: ./scripts/install.sh --use-sudo.
After install, you can run the following command and should see output confirming GRPO training is running (you need a minimum of 3 GPUs):
python -m apps.grpo.main --config apps/grpo/qwen3_1_7b.yaml
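Since this app expects at least 3 GPUs, you can check what the machine exposes before launching (a simple check via PyTorch; nvidia-smi works equally well):

```bash
# Count the CUDA devices PyTorch can see; GRPO training expects at least 3
python -c "import torch; print('GPUs visible:', torch.cuda.device_count())"
```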
If you need to rebuild the wheels for whatever reason, you can do so with:
./scripts/build_wheels.sh
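Once rebuilt, the wheels can be installed into the active environment with pip (a sketch; the exact wheel filenames depend on the versions the build script produces, and --no-deps avoids pulling replacement dependencies from PyPI):

```bash
# Install the freshly built wheels over any existing versions
pip install --force-reinstall --no-deps assets/wheels/*.whl
```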
For reference: since the vLLM wheel is too large to check into GitHub, we uploaded it as a release asset, which the install.sh script then downloads. The release was created with:
$ gh release create v0.0.0 assets/wheels/vllm-*.whl --title "Forge Wheels v0.0.0"
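On the consumer side, the same asset can be fetched with gh release download (a hedged example; the tag and pattern mirror the upload command above and assume you are inside the Forge repo checkout, which is presumably what install.sh does on your behalf):

```bash
# Fetch the vLLM wheel attached to the v0.0.0 release into assets/wheels
gh release download v0.0.0 --pattern "vllm-*.whl" --dir assets/wheels
```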
To run SFT for Llama 3.1 8B, run:
uv run forge download meta-llama/Meta-Llama-3.1-8B-Instruct --output-dir /tmp/Meta-Llama-3.1-8B-Instruct --ignore-patterns "original/consolidated.00.pth"
uv run forge run --nproc_per_node 2 apps/sft/main.py --config apps/sft/llama3_8b.yaml
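Note that meta-llama checkpoints are gated on Hugging Face, so the download step may require authentication first (this assumes forge download fetches from the Hugging Face Hub; the token setup below is standard Hugging Face CLI usage):

```bash
# Interactive login stores a token for gated model downloads
huggingface-cli login
# Or export a token for non-interactive environments
export HF_TOKEN=<your-access-token>
```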
Source code is made available under a BSD 3-Clause license; however, you may have other legal obligations that govern your use of other content linked in this repository, such as the license or terms of service for third-party data and models.