Adds support for custom replicas, updates policy to spawn correctly within `spawn_service` #96

allenwang28 · 2025-08-29T18:50:25Z

This PR does a few things:

Moves Replica's create_proc_mesh, spawn_actors, and stop functionality into ForgeActor as `launch` and `shutdown` @classmethods. Why?
- This couples definition of how an actor should be launched with the actor def itself, rather than being in a separate object.
- This gives flexibility for more complex actors, like Policy, which spawns multiple proc meshes and actor types
Uses ForgeActor for everything we expect to be spawned as a service
- GPUManager becomes a regular actor to avoid circular imports (it doesn't really need to be a ForgeActor either)
Modifies Policy to pick up these changes
GRPO example mods:
- Dataset adds **kwargs. launch() doesn't play well with *args, and I couldn't figure out the right way to make args work correctly, so spawn_service only accepts kwargs at the moment.
- Spawns and shuts down all services at the same time to reduce initialization time
vLLM example changes:
- Uses the `with policy.session` context manager, plus general QoL updates
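
To make the launch/shutdown idea above concrete, here is a minimal sketch. It is not the code from this PR: `ForgeActorSketch`, `PolicySketch`, the `num_workers` parameter, and the `running`/`workers` attributes are all hypothetical stand-ins, and no proc meshes are involved. It only illustrates how a subclass can override the two class methods to manage extra resources.

```python
import asyncio


class ForgeActorSketch:
    """Hypothetical stand-in for ForgeActor; not the real class."""

    def __init__(self, **kwargs):
        self.kwargs = kwargs
        self.running = False

    @classmethod
    async def launch(cls, **kwargs):
        # Default path: build one actor and mark it running.
        actor = cls(**kwargs)
        actor.running = True
        return actor

    @classmethod
    async def shutdown(cls, actor):
        # Default path: tear down the single actor.
        actor.running = False


class PolicySketch(ForgeActorSketch):
    """Hypothetical actor that owns extra workers, loosely like Policy."""

    @classmethod
    async def launch(cls, *, num_workers=2, **kwargs):
        policy = await super().launch(**kwargs)
        # Launch the extra workers the policy depends on.
        policy.workers = [
            await ForgeActorSketch.launch(rank=i) for i in range(num_workers)
        ]
        return policy

    @classmethod
    async def shutdown(cls, actor):
        # Stop the workers first, then the policy itself.
        for worker in actor.workers:
            await ForgeActorSketch.shutdown(worker)
        await super().shutdown(actor)


async def demo():
    policy = await PolicySketch.launch(num_workers=3, model="example")
    started = (policy.running, len(policy.workers))
    await PolicySketch.shutdown(policy)
    stopped = (policy.running, any(w.running for w in policy.workers))
    return started, stopped
```

Because the launch logic lives on the class, a generic caller only needs `actor_def.launch(**kwargs)` and `actor_def.shutdown(actor)` and does not have to know how many resources a given actor type creates.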

Copilot

Pull Request Overview

This PR adds support for custom replicas by moving process and actor lifecycle management from Replica into ForgeActor as @classmethod methods. It also updates Policy to use this new pattern and implement correct spawning within the service framework.

Moves replica management functionality from Replica class to ForgeActor as launch() and shutdown() class methods
Updates Policy to implement custom launch/shutdown logic that manages multiple process meshes and actor types
Modifies service spawn/shutdown to use ForgeActor pattern and removes positional arguments support

Reviewed Changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| src/forge/controller/actor.py | Adds `launch()` and `shutdown()` class methods to `ForgeActor` for managing actor lifecycle |
| src/forge/controller/service/replica.py | Removes proc mesh management and delegates to `ForgeActor.launch()`/`shutdown()` |
| src/forge/controller/service/spawn.py | Adds type validation and `shutdown_service()` function, removes positional args |
| src/forge/actors/policy.py | Implements custom `launch()`/`shutdown()` to manage multiple proc meshes |
| tests/unit_tests/test_service.py | Updates test class and shutdown calls to use new service patterns |
| apps/vllm/main.py | Updates to use new service context manager and shutdown function |
| apps/grpo/main.py | Updates DatasetActor constructor and uses concurrent service spawning/shutdown |
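
The summary mentions type validation in spawn.py without showing it. One plausible shape for such a check is sketched below. This is an assumption, not the PR's code: `ForgeActorSketch`, `PlainActor`, and `validate_actor_def` are hypothetical names.

```python
class ForgeActorSketch:
    """Hypothetical stand-in for ForgeActor."""


class PlainActor:
    """Hypothetical actor that does not derive from the stand-in base."""


def validate_actor_def(actor_def):
    # Reject anything that is not a class deriving from the base.
    if not (isinstance(actor_def, type) and issubclass(actor_def, ForgeActorSketch)):
        raise TypeError(
            f"expected a ForgeActorSketch subclass, got {actor_def!r}"
        )
    return actor_def
```

Checking `isinstance(actor_def, type)` first avoids the `TypeError` that `issubclass` itself raises for non-class arguments, so the caller always sees the same error message.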

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

joecummings

Few questions, but overall looks good.

joecummings · 2025-08-29T19:17:19Z

apps/vllm/main.py

    return policy_config, service_config


 async def run_vllm(service_config: ServiceConfig, config: PolicyConfig, prompt: str):


Can't we delete this now that we have the vllm app?

hmm I'm not sure I'm following, we still need a ServiceConfig for spawning a service here regardless?

joecummings · 2025-08-29T19:17:51Z

apps/grpo/main.py

Is it out of scope to move the vllm processing loop within the policy instead of having to start that in main.py?

? yeah it should have been moved in this PR

joecummings · 2025-08-29T19:18:09Z

src/forge/controller/system_controllers/gpu_manager.py



-class GpuManager(ForgeActor):
+class GpuManager(Actor):


Why don't we want this to be a ForgeActor?

Circular import since proc_mesh.py wants to use GpuManager, and ForgeActor uses proc_mesh :/ can be re-arranged later

joecummings · 2025-08-29T19:19:17Z

src/forge/controller/service/spawn.py


 async def spawn_service(
-    service_cfg: ServiceConfig, actor_def: Type[Actor], *actor_args, **actor_kwargs
+    service_cfg: ServiceConfig, actor_def: Type[ForgeActor], **actor_kwargs


Hmmm there will be things that likely need *args, not just **kwargs - is there a long term plan to make that possible?
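
One possible workaround while spawn_service is kwargs-only is to name positional arguments on the caller side before spawning. The sketch below uses the standard `inspect.signature` and `Signature.bind`. `DatasetSketch` and `args_to_kwargs` are hypothetical and not part of this PR.

```python
import inspect


class DatasetSketch:
    """Hypothetical actor with two constructor parameters."""

    def __init__(self, path, split="train"):
        self.path = path
        self.split = split


def args_to_kwargs(actor_def, *args, **kwargs):
    # Bind the call against the constructor signature so every
    # positional argument gets the name of the parameter it fills.
    bound = inspect.signature(actor_def).bind(*args, **kwargs)
    return dict(bound.arguments)


named = args_to_kwargs(DatasetSketch, "data.jsonl", split="test")
# named can now be passed as **named to a kwargs-only entry point.
```

This only helps when the constructor has named parameters. A constructor that itself declares `*args` would still need real positional support.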

Jack-Khuu · 2025-08-29T19:52:43Z

src/forge/actors/policy.py

+        self._run_task: asyncio.Task | None = None
+        self._policy_proc: ProcMesh | None = None
+        self._worker_procs: ProcMesh | None = None


Jack-Khuu · 2025-08-29T19:59:00Z

apps/grpo/main.py

+        dataloader,
+        policy,
+        trainer,
+        replay_buffer,
+        compute_advantages,
+        ref_model,
+        reward_actor,
+    ) = await asyncio.gather(


Nice optimization

Will note that this is harder to read and more error prone if the order gets changed or services list mutated. Not sure if there's a way to get our cake and eat it too

hmm we could do something like:

dataloader_task = spawn_service(...)  # don't await yet
policy_task = spawn_service(...)

then do the bulk await at the end:

dataloader, policy = await asyncio.gather(...)

but I'm not sure if it fully solves the problem
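
Another option for the ordering concern is to key the concurrent spawns by name, so results are looked up by key instead of tuple position. This is a sketch only: `spawn_service_sketch` and `spawn_all` are hypothetical stand-ins that return a string instead of a real service.

```python
import asyncio


async def spawn_service_sketch(name):
    # Hypothetical stand-in for spawn_service; returns a label, not a service.
    await asyncio.sleep(0)
    return f"{name}-service"


async def spawn_all(names):
    # Start every spawn concurrently, keyed by name.
    tasks = {
        name: asyncio.create_task(spawn_service_sketch(name)) for name in names
    }
    # gather returns results in the same order as its inputs.
    results = await asyncio.gather(*tasks.values())
    # Pair results with names so callers look services up by key.
    return dict(zip(tasks.keys(), results))


services = asyncio.run(spawn_all(["dataloader", "policy", "trainer"]))
```

Adding or reordering entries then cannot silently shift a service into the wrong variable, at the cost of `services["policy"]` lookups instead of bare names.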

Allen Wang added 5 commits August 29, 2025 09:49

initial commit - but vllm is frozen

f1e8654

gets grpo working

4436bc6

working on dataset

3165051

fixes grpo

c9062d2

more changes

7018129

meta-cla bot added the CLA Signed label Aug 29, 2025

fix test

d77f736

allenwang28 marked this pull request as ready for review August 29, 2025 19:08

allenwang28 requested review from Jack-Khuu, Copilot, joecummings and pbontrager and removed request for joecummings August 29, 2025 19:12

Copilot AI reviewed Aug 29, 2025

src/forge/controller/service/spawn.py (outdated, resolved)

src/forge/controller/service/replica.py (outdated, resolved)

apps/grpo/main.py (outdated, resolved)

Update src/forge/controller/service/replica.py

8c48bfb

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

joecummings approved these changes Aug 29, 2025

pbontrager reviewed Aug 29, 2025

src/forge/actors/policy.py (outdated, resolved)

Jack-Khuu reviewed Aug 29, 2025

src/forge/actors/policy.py (resolved)

Allen Wang added 2 commits August 29, 2025 12:37

address a few comments

a42c7fb

move policyworker back for diff view

6c19e0b

Jack-Khuu approved these changes Aug 29, 2025

allenwang28 merged commit ccd2377 into meta-pytorch:main Aug 29, 2025
4 checks passed

allenwang28 deleted the policy_replica branch August 29, 2025 23:57

4 participants
