[agent][Fix] Fix `SkyRLAgentPPOTrainer` after switch to `async` by SumanthRH · Pull Request #1237 · NovaSky-AI/SkyRL

SumanthRH · 2026-02-28T05:06:43Z

What does this PR do?

Fixes SkyRLAgentPPOTrainer after #1235. Previously, SkyRLAgentPPOTrainer.train was a sync function, even though the base class's method RayPPOTrainer.train was made async in #868. Training still progressed as usual, but it would have errored out at the end of training, when asyncio.run(...) evaluated the return value.

This PR is a follow-up to #1235 that converts SkyRLAgentPPOTrainer.train to an async function.
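
The failure mode described above can be reproduced with a minimal sketch. The class names `BaseTrainer`, `SyncSubclassTrainer`, and `AsyncSubclassTrainer` and the helper `run_entrypoint` are hypothetical stand-ins, not SkyRL code; the sketch only assumes that the entrypoint wraps the call as `asyncio.run(trainer.train())`.

```python
import asyncio


class BaseTrainer:
    """Hypothetical stand-in for a base class whose train() is async."""

    async def train(self):
        self.steps_run = 0


class SyncSubclassTrainer(BaseTrainer):
    """Overrides train() with a plain function (the bug pattern)."""

    def train(self):
        # The whole "training loop" runs right here, while the argument
        # to asyncio.run(...) is still being evaluated.
        self.steps_run = 3
        return None


class AsyncSubclassTrainer(BaseTrainer):
    """Overrides train() with a coroutine function (the fixed pattern)."""

    async def train(self):
        self.steps_run = 3


def run_entrypoint(trainer):
    """Mimics an entrypoint that calls asyncio.run(trainer.train())."""
    try:
        asyncio.run(trainer.train())
        return "ok"
    except (ValueError, TypeError) as exc:
        # asyncio.run rejects a non-awaitable argument; the exception type
        # differs between Python versions, so both are caught here.
        return f"failed after training: {type(exc).__name__}"


sync_trainer = SyncSubclassTrainer()
sync_result = run_entrypoint(sync_trainer)

async_trainer = AsyncSubclassTrainer()
async_result = run_entrypoint(async_trainer)

print(sync_trainer.steps_run, sync_result)
print(async_trainer.steps_run, async_result)
```

In the sync case the work completes before `asyncio.run` ever sees its argument, which matches the observation that training progressed normally and the error would only surface at the very end.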

Signed-off-by: SumanthRH <sumanthrh99@gmail.com>

gemini-code-assist

Code Review

This pull request correctly transitions the SkyRLAgentPPOTrainer.train method to be asynchronous, aligning it with the async method in its base class. The changes replace blocking asyncio.run() calls with non-blocking await expressions, which is the correct approach for handling coroutines within an async function. The asyncio import is also correctly removed as it's no longer used with these changes. I have one point of feedback regarding blocking calls that remain in the train method.
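
As a rough illustration of why the `asyncio.run()` calls have to become `await` expressions once the method is async: `asyncio.run()` refuses to start when an event loop is already running. The `wake_up` coroutine below is a hypothetical stand-in that only borrows the name and `tags` argument from the diff; it is not the inference engine client's implementation.

```python
import asyncio


async def wake_up(tags):
    """Hypothetical stand-in for an async client call."""
    await asyncio.sleep(0)
    return "awake:" + ",".join(tags)


async def train_with_nested_run():
    """Calls asyncio.run() from inside a coroutine, which is rejected."""
    coro = wake_up(tags=["weights"])
    try:
        return asyncio.run(coro)
    except RuntimeError as exc:
        coro.close()  # avoid a "coroutine was never awaited" warning
        return f"error: {exc}"


async def train_with_await():
    """Awaits the coroutine directly on the already-running loop."""
    return await wake_up(tags=["weights"])


nested_result = asyncio.run(train_with_nested_run())
awaited_result = asyncio.run(train_with_await())

print(nested_result)
print(awaited_result)
```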

gemini-code-assist · 2026-02-28T05:29:51Z

skyrl-agent/skyrl_agent/integrations/skyrl_train/trainer.py

        if self.colocate_all:
            self.policy_model.offload_to_cpu(offload_optimizer=True, offload_model=False)
-            asyncio.run(self.inference_engine_client.wake_up(tags=["weights"]))
+            await self.inference_engine_client.wake_up(tags=["weights"])


While this change to use await is correct, the train method still contains blocking calls like ray.get() on lines 302 and 422. These calls will block the asyncio event loop, which can negate the benefits of making this method asynchronous. To make this method fully non-blocking, these should be replaced with asynchronous equivalents. For example, you can await Ray's ObjectRefs, possibly using asyncio.gather for lists of them. This would require re-importing asyncio.

This is correct and meant to be synchronous.
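
For context on the concern raised in the review comment, here is a small standard-library sketch of how a blocking call inside a coroutine stalls other tasks on the same event loop. `blocking_fetch` is a hypothetical stand-in for a call such as `ray.get()`; Ray itself is not used, and `asyncio.to_thread` is just one generic way to move a blocking call off the loop, not something this PR does. As the reply notes, the calls in this trainer are intentionally synchronous.

```python
import asyncio
import time


def blocking_fetch(delay):
    """Hypothetical stand-in for a blocking call such as ray.get()."""
    time.sleep(delay)
    return delay


async def count_ticks(counter, stop):
    """Background task that can only make progress while the loop is free."""
    while not stop.is_set():
        counter["ticks"] += 1
        await asyncio.sleep(0.005)


async def ticks_during_fetch(offload):
    """Returns how many ticks the background task made during the fetch."""
    counter = {"ticks": 0}
    stop = asyncio.Event()
    ticker = asyncio.create_task(count_ticks(counter, stop))
    await asyncio.sleep(0)  # let the ticker start
    before = counter["ticks"]
    if offload:
        # Run the blocking call in a worker thread; the loop stays free.
        await asyncio.to_thread(blocking_fetch, 0.1)
    else:
        # Run the blocking call directly; the loop is stalled meanwhile.
        blocking_fetch(0.1)
    during = counter["ticks"] - before
    stop.set()
    await ticker
    return during


blocked_ticks = asyncio.run(ticks_during_fetch(offload=False))
offloaded_ticks = asyncio.run(ticks_during_fetch(offload=True))

print(blocked_ticks, offloaded_ticks)
```

When nothing else needs to run on the loop during the call, as in a training step that must finish before the next one begins, the direct blocking form is simpler and loses nothing.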

The new trainer uses the equivalent dispatch.save_weights_for_sampler() method

Signed-off-by: SumanthRH <sumanthrh99@gmail.com>

SumanthRH added 2 commits February 28, 2026 05:00

fix skyrlagent trainer after switch to async

4c43d6b

Signed-off-by: SumanthRH <sumanthrh99@gmail.com>

x

a2911a1

Signed-off-by: SumanthRH <sumanthrh99@gmail.com>

SumanthRH marked this pull request as ready for review February 28, 2026 05:27

gemini-code-assist bot reviewed Feb 28, 2026


This comment was marked as resolved.


x

1b13711

Signed-off-by: SumanthRH <sumanthrh99@gmail.com>

SumanthRH merged commit b2f6105 into main Feb 28, 2026

SumanthRH merged 3 commits into main from fix-async-skyrlagent

SumanthRH commented Feb 28, 2026 • edited by devin-ai-integration bot
