Skip to content

feat: add AsyncRolloutManager for native async per-prompt rollouts#2566

Draft
yuki-97 wants to merge 8 commits into
yukih/per-prompt-rollout-gymfrom
yukih/per-prompt-rollout-native-async
Draft

feat: add AsyncRolloutManager for native async per-prompt rollouts#2566
yuki-97 wants to merge 8 commits into
yukih/per-prompt-rollout-gymfrom
yukih/per-prompt-rollout-native-async

Conversation

@yuki-97
Copy link
Copy Markdown
Contributor

@yuki-97 yuki-97 commented May 26, 2026

Issue

Part of RL-729. Based on #2528.

Summary

Adds AsyncRolloutManager to rollout_manager.py to support native async per-prompt multi-turn rollouts.

  • run_rollout fans out num_generations_per_prompt concurrent _run_single_rollout coroutines via asyncio.gather and returns a PromptGroupRecord.
  • _run_single_rollout is a pure mirror of run_sample_multi_turn_rollout (multi-reward support dropped for now).
  • _generate_response mirrors async_generate_response_for_sample_turn + generate_responses_async, with simplified logic (single-sample path, no batching indirection).

Tests

  • Extended tests/unit/experience/test_rollouts.py with unit tests for AsyncRolloutManager.

yuki-97 added 8 commits May 25, 2026 23:07
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
… support for now

Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
Signed-off-by: Yuki Huang <yukih@nvidia.com>
…nc and cleanup unused

Signed-off-by: Yuki Huang <yukih@nvidia.com>
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented May 26, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant