[multi-lora] skip pause/resume for lora with merge=false by erictang000 · Pull Request #1677 · NovaSky-AI/SkyRL

erictang000 · 2026-05-16T01:35:59Z

pausing and resuming generation are not needed when updating lora weights in vLLM - we can simply rely on load_lora_adapter (https://docs.vllm.ai/en/stable/features/lora/#in-place-lora-reloading) to directly update the weights in the engine without worrying about corrupted weight state.

Results for fully async RL with megatron + vllm LoRA - we can see that removing the pause/resume for the merge lora = false case still works and has an identical reward curve!

gemini-code-assist

Code Review

This pull request adds a training script for fully async GRPO on GSM8K with Megatron and LoRA. It also updates the weight synchronization logic to avoid pausing generation when using non-merged LoRA. Feedback indicates that the condition for skipping the pause is too broad and could cause errors on non-Megatron strategies; a more robust implementation using getattr and a backend check for vLLM is recommended.

…1678) Reverts #1657, since we can skip pause/resume for multi-tenant lora after #1677

…_skip_pause

erictang000 added 2 commits May 16, 2026 00:01

x'

40b711b

x

a25052f

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Comment thread skyrl/backends/skyrl_train/workers/worker_dispatch.py Outdated

erictang000 mentioned this pull request May 16, 2026

Revert "[lora][tinker] Add pause and resume for multi-tenant lora " #1678

Merged

erictang000 added a commit that referenced this pull request May 16, 2026

Revert "[lora][tinker] Add pause and resume for multi-tenant lora " (#…

ddc68ee

…1678) Reverts #1657, since we can skip pause/resume for multi-tenant lora after #1677

erictang000 added 5 commits May 16, 2026 20:29

Merge branch 'main' of https://github.com/erictang000/SkyRL into lora…

4c0ead3

…_skip_pause

x'

4f41d3e

Merge branch 'main' of https://github.com/erictang000/SkyRL into lora…

4e85ad3

…_skip_pause

fix cpu tests and add integration tests

db348d4

x

2e00868

erictang000 merged commit 654d7ab into NovaSky-AI:main May 16, 2026
4 of 5 checks passed

erictang000 deleted the lora_skip_pause branch May 16, 2026 21:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[multi-lora] skip pause/resume for lora with merge=false#1677

[multi-lora] skip pause/resume for lora with merge=false#1677
erictang000 merged 7 commits into
NovaSky-AI:mainfrom
erictang000:lora_skip_pause

erictang000 commented May 16, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

erictang000 commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

erictang000 commented May 16, 2026 •

edited

Loading