feat: Add merged vLLM rollout weights#631

Merged
vivekkalyan merged 7 commits into main from feat/merged-inference on Mar 25, 2026

Conversation


@vivekkalyan vivekkalyan commented Mar 25, 2026

Enable ART to serve merged LoRA weights through a dedicated vLLM server, so that Qwen3.5-MoE training works on the current vLLM build.

Changes

  • Add rollout_weights_mode: "lora" | "merged" with dedicated-only validation and a merged-mode requirement for Qwen/Qwen3.5-35B-A3B and Qwen/Qwen3.5-397B-A17B
  • Push merged weights into dedicated vLLM with native weight transfer while keeping LoRA checkpoints for training and persistence
  • Update dedicated server wiring, validation, and Qwen3.5 smoke scripts/tests for the merged-inference path
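The first bullet's validation rules can be sketched as a small check. This is a hypothetical sketch, not the PR's actual code: `validate_rollout_weights_mode` and its signature are invented for illustration; only the model names and the two mode values come from the PR description.

```python
# Sketch of the rollout_weights_mode rules described above (illustrative only;
# the function name and signature are assumptions, not ART's real API).
QWEN3_5_MOE_MODELS = {
    "Qwen/Qwen3.5-35B-A3B",
    "Qwen/Qwen3.5-397B-A17B",
}


def validate_rollout_weights_mode(
    base_model: str,
    rollout_weights_mode: str,  # "lora" | "merged"
    dedicated_server: bool,
) -> None:
    if rollout_weights_mode not in ("lora", "merged"):
        raise ValueError(f"unknown rollout_weights_mode: {rollout_weights_mode!r}")
    # Merged-weight pushing is only wired up for the dedicated vLLM server.
    if rollout_weights_mode == "merged" and not dedicated_server:
        raise ValueError("rollout_weights_mode='merged' requires a dedicated vLLM server")
    # Qwen3.5 MoE models cannot serve raw LoRA adapters on the current vLLM build.
    if base_model in QWEN3_5_MOE_MODELS and rollout_weights_mode != "merged":
        raise ValueError(f"{base_model} requires rollout_weights_mode='merged'")
```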

@vivekkalyan vivekkalyan requested a review from bradhilton March 25, 2026 17:50

QWEN3_5_MOE_MODELS = {
    "Qwen/Qwen3.5-35B-A3B",
    "Qwen/Qwen3.5-397B-A17B",
}
Collaborator


Are we able to support training for 397B-A17B?

Collaborator Author


Not tested yet, but it should work with Megatron. We need to add the merging logic to our Megatron service as well.

)
response.raise_for_status()

peft_model.merge_adapter()
Collaborator


Would this be more intuitive if we performed the merging in _merged_checkpoint_weights?

Collaborator Author


I prefer to leave _merged_checkpoint_weights to only collect the checkpoint-format weights and normalize the names into the surface vLLM expects.

This keeps the pause, merge, send-weights, unmerge, resume sequence together in _sync_merged_weights, which makes the flow clearer to follow.

I renamed the function to _merged_checkpoint_weights_for_vllm to make it a little clearer that it's just a transformation function.
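The pause → merge → push → unmerge → resume flow described above could be sketched roughly as follows. The merge_adapter()/unmerge_adapter() calls are real PEFT methods, but the server client, its method names, and the key-normalization helper here are illustrative stand-ins, not ART's actual implementation:

```python
def sync_merged_weights(peft_model, server) -> None:
    """Push merged LoRA weights into a dedicated vLLM server (sketch).

    `peft_model` is expected to expose merge_adapter()/unmerge_adapter()
    (as peft.PeftModel does) plus state_dict(); `server` is a hypothetical
    stand-in for the dedicated-server client.
    """
    server.pause_generation()       # stop rollouts while weights change
    peft_model.merge_adapter()      # fold LoRA deltas into the base weights
    try:
        # Normalize checkpoint-format names into the surface vLLM expects.
        weights = merged_checkpoint_weights_for_vllm(peft_model.state_dict())
        server.load_weights(weights)  # native (non-LoRA) weight transfer
    finally:
        peft_model.unmerge_adapter()  # restore the trainable LoRA view
    server.resume_generation()


def merged_checkpoint_weights_for_vllm(state_dict):
    # Illustrative name normalization only; the real mapping is model-specific.
    return {
        name.replace("base_model.model.", ""): tensor
        for name, tensor in state_dict.items()
        if "lora_" not in name  # merged weights only; drop adapter tensors
    }
```

The try/finally guard keeps the model trainable even if the weight transfer fails, which is one reason to keep the whole sequence in a single function.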

Collaborator


We'll probably need to configure separate target_modules for the Qwen3.5 MoE models.
This is the config I've been using when training Qwen3.5, but it's worth double-checking against the latest Unsloth version.

target_modules=[
    # Full attention layers (25% of layers)
    "q_proj", "k_proj", "v_proj", "o_proj",
    # DeltaNet linear attention layers (75% of layers)
    "in_proj_qkv", "in_proj_z", "in_proj_b", "in_proj_a", "out_proj",
    # MLP (all layers)
    "gate_proj", "up_proj", "down_proj",
    # MoE shared expert gate (only present in MoE models, ignored for dense)
    "shared_expert_gate",
],

@vivekkalyan vivekkalyan merged commit fb26124 into main Mar 25, 2026
5 checks passed
@vivekkalyan vivekkalyan deleted the feat/merged-inference branch March 25, 2026 23:51