Qwen 3.5 MoE Metal: Use max-sized prefill example for dynamic inputs by manuelcandales · Pull Request #18956 · pytorch/executorch

manuelcandales · 2026-04-16T22:04:22Z

With alloc_graph_input=False, ExecuTorch sets the input tensor's
numel_bound_ from the serialized example size. A small example (T=2)
prevents runtime inputs larger than 2 tokens. Use max_seq_len-1 as
the prefill example size so any prompt length is accepted at runtime.

Authored with Claude.

[ghstack-poisoned]

manuelcandales · 2026-04-16T22:04:23Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2026-04-16T22:04:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18956

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[CI[B200] Smoke test encounters CUDA Unknown error for dgxb200-03 and dgxb200-04

❌ 1 New Failure, 1 Cancelled Job, 4 Unrelated Failures

As of commit 58fe35f with merge base 5707e2a ():

NEW FAILURE - The following job has failed:

pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_mv3_model

CANCELLED JOB - The following job was cancelled. Please retry:

pull / unittest / windows / windows-job (gh)

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-models-linux (resnet18, xnnpack-quantization-delegation, linux.2xlarge) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / test-models-linux (resnet50, portable, linux.2xlarge) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / test-samsung-quantmodels-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

With alloc_graph_input=False, ExecuTorch sets the input tensor's numel_bound_ from the serialized example size. A small example (T=2) prevents runtime inputs larger than 2 tokens. Use max_seq_len-1 as the prefill example size so any prompt length is accepted at runtime. Authored with Claude. ghstack-source-id: 7118150 ghstack-comment-id: 4263712315 Pull-Request: #18956

Update

58fe35f

[ghstack-poisoned]

manuelcandales requested review from larryliu0820, lucylq and mergennachin as code owners April 16, 2026 22:04

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 16, 2026

manuelcandales requested review from metascroy and removed request for larryliu0820 and lucylq April 16, 2026 22:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen 3.5 MoE Metal: Use max-sized prefill example for dynamic inputs#18956

Qwen 3.5 MoE Metal: Use max-sized prefill example for dynamic inputs#18956
manuelcandales wants to merge 1 commit intogh/manuelcandales/176/headfrom
gh/manuelcandales/177/head

manuelcandales commented Apr 16, 2026

Uh oh!

manuelcandales commented Apr 16, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Apr 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

manuelcandales commented Apr 16, 2026

Uh oh!

manuelcandales commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18956

❗ 1 Active SEVs

❌ 1 New Failure, 1 Cancelled Job, 4 Unrelated Failures

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

manuelcandales commented Apr 16, 2026 •

edited

Loading

pytorch-bot bot commented Apr 16, 2026 •

edited

Loading