enroll mxtensor in vllm integration tests #3081

vkuzo · 2025-09-26T14:57:29Z

Summary:

1. Enrolls `MXTensor` in the existing vLLM slice and copy test, and makes it pass by moving to `TorchAOBaseTensor`'s copy implementation.
2. Adds an additional test for vLLM narrow, and makes that test pass by fixing an incorrect slice implementation. This test may be useful for other tensor subclasses; they can opt in through separate PRs.
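The summary does not say what was wrong with the slice implementation, so the following is only a rough sketch of the index arithmetic that slicing a packed, block-scaled tensor involves. It assumes an mxfp4-style layout with two 4-bit values per byte along the sliced dimension and one scale per 32-element block. The helper names `narrow_as_slice` and `packed_ranges` are hypothetical and do not come from torchao or vLLM.

```python
from typing import Tuple

# Assumed layout constants for this illustration (not read from torchao).
BLOCK_SIZE = 32      # logical elements that share one scale
ELEMS_PER_BYTE = 2   # two 4-bit values packed into one byte


def narrow_as_slice(start: int, length: int) -> slice:
    """narrow(dim, start, length) selects the same elements as [start:start + length]."""
    return slice(start, start + length)


def packed_ranges(logical: slice) -> Tuple[slice, slice]:
    """Map a slice over logical elements to slices over packed bytes and block scales."""
    if logical.start % BLOCK_SIZE or logical.stop % BLOCK_SIZE:
        # A slice that cuts through a block would split a shared scale.
        raise ValueError("slice must be aligned to the scaling block size")
    data = slice(logical.start // ELEMS_PER_BYTE, logical.stop // ELEMS_PER_BYTE)
    scale = slice(logical.start // BLOCK_SIZE, logical.stop // BLOCK_SIZE)
    return data, scale


logical = narrow_as_slice(start=64, length=128)
data_range, scale_range = packed_ranges(logical)
print(logical, data_range, scale_range)
```

A narrow and the equivalent slice should resolve to the same byte and scale ranges, which is the kind of property a narrow test can check against the slice path.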

Test Plan:

pytest test/prototype/mx_formats/ -s -x

Also, this PR enables running mxfp4 weight-only Qwen MoE models in vLLM.


vkuzo · 2025-09-26T14:57:30Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-09-26T14:57:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3081

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f962851 with merge base a53a4db:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary:

1. enrolls mxtensor in existing vllm slice and copy test, make it pass by moving to TorchAOBaseTensor's copy
2. add an additional test for vllm narrow, make that test pass by fixing an incorrect slice implementation. This may be useful for other tensor, they can opt-in in separate PRs.

Test Plan:

```
pytest test/prototype/mx_formats/ -s -x
```

also, this PR enables running mxfp4 weight-only Qwen MoE models in vllm

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: e79dfc3
ghstack-comment-id: 3339065416
Pull Request resolved: #3081

[ghstack-poisoned]

Summary:

1. enrolls mxtensor in existing vllm slice and copy test, make it pass by moving to TorchAOBaseTensor's copy
2. add an additional test for vllm narrow, make that test pass by fixing an incorrect slice implementation. This may be useful for other tensor, they can opt-in in separate PRs.

Test Plan:

```
pytest test/prototype/mx_formats/ -s -x
```

also, this PR enables running mxfp4 weight-only Qwen MoE models in vllm

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: bd1c041
ghstack-comment-id: 3339065416
Pull Request resolved: #3081

[ghstack-poisoned]

Summary:

1. enrolls mxtensor in existing vllm slice and copy test, make it pass by moving to TorchAOBaseTensor's copy
2. add an additional test for vllm narrow, make that test pass by fixing an incorrect slice implementation. This may be useful for other tensor, they can opt-in in separate PRs.

Test Plan:

```
pytest test/prototype/mx_formats/ -s -x
```

also, this PR enables running mxfp4 weight-only Qwen MoE models in vllm

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: 609a88e
ghstack-comment-id: 3339065416
Pull Request resolved: #3081

[ghstack-poisoned]


vkuzo added 15 commits September 25, 2025 12:56

Update 08faa8e
Update c97533c
Update b88afaf
Update 00d2634
Update 5a840c1
Update ff57676
Update 4edba12
Update 6d6e465
Update 1f1fc5e
Update f58607e
Update 263ad98
Update 235494e
Update ebd3226
Update b9dbfa8
Update 19ac204

vkuzo mentioned this pull request Sep 26, 2025

expose emulation in mxfp4 inference workflow #3066

Merged

meta-cla bot added the CLA Signed label Sep 26, 2025

vkuzo added 3 commits September 26, 2025 08:00

Update 15248b1
Update 1cc9581
Update 3ea5b94

vkuzo added 4 commits September 26, 2025 12:33

Update 4c8a966
Update 088a286
Update 1b0ec76
Update 3d22740

vkuzo added 5 commits September 26, 2025 12:35

Update 9ea1221
Update e9cea19
Update 4fb76ae
Update e4f2855
Update 9e7094e

vkuzo added 4 commits September 26, 2025 12:36

Update 1b57f52
Update 2e73cef
Update 0f12582
Update e3c719a

vkuzo added 3 commits September 26, 2025 12:37

Update 5d4f713
Update deeef68
Update f07fb27

vkuzo added 2 commits September 26, 2025 12:38

Update 3467ee3
Update e7de5db

jerryzh168 approved these changes Sep 26, 2025


Update f962851

vkuzo changed the base branch from gh/vkuzo/127/head to main September 27, 2025 00:35

This was referenced Sep 27, 2025

nvfp4tensor: improve printing #3086

Merged

nvfp4tensor: remove duplicated tests #3090

Merged

vkuzo merged commit de92bdc into main Sep 29, 2025
50 checks passed
