enable select for NVFP4Tensor #3117

vkuzo · 2025-10-02T12:53:07Z

Summary:

This is useful for vLLM 2d -> 3d MoE weight surgery

Test Plan:

unit tests:

pytest test/prototype/mx_formats/ -s

Also, after this PR we can run a Qwen 1.5 MoE model quantized with nvfp4
in vLLM, with vllm-project/vllm#25480

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-10-02T12:53:09Z

Stack from ghstack (oldest at bottom):

Summary: This is useful for vLLM 2d -> 3d MoE weight surgery

Test Plan: unit tests:

```
pytest test/prototype/mx_formats/ -s
```

Also, after this PR we can run a Qwen 1.5 MoE model quantized with nvfp4 in vLLM, with vllm-project/vllm#25480

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: db79143
ghstack-comment-id: 3361104836
Pull-Request: #3117

pytorch-bot · 2025-10-02T12:53:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3117

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b245367 with merge base 8955739:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

Summary: This is useful for vLLM 2d -> 3d MoE weight surgery

Test Plan: unit tests:

```
pytest test/prototype/mx_formats/ -s
```

Also, after this PR we can run a Qwen 1.5 MoE model quantized with nvfp4 in vLLM, with vllm-project/vllm#25480

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: 6d7d385
ghstack-comment-id: 3361104836
Pull-Request: #3117

jcaip · 2025-10-02T20:48:59Z

torchao/prototype/mx_formats/nvfp4_tensor.py

+    new = old.__class__(
+        old.qdata[index],
+        old._scale_e4m3[index],
+        old._block_size,


Should a dim get knocked off block size after you select?

vkuzo · 2025-10-03

Currently, block_size is an integer for this tensor (16 for NVFP4). If we change it to a multidimensional block, we'd have to update this code.
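To make the point about the integer block size concrete, here is a minimal sketch in plain Python. It is not torchao's implementation: `ToyBlockScaledTensor`, `select0`, and `shape` are hypothetical names, nested lists stand in for tensors, and the layout (two 4-bit values packed per byte, one scale per 16 elements of the last dim) is an assumption made for illustration. It shows that indexing the data and the scales with the same index along dim 0 leaves an integer block size untouched.

```python
from dataclasses import dataclass

BLOCK_SIZE = 16  # NVFP4 block size, as stated in the reply above


@dataclass
class ToyBlockScaledTensor:
    """Hypothetical stand-in for a block-scaled tensor (not torchao's NVFP4Tensor)."""

    qdata: list  # assumed shape (E, N, K // 2): two 4-bit values per byte
    scale: list  # assumed shape (E, N, K // block_size): one scale per block
    block_size: int  # blocks run along the last dim only

    def select0(self, index: int) -> "ToyBlockScaledTensor":
        # Index data and scales with the same index along dim 0.
        # The integer block size describes the last dim, so it is reused as-is.
        return ToyBlockScaledTensor(
            self.qdata[index], self.scale[index], self.block_size
        )


def shape(x) -> tuple:
    """Shape of a rectangular nested list."""
    dims = []
    while isinstance(x, list):
        dims.append(len(x))
        x = x[0]
    return tuple(dims)


E, N, K = 4, 8, 64  # illustrative sizes: experts, rows, columns
w3d = ToyBlockScaledTensor(
    qdata=[[[0] * (K // 2) for _ in range(N)] for _ in range(E)],
    scale=[[[1.0] * (K // BLOCK_SIZE) for _ in range(N)] for _ in range(E)],
    block_size=BLOCK_SIZE,
)

w2d = w3d.select0(1)  # pick one 2d slice out of the 3d weight
print(shape(w2d.qdata), shape(w2d.scale), w2d.block_size)
```

If the block size were a per-dimension tuple instead of an integer, `select0` would also need to drop the entry for the selected dim, which is the update the reply refers to.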

Update

9c10d0c

[ghstack-poisoned]

vkuzo mentioned this pull request Oct 2, 2025

enable 3d weights for NVFP4Tensor #3109

Open

meta-cla bot added the CLA Signed label Oct 2, 2025

vkuzo added the topic: improvement label Oct 2, 2025

Update

b245367

[ghstack-poisoned]

jcaip reviewed Oct 2, 2025


jcaip approved these changes Oct 2, 2025

