Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a #2979

danielvegamyhre · 2025-09-11T02:13:05Z

I think we need to explicitly add the nvcc flags to build for sm100a in the extension itself. Right now, we check if the build_for_sm100a flag is true, which is set to true for cuda 12.8+, but we don't actually modify the nvcc args passed in to build the extension.

Seems like building from source is accidentally working? Looking into this..

Test plan

CI Job building for CUDA 12.8 DOES have building 'torchao.prototype.mxfp8_cuda' extension logs but does NOT have this warning ("MXFP8 quantization requires SM90+ (Hopper) or SM100+ (Blackwell) architecture. Kernel will be disabled for this architecture.") in it: https://github.com/pytorch/ao/actions/runs/17631942858/job/50100852741
- (previously, without this fix, in the CI build logs indicating the extension was being built, but also see the warning that cuda arch not supported so kernel will not be built)

…only build runners

pytorch-bot · 2025-09-11T02:13:08Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2979

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8514b11 with merge base 83e8e60 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

drisspg · 2025-09-11T12:48:57Z

We have separate 100a modules if you look lower in the file

danielvegamyhre · 2025-09-11T15:39:19Z

We have separate 100a modules if you look lower in the file

Are you referring to this?

ao/setup.py

Line 706 in cc35151

# Only build the cutlass_100a extension if sm100a is in the architecture flags

setup.py

danielvegamyhre · 2025-09-11T16:58:38Z

Confirmed build is successful and pytest test/prototype/mx_formats/test_mx_linear.py -k test_linear_eager_vs_hp still passes

Add nvcc flags for building MXFP8 dim1 cast kernel for sm100a on CPU-…

6b1b451

…only build runners

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 11, 2025

danielvegamyhre added ci mx topic: bug fix Use this tag for PRs that fix bugs labels Sep 11, 2025

danielvegamyhre changed the title ~~Add nvcc flags for building MXFP8 dim1 cast kernel for sm100a on CPU-only build runners~~ Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a Sep 11, 2025

danielvegamyhre requested review from drisspg and vkuzo September 11, 2025 02:58

drisspg reviewed Sep 11, 2025

View reviewed changes

setup.py Outdated Show resolved Hide resolved

build for sm100 and sm120

8514b11

drisspg approved these changes Sep 11, 2025

View reviewed changes

danielvegamyhre merged commit f1e118b into main Sep 11, 2025
34 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a #2979

Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a #2979

Uh oh!

danielvegamyhre commented Sep 11, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 11, 2025 •

edited

Loading

Uh oh!

drisspg commented Sep 11, 2025

Uh oh!

danielvegamyhre commented Sep 11, 2025

Uh oh!

Uh oh!

danielvegamyhre commented Sep 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a #2979

Add nvcc flags to explicitly build mxfp8 dim1 cast kernel for sm100a #2979

Uh oh!

Conversation

danielvegamyhre commented Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test plan

Uh oh!

pytorch-bot bot commented Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2979

✅ No Failures

Uh oh!

drisspg commented Sep 11, 2025

Uh oh!

danielvegamyhre commented Sep 11, 2025

Uh oh!

Uh oh!

danielvegamyhre commented Sep 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

danielvegamyhre commented Sep 11, 2025 •

edited

Loading

pytorch-bot bot commented Sep 11, 2025 •

edited

Loading