[ROCm] Refine from_recipe_name to support mxfp8 on rocm. by RuibinCheung · Pull Request #3620 · pytorch/ao

RuibinCheung · 2026-01-12T08:31:03Z

Summary

Support mxfp8 on gfx950 by refine MXLinearConfig.from_recipe_name

It will encounter error when called triton's quantize kernel with rceil on rocm because PTX instructions was called in kernel. So we chose implementation of torch.compile to workaround.

TODO

Implement quantize kernel with rceil mode in triton.

pytorch-bot · 2026-01-12T08:31:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3620

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 60a5e32 with merge base 4b3ebc4 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vkuzo · 2026-01-12T11:36:24Z

torchao/prototype/mx_formats/config.py

            return MXLinearConfig(
                kernel_preference=KernelPreference.AUTO,
-                mxfp8_cast_kernel_choice=MXFP8Dim1CastKernelChoice.CUDA,
+                mxfp8_cast_kernel_choice=MXFP8Dim1CastKernelChoice.TRITON


style nit: can we rewrite as below to improve readability

# add a descriptive comment here mxfp8_cast_kernel_choice = MXFP8Dim1CastKernelChoice.TRITON is if_ROCM() else MXFP8Dim1CastKernelChoice.CUDA return MXLinearConfig(..., mxfp8_cast_kernel_choice)

ACK. Thanks.

vkuzo · 2026-01-12T11:36:53Z

lgtm, could we update the style per my comment

RuibinCheung · 2026-01-13T02:28:43Z

lgtm, could we update the style per my comment

Thanks for your suggestion. But the ruff-format will modify the code if I follow your style. I can't find a method to disable ruff-format in a line.

Do you have any further suggestions?

RuibinCheung · 2026-01-15T02:13:20Z

Hi @vkuzo, I saw my PR was blocked by Dr.CI. Could you help me trigger it ?

* Support mxfp8 on gfx950. It depends on TorchAO (pytorch/ao#3620).

* [ROCm] Refine from_recipe_name to support mxfp8 on rocm. * resolve reviewer issue

* Support mxfp8 on gfx950. It depends on TorchAO (pytorch/ao#3620).

[ROCm] Refine from_recipe_name to support mxfp8 on rocm.

8632932

pytorch-bot bot added the device: rocm label Jan 12, 2026

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 12, 2026

vkuzo reviewed Jan 12, 2026

View reviewed changes

xiaobochen-amd mentioned this pull request Jan 12, 2026

[ROCM] Add MI350 support for MXFP8 colwise quantization. #3544

Closed

resolve reviewer issue

60a5e32

RuibinCheung requested a review from vkuzo January 13, 2026 02:29

RuibinCheung mentioned this pull request Jan 13, 2026

[ROCm] Support mxfp8 on gfx950. pytorch/torchtitan#2222

Merged

vkuzo added the topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories) label Jan 13, 2026

vkuzo merged commit f222b9e into pytorch:main Jan 15, 2026
21 of 22 checks passed

RuibinCheung deleted the rocm/support_mxfp8 branch January 16, 2026 02:17

tianyu-l pushed a commit to pytorch/torchtitan that referenced this pull request Jan 20, 2026

[ROCm] Support mxfp8 on gfx950. (#2222)

a25dd8f

* Support mxfp8 on gfx950. It depends on TorchAO (pytorch/ao#3620).

jcaip pushed a commit that referenced this pull request Jan 22, 2026

[ROCm] Refine from_recipe_name to support mxfp8 on rocm. (#3620)

dcae9c6

* [ROCm] Refine from_recipe_name to support mxfp8 on rocm. * resolve reviewer issue

wwwjn pushed a commit to wwwjn/torchtitan that referenced this pull request Jan 30, 2026

[ROCm] Support mxfp8 on gfx950. (pytorch#2222)

1bd4b61

* Support mxfp8 on gfx950. It depends on TorchAO (pytorch/ao#3620).

xrsrke pushed a commit to NousResearch/torchtitan that referenced this pull request Feb 13, 2026

[ROCm] Support mxfp8 on gfx950. (pytorch#2222)

d60851e

* Support mxfp8 on gfx950. It depends on TorchAO (pytorch/ao#3620).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ROCm] Refine from_recipe_name to support mxfp8 on rocm.#3620

[ROCm] Refine from_recipe_name to support mxfp8 on rocm.#3620
vkuzo merged 2 commits intopytorch:mainfrom
RuibinCheung:rocm/support_mxfp8

RuibinCheung commented Jan 12, 2026

Uh oh!

pytorch-bot bot commented Jan 12, 2026 •

edited

Loading

Uh oh!

vkuzo Jan 12, 2026

Uh oh!

RuibinCheung Jan 13, 2026

Uh oh!

vkuzo commented Jan 12, 2026

Uh oh!

RuibinCheung commented Jan 13, 2026 •

edited

Loading

Uh oh!

RuibinCheung commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RuibinCheung commented Jan 12, 2026

Summary

TODO

Uh oh!

pytorch-bot bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3620

✅ No Failures

Uh oh!

vkuzo Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

RuibinCheung Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

vkuzo commented Jan 12, 2026

Uh oh!

RuibinCheung commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RuibinCheung commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Jan 12, 2026 •

edited

Loading

RuibinCheung commented Jan 13, 2026 •

edited

Loading