test rowwise fp32 #2431
Conversation
🔗 Helpful links: see artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2431. Note: links to docs will display an error until the docs builds have completed.

❌ 1 new failure as of commit b0240a2 with merge base 2025b75.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D73552660
```
if convert_dtypes_for_rowwise_scaled_mm and is_rowwise_scaling:
    output_dtype = torch.bfloat16
```
Instead of adding a flag, TBH I think we can just enable this on by default, like this, and file an issue in PyTorch core to add float32 output to scaled_mm:
```
output_dtype_to_use = output_dtype
if is_rowwise_scaling:
    # work around torch._scaled_mm not having a float32 output type
    # TODO(issue number): remove this once torch._scaled_mm supports float32 output
    output_dtype_to_use = torch.bfloat16
output = torch._scaled_mm(..., output_dtype_to_use, ...)
...
if is_rowwise_scaling and output_dtype == torch.float32:
    # work around torch._scaled_mm not having a float32 output type
    # TODO(issue number): remove this once torch._scaled_mm supports float32 output
    output = output.to(output_dtype)
```
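For reference, a self-contained sketch of this workaround as a standalone helper might look like the following. This is illustrative only: the function name and the shape-based rowwise check are assumptions, and `torch._scaled_mm` is a private API whose signature can vary across PyTorch versions.

```python
import torch

def _scaled_mm_with_fp32_workaround(a_fp8, b_fp8, scale_a, scale_b, output_dtype):
    # Rowwise scaling: one scale per row of a, one scale per column of b.
    is_rowwise_scaling = (
        scale_a.shape == (a_fp8.shape[0], 1)
        and scale_b.shape == (1, b_fp8.shape[1])
    )
    output_dtype_to_use = output_dtype
    if is_rowwise_scaling and output_dtype == torch.float32:
        # work around torch._scaled_mm not supporting float32 output
        # for row-wise scaling (pytorch/pytorch#156771)
        output_dtype_to_use = torch.bfloat16
    output = torch._scaled_mm(
        a_fp8, b_fp8, scale_a, scale_b, out_dtype=output_dtype_to_use
    )
    if output.dtype != output_dtype:
        # cast back to the originally requested precision
        output = output.to(output_dtype)
    return output
```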
Makes sense, I'll change it to enable by default and file an issue.
Can we file an issue in core to add this to torch._scaled_mm, and enable the workaround without a config for now? Also add a test?
Updated to enable the workaround by default. Included fp16 and fp32 dtypes in the existing test cases. The additional changes are formatting fixes generated by the linter. The PyTorch issue: pytorch/pytorch#156771
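A minimal sketch of what that dtype coverage could look like, assuming torchao's float8 training entry points (the actual test lives in the repo's float8 test suite and uses its own helpers; running it requires an fp8-capable GPU, SM89+):

```python
import pytest
import torch
from torchao.float8 import Float8LinearConfig, convert_to_float8_training

@pytest.mark.parametrize("dtype", [torch.bfloat16, torch.float16, torch.float32])
def test_rowwise_scaling_preserves_dtype(dtype):
    # Swap the nn.Linear for a rowwise-scaled float8 linear and check
    # that the output dtype matches the input dtype after the workaround.
    m = torch.nn.Linear(64, 64, device="cuda", dtype=dtype)
    config = Float8LinearConfig.from_recipe_name("rowwise")
    convert_to_float8_training(m, config=config)
    x = torch.randn(16, 64, device="cuda", dtype=dtype, requires_grad=True)
    y = m(x)
    assert y.dtype == dtype
    y.sum().backward()
```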
@y-sq, maybe export again?
@vkuzo sorry, there were some un-synced files between GitHub and fbcode, so the previous exports all failed. The PR should be updated now.
Force-pushed from abbed3a to 303d6a6.
Summary:

Running rowwise scaling on fp32 tensors got the error (P1794222725):

```
RuntimeError: Only bf16 high precision output types are supported for row-wise scaling.
```

This PR adds an option to explicitly use bfloat16 as the output of rowwise_scaled, and cast it back to the original precision. It can be enabled by setting:

```
config = dataclasses.replace(config, convert_dtypes_for_rowwise_scaled_mm=True)
```

Differential Revision: D73552660
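For context, a minimal reproduction of the RuntimeError above might look like this. The shapes and unit scales are made up for illustration, the positional scale_a/scale_b call layout is assumed from recent PyTorch releases, and it needs an fp8-capable GPU (SM89+) with a PyTorch build from before pytorch/pytorch#156771 is addressed:

```python
import torch

M, K, N = 16, 32, 64
a = torch.randn(M, K, device="cuda").to(torch.float8_e4m3fn)
# torch._scaled_mm expects the second operand in column-major layout
b = torch.randn(N, K, device="cuda").to(torch.float8_e4m3fn).t()
scale_a = torch.ones(M, 1, device="cuda")  # one scale per row of a
scale_b = torch.ones(1, N, device="cuda")  # one scale per column of b

# Raises: RuntimeError: Only bf16 high precision output types are
# supported for row-wise scaling.
out = torch._scaled_mm(a, b, scale_a, scale_b, out_dtype=torch.float32)
```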