Add flop formula for _scaled_mm #144973
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/144973
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 unrelated failure) As of commit b8ea47e with merge base 62ce3e6.
FLAKY - The following job failed but was likely due to flakiness present on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
def test_scaled_mm(self):
    dtype = torch.float8_e4m3fnuz if torch.version.hip else torch.float8_e4m3fn
    with FlopCounterMode() as mode:
        torch._scaled_mm(
            torch.randn((3 * 16, 5 * 16), device="cuda").to(dtype),
            torch.randn((7 * 16, 5 * 16), device="cuda").to(dtype).t(),
            scale_a=torch.ones((), device="cuda"),
            scale_b=torch.ones((), device="cuda"),
            out_dtype=torch.bfloat16,
        )
Similar change suggested in previous PR.
Nevermind, I just noticed you're not putting this test in test_matmul_cuda.py. Your change here looks good. Thanks for supporting MI300.
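As a sanity check on the test above (an editor's sketch, not code from the PR): assuming the new formula follows the same 2 * M * K * N convention as mm_flop, the inputs of shape (48, 80) and (80, 112) should yield 2 * 48 * 80 * 112 = 860,160 flops. A possible continuation of the test body, using FlopCounterMode's get_total_flops accessor, would be:

    # Sketch: assumes _scaled_mm is counted with the usual dense-matmul
    # convention of 2 * M * K * N flops (same as mm_flop).
    expected_flops = 2 * (3 * 16) * (5 * 16) * (7 * 16)  # (48, 80) @ (80, 112) -> 860,160
    self.assertEqual(mode.get_total_flops(), expected_flops)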
docs/source/conf.py
"mm_flop", | ||
"normalize_tuple", | ||
"register_flop_formula", | ||
"scaled_mm_flop", |
Adding new entries to this file and to the allowlist below is never OK.
If you want this function to be public, you need to properly expose and document it.
My guess is that in this case you don't want it to be public, so you should prefix its name with an underscore (_) instead.
I'm following the convention of all the other functions in that file. Did I miss some aspect of it, or are you saying that all those functions are also "bad" and I should break the convention instead?
@pytorchbot merge -f "Still failing because of torchtune, unrelated"
Merge started: Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
This will make it work correctly with the partitioner's AutoAC.
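For context, the mechanics are small: FlopCounterMode looks up a per-op formula registered via register_flop_formula, and without one, flops for aten._scaled_mm are simply not counted, which is what the partitioner's AutoAC cost estimates would otherwise miss. Below is a minimal sketch of such a registration, assuming the scaled matmul is counted exactly like a dense (M, K) @ (K, N) matmul at 2 * M * K * N flops; the PR's actual signature and details may differ.

    import torch
    from torch.utils.flop_counter import register_flop_formula

    aten = torch.ops.aten

    @register_flop_formula(aten._scaled_mm)
    def _scaled_mm_flop(a_shape, b_shape, *args, out_shape=None, **kwargs) -> int:
        # An (M, K) @ (K, N) matmul performs M * K * N multiply-accumulates,
        # conventionally counted as 2 flops each; scales and bias are ignored here.
        m, k = a_shape
        k2, n = b_shape
        assert k == k2, "inner dimensions of a and b must match"
        return 2 * m * k * n

With a formula like this registered, the test above reports flops for aten._scaled_mm just as it does for mm/addmm, which is presumably what lets AutoAC weigh scaled FP8 matmuls when deciding what to recompute.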