[MPS] Speedup addmm #116548
Conversation
- Do not copy bias to output
- Skip the respective multiplication op if either alpha or beta is equal to 1.0

[ghstack-poisoned]
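A minimal Python-level sketch of the logic this description implies (illustrative only — the actual change is in the Objective-C++ MPS kernel; the name `addmm_sketch` and its structure are assumptions, not PyTorch code):

```python
import torch

def addmm_sketch(bias, m1, m2, *, beta=1.0, alpha=1.0):
    # beta * bias + alpha * (m1 @ m2), with no-op scalar multiplies skipped
    product = m1 @ m2
    scaled = product if alpha == 1.0 else alpha * product   # skipped when alpha == 1
    if beta == 0.0:
        return scaled                                       # bias is never read
    scaled_bias = bias if beta == 1.0 else beta * bias      # skipped when beta == 1
    return scaled + scaled_bias                             # no upfront copy of bias

bias, m1, m2 = torch.randn(3, 3), torch.randn(3, 4), torch.randn(4, 3)
torch.testing.assert_close(addmm_sketch(bias, m1, m2), torch.addmm(bias, m1, m2))
```

With the default `alpha == beta == 1.0`, this reduces to one matmul plus one add, which is the fast path the change carves out.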
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/116548
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit a6a34e5 with merge base 97891b1.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```objc
if (&output != &self) {
  output.resize_(bias_sizes);
  if (beta.toComplexDouble() != 0.0) {
    output.copy_(*bias_);
```
If this is an out variant, overriding the output completely is not ok. We should add into it.
Yes, and it is (all `torch.addmm` OpInfo tests use `alpha` and `beta` not equal to 1); that is why I'm removing this one, as the copy is redundant.
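For context, the semantics under discussion, checked with plain `torch.addmm` (the values are arbitrary):

```python
import torch

bias = torch.randn(3, 3)
m1, m2 = torch.randn(3, 4), torch.randn(4, 3)

# OpInfo-style case with alpha and beta not equal to 1:
# addmm computes beta * bias + alpha * (m1 @ m2)
expected = 0.5 * bias + 2.0 * (m1 @ m2)
actual = torch.addmm(bias, m1, m2, beta=0.5, alpha=2.0)
torch.testing.assert_close(actual, expected)
```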
So I guess the next steps here would be to add this case to OpInfo and fix this kernel to have the appropriate behavior?
I don't think anything is needed here: `addmm_out_mps` copied the bias to the output for some reason, even though it wasn't needed at all, as MPSGraph always overwrites the output:

pytorch/aten/src/ATen/native/mps/operations/LinearAlgebra.mm, lines 302 to 305 in 035e558:
```objc
MPSGraphTensor* outputTensor = productTimesAlphaTensor;
if (is_beta_non_zero) {
  outputTensor = [mpsGraph additionWithPrimaryTensor:productTimesAlphaTensor
                                     secondaryTensor:biasTimesBetaTensor
                                                name:nil];
}
```
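The same point can be checked directly at the Python level (an illustration, not the MPS test suite): the `out=` variant fully overwrites `out`, so a prior copy of the bias into it is never observable:

```python
import torch

out = torch.full((2, 2), 42.0)   # pre-filled with garbage values
bias = torch.zeros(2, 2)
torch.addmm(bias, torch.eye(2), torch.ones(2, 2), out=out)
assert torch.equal(out, torch.ones(2, 2))  # prior contents of out never leak through
```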
Hmm, or perhaps I did not understand your original question: this override only happens if `output != self`. Though yes, I'm not sure this kernel will work as expected if `output == self`; IMO that should be done as a separate PR. This one just eliminates the unneeded multiplications when `alpha` and `beta` are 1, and the unneeded override, as the function always writes to the output.
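The aliasing case in question, illustrated with the Python in-place variant (where output and self are the same tensor):

```python
import torch

b = torch.ones(2, 2)
# In-place variant: the output aliases self (b) — the output == self case above
b.addmm_(torch.eye(2), torch.ones(2, 2), beta=2.0, alpha=1.0)
assert torch.equal(b, torch.full((2, 2), 3.0))  # 2 * 1 + 1 * (eye @ ones) == 3
```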
Sounds good!
@pytorchbot merge -f "Lint and MPS are green"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.