[optim][adagrad] group tensors in foreach to maximize perf #92362
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/92362
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 Failures. As of commit 67a1880: NEW FAILURES - the following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
if maximize:
    device_grads = torch._foreach_neg(device_grads)

device_has_sparse_grad = any(grad.is_sparse for grad in device_grads)
You now unconditionally recompute this even if it was already passed in? That doesn't sound right?
Hm, my thinking was that even if the batch as a whole contains sparse grads, individual groups may not, so we can still use foreach to optimize those subgroups.
Is it more common for all grads to be either sparse or dense? If so, there may be other places I need to patch 😬
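Roughly the shape I have in mind (a minimal sketch, not the actual PR code; the grouping helper here is illustrative rather than the real PyTorch utility):

```python
from collections import defaultdict

import torch


def group_by_device_and_dtype(params, grads):
    # Illustrative grouping helper: bucket params/grads by (device, dtype)
    # so each bucket can independently pick the foreach or the fallback path.
    grouped = defaultdict(lambda: ([], []))
    for p, g in zip(params, grads):
        key = (p.device, p.dtype)
        grouped[key][0].append(p)
        grouped[key][1].append(g)
    return grouped


def adagrad_step_sketch(params, grads, maximize=False):
    groups = group_by_device_and_dtype(params, grads)
    for (device, dtype), (device_params, device_grads) in groups.items():
        if maximize:
            device_grads = torch._foreach_neg(device_grads)

        # Recomputed per group: even if the batch as a whole has sparse
        # grads, this particular (device, dtype) group may not.
        device_has_sparse_grad = any(grad.is_sparse for grad in device_grads)

        if device_has_sparse_grad:
            pass  # fall back to the single-tensor path for this group
        else:
            pass  # safe to use torch._foreach_* ops on this group
```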
I think it is fine if we say that we need this info. I'm just more worried about the ignored arg ;)
I'm guessing whatever resolution we have here should be applied to https://github.com/pytorch/pytorch/pull/92338/files#r1072880613 as well?
Yep. Can be done later.
Yep, okay, my initial proposal is that we deprecate has_sparse_grad across the codebase.
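Concretely, that could look something like this (a hypothetical sketch of the deprecation path, not a committed design; the function name is made up):

```python
import warnings


def _multi_tensor_sketch(grads, has_sparse_grad=None):
    # Hypothetical deprecation path: keep accepting the old kwarg for now,
    # warn that it is ignored, and derive sparsity from the grads directly.
    if has_sparse_grad is not None:
        warnings.warn(
            "has_sparse_grad is deprecated; sparsity is now detected per group",
            FutureWarning,
        )
    return any(grad.is_sparse for grad in grads)
```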
4a63bed to 67a1880 (Compare)
@pytorchbot merge
Merge failed. Reason: Not merging any PRs at the moment because there is a merge-blocking https://github.com/pytorch/pytorch/labels/ci:%20sev issue open at:
Details for Dev Infra team: Raised by workflow job
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: 1 job has failed; the first few are: trunk / linux-bionic-cuda11.6-py3.10-gcc7-sm86 / test (default, 3, 4, linux.g5.4xlarge.nvidia.gpu)
Details for Dev Infra team: Raised by workflow job
@pytorchbot merge -f "irrelevant failures"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
another one