[Profiler] Use parameter as key for optimizer state recording. #86753
Conversation
While an optimizer can store state however it likes, in practice most optimizer state corresponds to a particular parameter. (This is the case for all `torch.optim` optimizers.) Thus, it turns out to be ergonomic to collect using that structure. Note that this doesn't lock us into anything; we can always collect state with non-Tensor keys if the use case arises.

One simplification that arises is that Module and Optimizer collection have very similar structure; so similar, in fact, that it is possible to use a common template for config. I also found that a lot of the `check_and_store` logic could be simplified and inlined by this joining of collected optimizer state.

Differential Revision: [D40210703](https://our.internmc.facebook.com/intern/diff/D40210703/)
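For context (an illustration added here, not part of the PR text): `torch.optim` optimizers expose their state via `Optimizer.state`, a mapping keyed by the parameter Tensor itself, which is exactly the structure the profiler collects against. A minimal runnable example:

```python
import torch

# A tiny model and an Adam optimizer. After one step, Adam holds
# per-parameter state (step, exp_avg, exp_avg_sq) keyed by the
# parameter Tensor itself.
model = torch.nn.Linear(4, 2)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

loss = model(torch.randn(8, 4)).sum()
loss.backward()
opt.step()

# `opt.state` maps each parameter Tensor to its state dict; this
# parameter-keyed layout is what makes it natural to record optimizer
# state alongside the parameter it belongs to.
for param, state in opt.state.items():
    print(tuple(param.shape), sorted(state.keys()))
```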
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86753
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures, 8 Pending as of commit 017d779.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
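To make the description's second point concrete, here is a hypothetical Python sketch of why Module and Optimizer collection can share one config template: both reduce to sequences of named Tensors. The names `TensorRecord` and `collect_named_tensors` are invented for illustration; the PR's actual change is in the profiler's C++ collection code.

```python
from typing import Iterator, NamedTuple, Union

import torch

class TensorRecord(NamedTuple):
    name: str
    tensor: torch.Tensor

def collect_named_tensors(
    owner: Union[torch.nn.Module, torch.optim.Optimizer],
) -> Iterator[TensorRecord]:
    # Hypothetical helper: a Module yields its named parameters, and an
    # Optimizer yields the Tensor-valued entries of its parameter-keyed
    # state. Both paths produce the same record shape, which is the
    # similarity that allows a single config template to drive both.
    if isinstance(owner, torch.nn.Module):
        for name, param in owner.named_parameters():
            yield TensorRecord(name, param)
    else:
        for state in owner.state.values():
            for key, value in state.items():
                if isinstance(value, torch.Tensor):
                    yield TensorRecord(key, value)
```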
This looks great! The code looks more extensible and clean!
Pull Request resolved: pytorch#86753
Approved by: https://github.com/slgong-fb, https://github.com/aaronenyeshi
Stack from ghstack (oldest at bottom):
- AccumulateGrad name #86909