Fix retains grad behavior after in-place #79996
Conversation
// Only meaningful on non-leaf variables (must be false otherwise)
bool retains_grad_;
// Only meaningful on non-leaf variables (must be -1 otherwise)
// The value of retains_grad_ indicates the index of it in cpp_hooks_list_
nit: This is more a handle than an index, right?
What is the difference?
(*old_list)[idx] = nullptr;
materialize_autograd_meta(self)->cpp_hooks_list_ = new_list;
std::unique_ptr<FunctionPreHook> hook_ptr(
Why not reset the AM.retains_grad_ field and call self.retain_grad() here? That would avoid duplication.
retain_grad would call into register_hook, which would wrap the function in a lambda again.
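The objection above can be sketched in a few lines: if `register_hook` wraps every function it receives in a fresh lambda, then re-registering through `retain_grad()` after an in-place op would add a second, nested wrapper for the same logical hook, instead of moving the existing one. This is a hypothetical pure-Python model of that wrapping, not PyTorch's actual code:

```python
def register_hook(hooks, fn):
    # register_hook stores a new lambda around the given function,
    # mirroring the wrapping mentioned in the comment above
    wrapped = lambda grad: fn(grad)
    hooks.append(wrapped)
    return len(hooks) - 1  # handle to the stored wrapper

hooks = []
capture = lambda grad: grad           # stand-in retains-grad hook
register_hook(hooks, capture)         # original registration
register_hook(hooks, hooks[0])        # naive "re-retain": wraps the wrapper

# one logical hook now occupies two slots, the second nesting the first
assert len(hooks) == 2
assert hooks[1](5) == 5
```

Moving the already-wrapped hook between lists, as the PR does, avoids this duplication.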
See this doc: https://docs.google.com/document/d/1KiRdnoj6B4cI3yl017hTbCqcOGO1gWIpUf20sldipHM/edit#

Two issues are fixed: (1) regarding hooks in general and (2) regarding retains-grad hooks. Python hooks, which rely on a different mechanism, are not discussed here:

- Hooks in cpp in general
  - (fixed) new hooks registered to a newer version of the tensor no longer get applied to the grad_fn associated with an older version of the tensor from when the first hook was registered
  - (unchanged) hooks registered to the older version of the tensor remain active on the grad_fn associated with that older version
- Retains-grad hooks
  - (fixed) now get moved to the latest grad_fn. NB: to the user, retains_grad is not considered a hook or expected to behave like hooks (which we consider properties of the grad_fn), whereas retains-gradness is a property of the tensor.
- Python hooks (not in this PR)
  - (will fix) same issue as cpp hooks, where new hooks are being applied to the grad_fn associated with the older version of the tensor

[ghstack-poisoned]
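The "retains-grad hooks get moved" fix, together with the `(*old_list)[idx] = nullptr` snippet in the diff above, can be sketched as follows. All names here are illustrative stand-ins for the C++ logic, not the real PyTorch API:

```python
def move_retains_grad_hook(old_list, idx):
    """Sketch of the in-place update: null out the retains-grad hook in the
    old hooks list (as in `(*old_list)[idx] = nullptr`) and start a fresh
    hooks list for the new grad_fn, so retains-gradness follows the tensor."""
    hook = old_list[idx]
    old_list[idx] = None   # old grad_fn no longer fires this hook
    new_list = [hook]      # new grad_fn starts with just the moved hook
    return new_list, 0     # fresh list and the hook's new index/handle

# ordinary hooks stay behind; only the retains-grad hook moves
old_hooks = ["ordinary_hook", "retains_grad_hook"]
new_hooks, new_idx = move_retains_grad_hook(old_hooks, 1)
assert old_hooks == ["ordinary_hook", None]
assert new_hooks == ["retains_grad_hook"]
assert new_idx == 0
```

This captures the distinction the description draws: ordinary hooks are properties of the old grad_fn and remain with it, while retains-gradness is a property of the tensor and travels to the latest grad_fn.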
SGTM
Small nits only!
@pytorchbot merge -g
@pytorchbot successfully started a merge job. Check the current status here
Hey @soulitzer. |
Summary: See this doc: https://docs.google.com/document/d/1KiRdnoj6B4cI3yl017hTbCqcOGO1gWIpUf20sldipHM/edit#

Pull Request resolved: #79996
Approved by: https://github.com/albanD
Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/516f3198d65f6932299d41ddbb98c26de5f0a367
Reviewed By: mehtanirav
Differential Revision: D37749323
Pulled By: soulitzer
fbshipit-source-id: eaa34c2c08cdd44970a0a4e647238603c19c6169
Stack from ghstack (oldest at bottom):