Move GradMode / AutoGradMode / NoGradGuard to ATen core #18573
Conversation
Note from offline discussion with @gchanan: we should look into how to avoid moving …
This would make a really good …
okey dokey
the note on the offline discussion says we don't actually need this, right?
#22473 requires that we check …
@yf225 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: After the Variable/Tensor merge, code paths in ATen need to be able to check whether a tensor requires gradient, and throw errors in places where a `requires_grad=true` tensor is not allowed (such as https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/Utils.h#L76-L78 and https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/SparseTensorImpl.cpp#L86). Since the `GradMode` thread-local variable controls whether a tensor should accumulate gradients, we need to be able to check this variable from ATen when we determine whether a tensor requires gradient, hence this PR to move `GradMode` / `AutoGradMode` / `NoGradGuard` to ATen.

Note that we intentionally don't merge `at::GradMode` and `at::NonVariableTypeMode`, for the following reason: semantically, `at::GradMode` and `at::NonVariableTypeMode` mean different things. `at::GradMode` controls whether a tensor should accumulate gradients, while `at::NonVariableTypeMode` controls whether a Variable should be treated as a non-Variable tensor in type dispatches. There are places where we *don't* want the tensor to accumulate gradients, but *still* want the Variable to be treated as a Variable. Here is one example:

```python
# torch/tensor.py
with torch.no_grad():
    ...
    new_tensor = self.new()  # `at::GradMode` is false at this point
    ...
```

```cpp
// tools/autograd/templates/python_variable_methods.cpp
static PyObject * THPVariable_new(PyObject* self, PyObject* args, PyObject* kwargs) {
  ...
  // If we merged `at::GradMode` and `at::NonVariableTypeMode`, then since `at::GradMode` is false
  // and `self_.type()` checks `at::GradMode` to decide whether to return a non-Variable type, it
  // would return a non-Variable type here, which is not what we want (and throws a "Tensor that
  // was converted to Variable was not actually a Variable" error).
  return THPVariable_Wrap(torch::utils::legacy_tensor_new(self_.type(), args, kwargs));
  ...
}
```

For the above reason, we cannot merge `at::GradMode` and `at::NonVariableTypeMode`, as they serve different purposes.

Pull Request resolved: pytorch/pytorch#18573
Differential Revision: D16134413
Pulled By: yf225
fbshipit-source-id: 6140347e78bc54206506499c264818eb693cdb8a
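For readers unfamiliar with these classes, the sketch below illustrates the pattern being moved: a thread-local flag (`GradMode`) plus RAII guards (`AutoGradMode`, `NoGradGuard`) that flip the flag for the duration of a scope. It is a simplified, self-contained approximation of the pattern this PR moves into ATen core, not the actual ATen source.

```cpp
// Minimal sketch (not the actual ATen code) of the GradMode / AutoGradMode /
// NoGradGuard pattern this PR moves into ATen core.
#include <iostream>

struct GradMode {
  // Whether tensors created on this thread should accumulate gradients.
  static bool is_enabled() { return enabled; }
  static void set_enabled(bool e) { enabled = e; }
 private:
  static thread_local bool enabled;
};
thread_local bool GradMode::enabled = true;

// RAII guard: sets grad mode to the given value and restores the previous
// value when the guard goes out of scope.
struct AutoGradMode {
  explicit AutoGradMode(bool enabled) : prev_mode(GradMode::is_enabled()) {
    GradMode::set_enabled(enabled);
  }
  ~AutoGradMode() { GradMode::set_enabled(prev_mode); }
  bool prev_mode;
};

// Convenience guard that disables grad mode for its scope, mirroring
// `torch.no_grad()` on the Python side.
struct NoGradGuard : public AutoGradMode {
  NoGradGuard() : AutoGradMode(/*enabled=*/false) {}
};

int main() {
  std::cout << GradMode::is_enabled() << "\n";    // 1: grad mode on by default
  {
    NoGradGuard no_grad;                          // grad mode off inside this scope
    std::cout << GradMode::is_enabled() << "\n";  // 0
  }
  std::cout << GradMode::is_enabled() << "\n";    // 1: restored on scope exit
  return 0;
}
```

With the flag living in ATen, checks like the Utils.h and SparseTensorImpl.cpp ones linked above can presumably consult `GradMode::is_enabled()` directly when deciding whether a `requires_grad=true` tensor should be rejected, without depending on the autograd library.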