Add python API for get_gradients() method. #28926

Closed

Conversation

@pritamdamania87 (Contributor) commented on Oct 30, 2019

Stack from ghstack:

The get_gradients method was a pybind-only method with no user-facing documentation.

I've moved this method to our Python distributed autograd API and ensured that we have appropriate docs for it.

Differential Revision: [D18234443](https://our.internmc.facebook.com/intern/diff/D18234443/)

pritamdamania87 pushed a commit that referenced this pull request Oct 30, 2019
ghstack-source-id: 92941094
Pull Request resolved: #28926

@pietern (Contributor) left a comment

Not a comment on the PR itself.

Why is this map keyed by tensors? Reason I ask: in autograd, backward() causes the grads to be accumulated in the .grad member field of tensors with requires_grad=True. If you want access to the grads without accumulating them (i.e., computing them without side effects), you can use the torch.autograd.grad function. Similarly, I would expect dist_autograd.backward to accumulate grads in the .grad member fields.
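For context, a minimal single-process sketch of the distinction being drawn here, using only the standard torch.autograd API: backward() accumulates into the .grad field as a side effect, while torch.autograd.grad returns the gradients without touching it.

```python
import torch

x = torch.randn(3, requires_grad=True)

# backward() accumulates the result into x.grad as a side effect.
(x * 2).sum().backward()
print(x.grad)  # tensor([2., 2., 2.])

# torch.autograd.grad computes and returns gradients without touching x.grad.
(gx,) = torch.autograd.grad((x * 3).sum(), x)
print(gx)      # tensor([3., 3., 3.])
print(x.grad)  # still tensor([2., 2., 2.])
```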

@pritamdamania87 (Contributor, Author)

@pietern The reason we don't accumulate grads on the .grad field is that multiple autograd passes from different trainers would end up stepping on each other. That is why we accumulate the grads in the autograd context instead. We had a discussion with @albanD and @ezyang, and it made sense to also provide an option to accumulate gradients on the .grad field: #27641.
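To make the accumulation model concrete, here is a sketch of how the documented API fits together. It mirrors the example in the current torch.distributed.autograd docs (the backward signature shown takes the context_id, as in current releases) and assumes the RPC framework can be initialized, which distributed autograd requires even on a single worker.

```python
import torch
import torch.distributed.autograd as dist_autograd
import torch.distributed.rpc as rpc

# Distributed autograd requires the RPC framework, even with one worker.
# Assumes MASTER_ADDR/MASTER_PORT are set for the default init_method.
rpc.init_rpc("worker0", rank=0, world_size=1)

with dist_autograd.context() as context_id:
    t1 = torch.rand((3, 3), requires_grad=True)
    t2 = torch.rand((3, 3), requires_grad=True)
    loss = (t1 + t2).sum()

    # Gradients are accumulated in this context, not in t1.grad / t2.grad,
    # so concurrent backward passes from different trainers don't collide.
    dist_autograd.backward(context_id, [loss])

    grads = dist_autograd.get_gradients(context_id)
    print(grads[t1])  # gradient of loss w.r.t. t1
    print(t1.grad)    # None: the .grad field is left untouched

rpc.shutdown()
```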

@ezyang (Contributor) commented on Oct 31, 2019

You don't have to do this to attach documentation. Take a look at what we do in torch/_tensor_docs.py.
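For reference, the pattern in torch/_tensor_docs.py looks roughly like this (simplified): the docstring is attached after the fact to the already-bound C method, so no Python wrapper (and no extra per-call overhead) is introduced. It is not meant to be re-run against an installed torch, where these docstrings are already attached; _add_docstr raises if the target already has a docstring, which is the failure pritamdamania87 hits further down this thread.

```python
import torch
from torch._C import _add_docstr as add_docstr

# Attach a docstring directly to the bound C method; no `def` wrapper needed.
# Raises if torch.Tensor.abs already carries a docstring (as it does in any
# installed torch), so this only illustrates the pattern.
add_docstr(torch.Tensor.abs,
           r"""
abs() -> Tensor

See :func:`torch.abs`
""")
```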

@pritamdamania87 (Contributor, Author)

@ezyang If we have a handful of methods, wouldn't it be better to attach the documentation as we've done in this PR? That way it's easier for developers navigating the code to quickly see the documentation inline.
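A hedged sketch of the approach under discussion at this point in the PR (the binding name `_get_gradients` is assumed purely for illustration, not the actual symbol): a thin Python def forwards to the pybind function and carries the user-facing docstring inline with the rest of the Python API.

```python
# Illustrative only: `_get_gradients` is an assumed name for the underlying
# pybind binding, not the real symbol.
from torch._C._distributed_autograd import _get_gradients  # assumed binding name


def get_gradients(context_id):
    """
    Retrieves a map from Tensor to the gradient for that Tensor accumulated
    in the provided distributed autograd context.

    Arguments:
        context_id (int): The autograd context id to retrieve gradients for.

    Returns:
        A Dict[Tensor, Tensor] mapping each Tensor to its gradient.
    """
    return _get_gradients(context_id)
```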

@ezyang (Contributor) commented on Nov 1, 2019

Making Python wrappers for C functions adds nontrivial overhead; it's one of the main reasons we've done it this way for regular torch functions. So in this case we decided to give up "there is always a def ... for every Python function" in the name of performance.
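A rough way to see the overhead being described: time a bound C function called directly versus through a do-nothing Python wrapper. The exact numbers vary by machine; the point is only that the wrapper adds a fixed per-call cost.

```python
import timeit

import torch


def wrapped_sigmoid(t):
    # Thin Python wrapper that only forwards to the bound function.
    return torch.sigmoid(t)


t = torch.randn(8)
print("direct :", timeit.timeit(lambda: torch.sigmoid(t), number=100_000))
print("wrapped:", timeit.timeit(lambda: wrapped_sigmoid(t), number=100_000))
```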

@pritamdamania87 (Contributor, Author)

I agree this makes a lot of sense for the tensor methods, although in this case isn't each method called only once per forward and backward pass? Do you feel the performance overhead would still be significant?

@ezyang (Contributor) commented on Nov 1, 2019

Yes, you might be fine. So the request to do it the other way is more of a hygiene thing (so that someone else doesn't come along and copy what you did here for a case where the performance does matter).

@pritamdamania87 (Contributor, Author)

@ezyang So it looks like add_docstr doesn't work, since pybind is probably already setting the docstring, and as a result we fail here:

"method '%s' already has a docstring", m->d_method->ml_name);

@albanD (Collaborator) commented on Nov 6, 2019

@pritamdamania87 If I remember correctly, if you pass a char* as the 3rd argument of the module's def() method, it will be used as the docstring for the function. We already do that, for example, for the cpp extensions here.
Would that work for you?
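A small, self-contained way to see the mechanism being described (assumes a working C++ toolchain): torch.utils.cpp_extension generates pybind11 def() calls for the listed functions, and a per-function string becomes the Python-visible docstring of the resulting binding.

```python
from torch.utils.cpp_extension import load_inline

cpp_source = "int add_one(int x) { return x + 1; }"

# Passing a dict for `functions` maps each function name to its docstring,
# which cpp_extension forwards as the extra argument to pybind11's def().
module = load_inline(
    name="docstring_demo",
    cpp_sources=cpp_source,
    functions={"add_one": "Adds one to the input; this string ends up in __doc__."},
)

print(module.add_one.__doc__)  # includes the docstring passed above
print(module.add_one(41))      # 42
```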

pritamdamania87 pushed a commit that referenced this pull request Nov 8, 2019
Pull Request resolved: #28926
ghstack-source-id: 93558845

@pritamdamania87 (Contributor, Author)

@albanD @ezyang Moved the docs to the pybind initialization. Could you take another look? Thanks!

@albanD (Collaborator) left a comment

LGTM

@facebook-github-bot (Contributor)

This pull request has been merged in 17b0ab4.
