Fix derivatives of norm(p=inf) #78105

lezcano · 2022-05-23T17:46:41Z

Stack from ghstack:

Following up on #51099 (comment), we fix these derivatives, as they were incorrect until now.

As described in the note, a better solution would be to vectorise the preprocessing operation used by reductions on CPU. It's not clear how difficult that would be.

Fixes #67517


facebook-github-bot · 2022-05-23T17:46:48Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/78105

✅ No Failures (0 Pending)

As of commit 0055821 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


lezcano · 2022-05-23T17:49:42Z

@ngimel I improved the explanation for why we have to do that trick in linalg_vector_norm and I added the trick to norm to fix that case. Could you have a look?
We should figure out whether we can vectorise the input transformation on CPU for reductions. This would solve this problem.

I also simplified the formula for the derivative. That's for @albanD to have a look.
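
For readers following the thread, the derivative under discussion can be sketched in plain Python. This is an illustration only, not the code from this PR: it assumes the gradient of the infinity norm is sgn(x) on the entries whose absolute value equals the norm, split evenly among ties, and it ignores NaN handling. The helper names `sign`, `inf_norm` and `inf_norm_grad` are hypothetical.

```python
def sign(x):
    """Return -1.0, 0.0 or 1.0 depending on the sign of x."""
    if x > 0:
        return 1.0
    if x < 0:
        return -1.0
    return 0.0


def inf_norm(xs):
    """Infinity norm of a non-empty list of real numbers: max_i |x_i|."""
    return max(abs(x) for x in xs)


def inf_norm_grad(xs, grad_out=1.0):
    """Sketch of a (sub)gradient of inf_norm with respect to xs.

    Assumption: the incoming gradient is divided evenly among all
    entries that attain the maximum absolute value.
    """
    abs_vals = [abs(x) for x in xs]
    norm = max(abs_vals)
    # Mask of the entries that attain the norm (the "argmax" set).
    mask = [a == norm for a in abs_vals]
    count = sum(mask)
    return [sign(x) * grad_out / count if m else 0.0
            for x, m in zip(xs, mask)]


print(inf_norm([1.0, -3.0, 3.0]))       # 3.0
print(inf_norm_grad([1.0, -3.0, 3.0]))  # [0.0, -0.5, 0.5]
```

Note that the mask is built by comparing `abs(x)` against a norm computed with the same `abs`. If the forward and backward passes used slightly different absolute-value implementations, the equality test could miss the maximal entry, which is presumably why consistency between the two matters here.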

lezcano · 2022-05-23T17:50:32Z

fwiw, this PR fixes some issues that were uncovered in the top PR of the stack after fixing the forward AD for the sgn function.


lezcano · 2022-05-24T17:15:04Z

@pytorchbot merge this

github-actions · 2022-05-24T17:20:17Z

Hey @lezcano.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

Summary:
Following up on #51099 (comment), we fix these derivatives, as they were incorrect until now.

As described in the note, the better solution would be to use vectorised operations on the preprocessing operation when reducing on CPU. It's not clear how difficult that may be.

Fixes #67517

Pull Request resolved: #78105
Approved by: https://github.com/ngimel

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/0c8c39fa715155190c51016fad5bdfc459ed80b3

Reviewed By: mehtanirav

Differential Revision: D36668395

fbshipit-source-id: 5c3d887729279fdcd6dd0e47c4ea6372096a280b

lezcano requested review from mruberry, ngimel, nikitaved, IvanYashchuk, albanD and soulitzer as code owners May 23, 2022 17:46

facebook-github-bot added the cla signed label May 23, 2022

This was referenced May 23, 2022

Random number generators are not and should not be differentiable #78106

Closed

More forward AD formulas #77975

Closed

lezcano removed request for nikitaved, soulitzer, IvanYashchuk and mruberry May 23, 2022 17:47

ngimel approved these changes May 23, 2022

View reviewed changes

pytorchbot added the open source label May 23, 2022

lezcano added 2 commits May 24, 2022 09:48

pytorchmergebot added the Merged label May 24, 2022

pytorchmergebot closed this in 0c8c39f May 24, 2022

lezcano mentioned this pull request May 25, 2022

Make l1_loss composite #78257

Closed

lezcano added release notes: autograd release notes category topic: not user facing topic category labels May 25, 2022

facebook-github-bot deleted the gh/Lezcano/78/head branch May 28, 2022 14:16
