
Avoid NaN values in torch.cdist backward for p<1 #45720

Closed

Conversation

kurtamohler (Collaborator)

Fixes #36493
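For context, here is a small reproduction sketch of the failure mode described in the linked issue. The concrete values (`p=0.5`, a difference of `1e-46` that underflows to exactly zero in float32) are illustrative choices, not taken from the PR:

```python
import torch

# One coordinate differs by an amount that underflows to 0.0 in float32
# (the issue mentions deltas below ~1e-45); the other differs by 1.0 so
# the distance itself is nonzero.
x = torch.tensor([[0.0, 1.0]], requires_grad=True)
y = torch.tensor([[1e-46, 2.0]])

d = torch.cdist(x, y, p=0.5)
d.sum().backward()

print(x.grad)  # contains NaN on builds without this fix; finite afterwards
```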

codecov bot commented Oct 2, 2020

Codecov Report

Merging #45720 into master will decrease coverage by 0.07%.
The diff coverage is 61.70%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master   #45720      +/-   ##
==========================================
- Coverage   68.61%   68.53%   -0.08%     
==========================================
  Files         405      409       +4     
  Lines       52045    52553     +508     
==========================================
+ Hits        35710    36018     +308     
- Misses      16335    16535     +200     
| Impacted Files | Coverage Δ |
|---|---|
| torch/_torch_docs.py | 100.00% <ø> (ø) |
| torch/cuda/amp/grad_scaler.py | 23.94% <0.00%> (-0.46%) ⬇️ |
| torch/nn/modules/loss.py | 97.76% <ø> (ø) |
| torch/onnx/symbolic_opset9.py | 35.56% <ø> (ø) |
| torch/overrides.py | 97.08% <ø> (ø) |
| torch/quantization/fx/pattern_utils.py | 87.23% <ø> (-1.86%) ⬇️ |
| torch/onnx/symbolic_opset11.py | 21.42% <9.09%> (-1.28%) ⬇️ |
| .../testing/_internal/distributed/distributed_test.py | 30.29% <12.19%> (-0.41%) ⬇️ |
| torch/onnx/symbolic_opset10.py | 40.00% <16.66%> (+3.74%) ⬆️ |
| torch/utils/benchmark/utils/timer.py | 78.82% <25.00%> (-8.85%) ⬇️ |

... and 26 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6acd7b6...476adff.

albanD (Collaborator) left a comment

Looks mostly good.
Added comments inline.

Two review threads on test/test_autograd.py (outdated, resolved)
albanD (Collaborator) left a comment

Looks good.
We can think of more "near to zero" optimization in a follow-up PR if we want to.
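For readers following along, a rough sketch of what handling the zero and near-zero differences can look like, written as plain tensor ops; the function name `cdist_backward_term` is hypothetical and this is not the kernel code touched by this PR:

```python
import torch

def cdist_backward_term(diff, dist, grad, p):
    # Per-coordinate backward term of d = (sum |diff|**p) ** (1/p):
    #   sign(diff) * |diff|**(p-1) * d**(1-p) * grad
    # For p < 1 the factor |diff|**(p-1) diverges as diff -> 0, so the
    # naive product at diff == 0 evaluates to 0 * inf = NaN. Masking the
    # zero-difference entries assigns them a zero gradient instead.
    term = diff.sign() * diff.abs().pow(p - 1) * dist.pow(1 - p) * grad
    return torch.where(diff == 0, torch.zeros_like(term), term)
```

Inside the C++/CUDA kernels the same idea would amount to checking the zero cases before the powers are evaluated.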

facebook-github-bot (Contributor) left a comment

@albanD has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

nikitaved (Collaborator) commented

@albanD, we can modify the formula for the different cases p < 1 and p > 1 to make it more numerically stable. It will be BC-breaking for sure.
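For reference, the per-coordinate derivative behind this discussion (standard calculus for the p-norm distance, not copied from the PR diff):

$$
d(x, y) = \Big(\sum_j |x_j - y_j|^p\Big)^{1/p},
\qquad
\frac{\partial d}{\partial x_i} = \operatorname{sign}(x_i - y_i)\,|x_i - y_i|^{p-1}\, d^{\,1-p}.
$$

For $p \ge 1$ the exponent $p-1$ is non-negative, so the factor $|x_i - y_i|^{p-1}$ stays bounded as a coordinate difference shrinks; for $p < 1$ it is negative, so that factor blows up while $\operatorname{sign}(x_i - y_i)$ goes to zero, and evaluating the product at a zero difference gives $0 \cdot \infty = \text{NaN}$.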

albanD (Collaborator) commented Oct 5, 2020

Not all BC-breaking changes are a no-go. If we change the computed value to be closer to the correct result, that's an OK thing to do.

The main thing is that we should make sure it does not make other regions significantly worse.

facebook-github-bot (Contributor) commented

@albanD merged this pull request in 54aaffb.

Successfully merging this pull request may close these issues.

torch.cdist gradients are NAN for p<1 and very small differences in a given dimension (0<delta<~e-45)