
torch.lerp: discrepancy between CUDA and CPU (with extremal inputs) #78484

Open
kshitij12345 opened this issue May 30, 2022 · 3 comments
Labels
module: NaNs and Infs Problems related to NaN and Inf handling in floating point · triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@kshitij12345
Collaborator

🐛 Describe the bug

import torch

st = torch.tensor(0.2345)
en = torch.tensor(float("inf"))
weight = torch.tensor(-0.9890)

print(torch.lerp(st, en, weight))   # -inf
print(torch.lerp(st.cuda(), en.cuda(), weight.cuda()))  # nan

Versions

master
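For reference, torch.lerp is documented as computing out = start + weight * (end - start). Evaluating that formula with plain IEEE-754 floats shows what the result should be for the inputs above (a sketch in pure Python, not PyTorch's actual kernel):

```python
# Reference computation of lerp's documented formula with plain floats.
start, end, weight = 0.2345, float("inf"), -0.9890

# (end - start) = inf, weight * inf = -inf, start + (-inf) = -inf
out = start + weight * (end - start)
print(out)  # -inf
```

Under IEEE-754 semantics this yields -inf, matching the CPU result in the report, so -inf is arguably the expected output.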

@kshitij12345 kshitij12345 added the module: NaNs and Infs Problems related to NaN and Inf handling in floating point label May 30, 2022
@ejguan ejguan added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label May 31, 2022
@khushi-411
Contributor

Hi @kshitij12345, I think this is fixed on the master branch and the issue can be closed. Please see:

In [1]: import torch
   ...: 
   ...: st = torch.tensor(0.2345)
   ...: en = torch.tensor(float("inf"))
   ...: weight = torch.tensor(-0.9890)
   ...: 
   ...: print(torch.lerp(st, en, weight))   # -inf
   ...: print(torch.lerp(st.cuda(), en.cuda(), weight.cuda()))  # nan

tensor(nan)
tensor(nan, device='cuda:0')

In [2]: torch.__version__
Out[2]: '2.0.0a0+gitb7c2a65'

Thanks!

@kshitij12345
Collaborator Author

kshitij12345 commented Mar 13, 2023

This looks like a regression on CPU, as I'd expect -inf as the output.

>>> start = 0.2345
>>> end = float('inf')
>>> weight = -0.9890
>>> start + weight * (end - start)
-inf

@ZailiWang
Contributor

ZailiWang commented Mar 15, 2023

Yes, this should be a regression; the original CPU result (-inf) is the expected one.
I found that the PR introducing the bug is #84844, in which the branch condition weight < 0.5 was changed to abs(weight) < 0.5, so for this input the calculation is executed by the formula in the other branch. That check was added for numerical stability when weight is close to 0, but it disregards this edge case.

import torch

st = torch.tensor(0.2345)
en = torch.tensor(float("inf"))
weight = torch.tensor(-0.9890)

print(st + weight * (en - st))  # -inf  (first branch's formula)
print(en - (en - st) * (torch.tensor(1) - weight))  # nan  (other branch: inf - inf)
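To make the branch behavior concrete, here is a minimal pure-Python sketch of a branched lerp (a hypothetical reconstruction for illustration, not PyTorch's actual kernel code), with a flag toggling between the pre- and post-#84844 conditions:

```python
def lerp_branched(start, end, weight, use_abs=True):
    """Sketch of a two-branch lerp.

    use_abs=True mimics the condition after PR #84844 (abs(weight) < 0.5);
    use_abs=False mimics the original condition (weight < 0.5).
    """
    cond = abs(weight) < 0.5 if use_abs else weight < 0.5
    if cond:
        # Formula favored when weight is near 0.
        return start + weight * (end - start)
    else:
        # Algebraically equivalent formula; with end = inf this
        # evaluates inf - inf and produces nan.
        return end - (end - start) * (1 - weight)

st, en, w = 0.2345, float("inf"), -0.9890
print(lerp_branched(st, en, w, use_abs=True))   # nan  (post-#84844 branch)
print(lerp_branched(st, en, w, use_abs=False))  # -inf (original branch)
```

With abs(weight), the negative weight -0.9890 is routed into the second formula, which hits inf - inf and returns nan, whereas the original condition keeps it in the first formula and returns -inf.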
