Function 'SqrtBackward' returned nan values in its 0th output. Bug in min_enclosing_box.py? #20

zye1996 · 2021-04-25T16:15:41Z

When backprop with GIoU loss, there is a sqrt out of range regarding the line here when sqrt encounter 0 values.

Should we add a small offset to the value inside the sqrt? I tried 1e-8 and the training became unstable while 1e-16 is fine.

num = torch.sqrt( (y2-y1).square() + (x2-x1).square() +1e-16) + 1e-8

The text was updated successfully, but these errors were encountered:

lilanxiao · 2021-04-25T18:36:05Z

Thank you very much for the issue!

Yes, it's a bug. According to this link: pytorch/pytorch#6394, the backprop of torch.sqrt() would generate nan if the input is zero.

Actually, I'm surprised that Pytorch really puts an inf there. I thought the gradient was hard-coded to some very large but limited value. lol. My computer is now occupied by some other tasks and I cannot run tests. But I will fix this ASAP and let you know.

lilanxiao · 2021-04-27T10:30:49Z

hi, I've changed that line to
num = torch.sqrt( (y2-y1).square() + (x2-x1).square() +1e-14)
the 1e-8 is no more necessary as the sqrt is guaranteed positive. Please let me know if there are further issues.

zye1996 · 2021-04-28T14:33:22Z

hi, I've changed that line to
num = torch.sqrt( (y2-y1).square() + (x2-x1).square() +1e-14)
the 1e-8 is no more necessary as the sqrt is guaranteed positive. Please let me know if there are further issues.

Hi I have verified the fix work for the backprop. I can confirm that DIoU loss enhance the performance of the detector considerably.

lilanxiao added the bug Something isn't working label Apr 25, 2021

zye1996 closed this as completed Apr 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Function 'SqrtBackward' returned nan values in its 0th output. Bug in min_enclosing_box.py? #20

Function 'SqrtBackward' returned nan values in its 0th output. Bug in min_enclosing_box.py? #20

zye1996 commented Apr 25, 2021

lilanxiao commented Apr 25, 2021 •

edited

Loading

lilanxiao commented Apr 27, 2021

zye1996 commented Apr 28, 2021

Function 'SqrtBackward' returned nan values in its 0th output. Bug in min_enclosing_box.py? #20

Function 'SqrtBackward' returned nan values in its 0th output. Bug in min_enclosing_box.py? #20

Comments

zye1996 commented Apr 25, 2021

lilanxiao commented Apr 25, 2021 • edited Loading

lilanxiao commented Apr 27, 2021

zye1996 commented Apr 28, 2021

lilanxiao commented Apr 25, 2021 •

edited

Loading