
lr_find doesn't return the correct suggestion if some losses are nan #1850

Closed
binshengliu opened this issue May 16, 2020 · 3 comments · Fixed by #1862
Labels
bug (Something isn't working), help wanted (Open to be worked on)

Comments

@binshengliu

🐛 Bug

lr_find doesn't return the correct suggestion if some losses are NaN. The returned suggestion corresponds to the NaN loss, and the associated learning rate is very large in my case.


To Reproduce

This depends on the dataset. Please see the code sample.

Code sample

I believe this is caused by how NumPy handles NaN. The relevant code is https://github.com/PyTorchLightning/pytorch-lightning/blob/b84b02400a312240a6429c186cc63514eeb45a82/pytorch_lightning/trainer/lr_finder.py#L325

import numpy as np

example_losses = [0.90, 0.89, 0.87, 0.86, 0.85, 0.84]
print(np.gradient(example_losses).argmin())  # picks the point of steepest descent
example_losses = [0.90, 0.89, 0.87, 0.86, 0.85, 0.84, float('nan')]
print(np.gradient(example_losses).argmin())  # the NaN hijacks the argmin

Output:

1
5
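
The trailing NaN contaminates the last two central differences computed by np.gradient, and np.argmin then returns the position of the first NaN it encounters, since comparisons against NaN are always false. A minimal demonstration of the intermediate values:

import numpy as np

g = np.gradient([0.90, 0.89, 0.87, 0.86, 0.85, 0.84, float('nan')])
print(g)           # [-0.01  -0.015 -0.015 -0.01  -0.01    nan    nan]
print(g.argmin())  # 5: the index of the first NaN, not of the steepest descent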

Expected behavior

Return the correct suggestion, ignoring the NaN losses.

Environment

  • CUDA:
    • GPU:
    • available: False
    • version: 10.2
  • Packages:
    • numpy: 1.18.4
    • pyTorch_debug: False
    • pyTorch_version: 1.5.0
    • pytorch-lightning: 0.7.6
    • tensorboard: 2.2.0
    • tqdm: 4.45.0
  • System:
    • OS: Linux
    • architecture:
      • 64bit
    • processor:
    • python: 3.7.6
    • version: #1 SMP Debian 4.19.118-2 (2020-04-29)

Additional context

NA

@github-actions
Contributor

Hi! Thanks for your contribution, great first issue!

@SkafteNicki
Member

Good catch. I guess we cannot do much about how np.gradient behaves, but we can filter NaN values before doing the calculation. @binshengliu are you up for doing a PR?

@binshengliu
Author

Filtering out NaN would be a reasonable approach. Locally I just reset NaN to inf, which also avoids dealing with indices.

Sorry, I'm quite inundated with my own projects recently and may not have enough time to put together a proper PR.
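
For illustration, here is a minimal sketch of the filtering approach suggested above. The suggest_index helper is hypothetical (it is not the actual lr_finder code, nor necessarily what #1862 implemented); mapping the argmin over the filtered array back to the original positions sidesteps the index bookkeeping mentioned in the previous comment:

import numpy as np

def suggest_index(losses):
    # Hypothetical helper: index of steepest loss descent, ignoring
    # NaN/inf entries so they can never win the argmin.
    losses = np.asarray(losses, dtype=float)
    finite = np.isfinite(losses)
    grads = np.gradient(losses[finite])  # gradient over the finite values only
    return int(np.flatnonzero(finite)[grads.argmin()])  # map back to original index

print(suggest_index([0.90, 0.89, 0.87, 0.86, 0.85, 0.84, float('nan')]))  # 1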
