optim_low breaks if some parameter in the model has None gradient #40

Closed
ksreenivasan opened this issue Jan 5, 2021 · 2 comments

@ksreenivasan
Contributor

In the step() method here, if p.grad is None, then this line will break, causing the optimizer to crash. However, in many deep learning applications it is common for some parameters within a layer, or even whole layers, to be frozen. QPyTorch's optimizer would not be applicable in these cases.

PyTorch's default optimizers have a simple and elegant solution, which is to just skip these parameters, treating a None gradient as equivalent to a zero gradient. We have implemented this solution here. I would like to propose this change to QPyTorch as well.
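A minimal sketch of the proposed guard, assuming a plain SGD-style update. The class name, hyperparameters, and the example model below are illustrative only, not QPyTorch's actual optimizer code:

```python
import torch
from torch.optim import Optimizer

class SketchSGD(Optimizer):
    """Hypothetical minimal SGD-style optimizer, used only to illustrate the guard."""

    def __init__(self, params, lr=0.1):
        super().__init__(params, dict(lr=lr))

    @torch.no_grad()
    def step(self):
        for group in self.param_groups:
            for p in group["params"]:
                if p.grad is None:
                    # Frozen or unused parameter: skip it rather than crashing,
                    # i.e. treat a None gradient as a zero gradient.
                    continue
                p.add_(p.grad, alpha=-group["lr"])

# Example usage: a model with one frozen parameter does not crash step().
model = torch.nn.Linear(4, 2)
model.bias.requires_grad_(False)          # frozen parameter -> its grad stays None
opt = SketchSGD(model.parameters(), lr=0.1)
loss = model(torch.randn(8, 4)).sum()
loss.backward()
opt.step()                                # bias is skipped, weight is updated
```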

@Tiiiger
Owner

Tiiiger commented Jan 5, 2021

Hi @kamikazekartik, thank you for your suggestion! If you can help fix it and submit a PR, I am happy to merge it. Otherwise, it might take a while before I get to it.

@ksreenivasan
Contributor Author

Sounds good @Tiiiger! It's just a few lines of code, so I'll make the change and create a PR later today.

Tiiiger added a commit that referenced this issue Jan 6, 2021
[#40] Allowing optimizer.step to handle None gradients
Tiiiger closed this as completed Jan 6, 2021