Added parameter range checks for all optimizers #6000
Conversation
torch/optim/sgd.py
Outdated
if not 0.0 <= lr:
    raise ValueError("Invalid learning rate: {}".format(lr))
if not 0.0 <= momentum <= 1.0:
    raise ValueError("Invalid momentum value: {}".format(momentum))
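The diff above follows a fail-fast pattern: reject out-of-range hyperparameters in the constructor, before any training step runs. A minimal standalone sketch of the same idea (the helper name is hypothetical, not actual PyTorch code):

```python
def validate_sgd_args(lr, momentum=0.0, weight_decay=0.0):
    """Fail fast on out-of-range SGD-style hyperparameters (illustrative sketch)."""
    if not 0.0 <= lr:
        raise ValueError("Invalid learning rate: {}".format(lr))
    if not 0.0 <= momentum <= 1.0:
        raise ValueError("Invalid momentum value: {}".format(momentum))
    if not 0.0 <= weight_decay:
        raise ValueError("Invalid weight_decay value: {}".format(weight_decay))

# Valid arguments pass silently; invalid ones raise immediately.
validate_sgd_args(lr=0.1, momentum=0.9)
try:
    validate_sgd_args(lr=-0.1)
except ValueError as e:
    print(e)  # Invalid learning rate: -0.1
```

Raising at construction time turns a silent divergence many epochs later into an immediate, self-explanatory error.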
I have reviewed the rest of the ranges. Looks good to me.
@pytorchbot test this please
@@ -343,6 +349,8 @@ def test_adagrad(self):
                self._build_params_dict(weight, bias, lr=1e-2),
                lr=1e-1)
        )
        with self.assertRaisesRegex(ValueError, "Invalid lr_decay value: -0.5"):
            optim.Adagrad(None, lr=1e-2, lr_decay=-0.5)

    def test_adagrad_sparse(self):
        self._test_rosenbrock_sparse(
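The test hunk relies on `assertRaisesRegex`, which checks both that the constructor raises and that the message matches a pattern, so the test pins down *which* validation fired. A torch-free sketch of the same pattern, using a hypothetical stand-in class rather than the real `optim.Adagrad`:

```python
import unittest


class FakeAdagrad:
    """Hypothetical stand-in for optim.Adagrad, used only to illustrate
    the assertRaisesRegex testing pattern from the diff above."""

    def __init__(self, params, lr=1e-2, lr_decay=0.0):
        if not 0.0 <= lr:
            raise ValueError("Invalid learning rate: {}".format(lr))
        if not 0.0 <= lr_decay:
            raise ValueError("Invalid lr_decay value: {}".format(lr_decay))


class TestRangeChecks(unittest.TestCase):
    def test_invalid_lr_decay(self):
        # The pattern must match the message raised by the constructor.
        with self.assertRaisesRegex(ValueError, "Invalid lr_decay value: -0.5"):
            FakeAdagrad(None, lr=1e-2, lr_decay=-0.5)

    def test_valid_args_do_not_raise(self):
        FakeAdagrad(None, lr=1e-2, lr_decay=0.5)
```

Passing `None` for `params` works here (as in the PR's test) because the range checks run before the parameter list is ever touched.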
@pytorchbot test this please
Can this be merged?
Thank you @lazypanda1!
This PR adds parameter range checks to all optimizers, so that end-users cannot unknowingly pass invalid values to an optimizer and then be confused by the output when there is no actual problem with their model.
For example, running the following program produces NaNs in the output, due to an invalid value of rho (> 1.0). Output:
I tried adding constraints for all the parameters whose valid ranges I could infer from the corresponding articles, but I am still missing some. Please feel free to suggest bounds for the ones that are missing.
This is similar to the bounds check I added earlier for the Adam optimizer.
I can also add tests if needed.
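The repro program and its output are not preserved in this page, but the failure mode is easy to reconstruct in plain Python, assuming the standard Adadelta accumulator update E[g²] ← rho·E[g²] + (1 − rho)·g²:

```python
import math

# With rho > 1.0 the (1 - rho) term is negative, so the "running average"
# of squared gradients can go negative, and sqrt(E[g^2] + eps) is then
# undefined -- which surfaces as NaN in the parameter updates.
rho, eps = 1.1, 1e-6
avg = 0.0
grad = 1.0  # a constant gradient is enough to show the effect
for _ in range(5):
    avg = rho * avg + (1 - rho) * grad * grad

rms = math.sqrt(avg + eps) if avg + eps >= 0 else float("nan")
print(avg, rms)
```

Since the accumulator is negative after the very first step, a range check rejecting rho outside [0, 1] at construction time catches the mistake before any NaN can propagate.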