Implement StaticSVI #1562

Closed · wants to merge 5 commits

Conversation

fehiepsi (Member) commented Nov 23, 2018

This pull request implements StaticSVI, an SVI interface for model/guide pairs that do not create new params dynamically. This lets LBFGS work with this inference (addresses #1519).
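
For context, a minimal background sketch (not part of this PR; the parameter and loss are stand-ins): torch.optim.LBFGS needs the full parameter list at construction time and re-evaluates the loss through a closure, which is why parameters created lazily during an SVI step do not fit its interface.

import torch

# All parameters must exist before the optimizer is built.
params = [torch.randn(2, requires_grad=True)]
optimizer = torch.optim.LBFGS(params, lr=0.1)

def closure():
    # LBFGS calls this repeatedly within a single step to re-evaluate the loss.
    optimizer.zero_grad()
    loss = (params[0] ** 2).sum()  # stand-in quadratic loss
    loss.backward()
    return loss

for _ in range(10):
    optimizer.step(closure)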

fehiepsi requested a review from eb8680 on November 23, 2018 at 00:46
alpha_q, beta_q = torch.exp(alpha_q_log), torch.exp(beta_q_log)
pyro.sample("p_latent", dist.Beta(alpha_q, beta_q))

adam = optim.Adam({"lr": .001})
fritzo (Member) commented on the diff above:

Can you also add a test using LBFGS?

fehiepsi (Member, Author) replied:

@fritzo I have added another test for it. Using LBFGS for this model is quite flaky.

eb8680 (Member) commented Nov 23, 2018

I don't really think this is necessary (in fact, I think we should deprecate SVI entirely, but that's another discussion). In this situation, we should encourage users to write their models as nn.Modules and use Pyro losses and torch.optim optimizers together directly instead of going through the SVI and PyroOptim interfaces:

class MyModel(nn.Module):
    def __init__(self, ...):
        ...

    def model(self, batch):
        ...

    def guide(self, batch):
        ...

model = MyModel(...)

elbo = pyro.infer.Trace_ELBO()
optim = torch.optim.SGD(model.parameters(), lr=0.01)  # or LBFGS etc.; SGD requires an explicit lr
for batch in data:
    optim.zero_grad()
    elbo.loss_and_grads(model.model, model.guide, batch)
    optim.step()

fehiepsi (Member, Author) commented:

@eb8680 Do you mean that we'd have a Pyro nn.Module that collects all the parameters from the model and guide when we call mymodel.parameters()? Without that, users have to know about poutine.trace(param_only=True) to capture all the params for a PyTorch optimizer. In addition, SVI.run() is quite convenient for getting samples from the posterior; otherwise, users will have to learn about poutine.trace(...), poutine.replay(...), etc.
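
For reference, a minimal sketch (not from the thread; collect_pyro_params is a hypothetical helper) of the poutine.trace(param_only=True) pattern mentioned above for handing Pyro params to a plain PyTorch optimizer:

import pyro.poutine as poutine

def collect_pyro_params(model, guide, *args, **kwargs):
    # Run guide and model once, recording only pyro.param sites,
    # and return their unconstrained tensors for a torch.optim optimizer.
    params = set()
    for fn in (guide, model):
        trace = poutine.trace(fn, param_only=True).get_trace(*args, **kwargs)
        params.update(site["value"].unconstrained()
                      for site in trace.nodes.values()
                      if site["type"] == "param")
    return list(params)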

eb8680 (Member) commented Nov 23, 2018

@fehiepsi module.parameters() is a method of torch.nn.Module that PyTorch users would be familiar with. Most of our tutorials, e.g. the VAE tutorial, already wrap the model and guide as methods of a single nn.Module and never call pyro.param explicitly in the model or guide, but they use pyro.module (which calls module.named_parameters() under the hood) to pass the parameters to the Pyro parameter store via SVI.step. I don't think that extra layer of indirection is buying us much, and I think the snippet above is less opaque and more consistent with PyTorch idioms.
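
For contrast, a minimal sketch (not from the thread; Encoder and guide are illustrative) of the pyro.module indirection described above, where an nn.Module's parameters are pushed into Pyro's param store from inside the guide:

import torch.nn as nn
import pyro
import pyro.distributions as dist

class Encoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(10, 2)

    def forward(self, x):
        return self.fc(x)

encoder = Encoder()

def guide(x):
    # pyro.module registers encoder's named_parameters() with the param store,
    # which is how they become visible to SVI.step and PyroOptim.
    pyro.module("encoder", encoder)
    loc = encoder(x)
    pyro.sample("z", dist.Normal(loc, 1.).to_event(1))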

We shouldn't try to fix leaky abstractions by introducing more indirection, we should just deprecate/remove them and do our best to help users write code that's easier to understand. SVI, Trace, ParamStoreDict, ELBO, and PyroOptim are some of the worst offenders. On that note, re: trace and replay for parameter capture and posterior sampling instead of SVI, I actually think it'd be better to do that. We can always provide a couple of tiny wrappers that remove boilerplate without obfuscation.
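
For illustration, a minimal sketch (not from the thread; posterior_trace is a hypothetical tiny wrapper of the kind mentioned above) of the trace/replay pattern for drawing posterior samples without SVI.run():

import pyro.poutine as poutine

def posterior_trace(model, guide, *args, **kwargs):
    # Sample latent sites from the (trained) guide, then replay the model
    # against those values and return the resulting model trace.
    guide_trace = poutine.trace(guide).get_trace(*args, **kwargs)
    replayed_model = poutine.replay(model, trace=guide_trace)
    return poutine.trace(replayed_model).get_trace(*args, **kwargs)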

fehiepsi (Member, Author) commented Nov 24, 2018

@eb8680 It took me a while to feel that I understand what you mean. :) Did you mean that we define all the parameters ahead of time, so we don't need to use ParamStoreDict or PyroOptim? If that is the case, then I am on board with the future deprecation of these wrappers (together with SVI, of course). I'll try to think about that idiom for the gp module, by the way. But please correct me if my understanding is incorrect.

About this PR, how about moving this to contrib for a while?

eb8680 (Member) commented Nov 24, 2018

> Did you mean that we define all the parameters ahead, so we don't need to use ParamStoreDict, PyroOptim?

Yes, I mean that if all the parameters can be defined ahead of time, we should encourage users to write their models or model/guide pairs as nn.Modules and use torch.optim and standard PyTorch idioms, not just if they want to use LBFGS but also more generally. We've slowly but steadily accumulated technical debt in our API design (much of it my fault, like making Trace a networkx.DiGraph), and we should try to be diligent about not adding more.

Here's a rewritten version of the example from your test in this PR to illustrate:

import torch
import torch.nn as nn

import pyro
import pyro.distributions as dist
from pyro.infer import Trace_ELBO

x = 1 + torch.randn(10)

class MyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.mu = nn.Parameter(torch.tensor(0.))
        self.sigma = nn.Parameter(torch.tensor(1.))  # log-scale; exponentiated below

    def forward(self):
        with pyro.plate("plate", len(x)):
            return pyro.sample("x", dist.Normal(self.mu, torch.exp(self.sigma)), obs=x)

model = MyModel()
optim = torch.optim.LBFGS(model.parameters())

def closure():
    optim.zero_grad()  # LBFGS re-evaluates the loss, so clear grads on each call
    return Trace_ELBO().loss_and_grads(model, lambda: None)  # trivial guide: no latent sites

for _ in range(100):
    optim.step(closure)

Or version 2:

x = 1 + torch.randn(10)

class MyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.mu = nn.Parameter(torch.tensor(0.))
        self.sigma = nn.Parameter(torch.tensor(1.))

    def model(self):
        with pyro.plate("plate", len(x)):
            return pyro.sample("x", dist.Normal(self.mu, torch.exp(self.sigma)), obs=x)

    def guide(self):
        pass

model = MyModel()
optim = torch.optim.LBFGS(model.parameters())

def closure():
    optim.zero_grad()
    return Trace_ELBO().loss_and_grads(model.model, model.guide)

for _ in range(100):
    optim.step(closure)

> About this PR, how about moving this to contrib for a while?

I'm not sure that's the right type of thing to add to contrib. How about an example that uses torch.optim.LBFGS instead?

fehiepsi (Member, Author) commented:

In that case, I vote to close this PR. I'll use that pattern in a GP tutorial instead. :)
