Do not recalculate gradient in NUTS #1730
Conversation
How did you test the speedup? It works for me, if I try a large model. Fitting this (probably not particularly useful) model took 2:50 without, and 1:35 with this patch:

```python
import pymc3
import numpy as np
import theano.tensor as tt

model = pymc3.Model()
data = np.random.randn(200, 1000)
with model:
    mu = pymc3.Normal("mu", mu=0, sd=1, shape=1000)
    mu_ = tt.dot(data, mu)
    pymc3.Normal("measure", mu=mu_, sd=1, observed=np.ones(200))

with model:
    step = pymc3.NUTS(profile=True)
    trace = pymc3.sample(1000, step=step, init=None,
                         tune=500, progressbar=True)
```

I had to work around the test_value mismatch for
Wow, thanks for testing it out so quickly! I've got a way-too-simplistic benchmark I use, of just sampling from a normal distribution 100k times. On master it is ~4.1k samples/s, on this branch, ~3.9k samples/s. The speed-up you're reporting sounds worth it -- let me refactor this a bit to make it maintainable!
I guess that would work well for measuring the overhead for each sample, but for real world models I'm usually happy if it reports samples/s instead of s/sample. :-)
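The throughput measure the two comments are debating (samples/s vs. s/sample) is easy to make concrete. This is a minimal, hypothetical sketch, not pymc3's benchmark: `draw` stands in for one sampler step, here just a plain NumPy normal draw.

```python
import time
import numpy as np

def samples_per_second(draw, n_samples=10_000):
    """Report throughput as samples/s for a draw() callable.

    A slow-per-step sampler looks bad in s/sample; for real-world
    models the samples/s number is usually the one people care about.
    """
    start = time.perf_counter()
    for _ in range(n_samples):
        draw()
    elapsed = time.perf_counter() - start
    return n_samples / elapsed

# Baseline: drawing a 10-dimensional standard normal each "step".
rate = samples_per_second(lambda: np.random.randn(10))
print(f"{rate:.0f} samples/s")
```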
Force-pushed from 259c188 to b0a3f4a
Should be ready for review
pymc3/step_methods/hmc/nuts.py (outdated)

```diff
@@ -69,39 +74,38 @@ def astep(self, q0):
         else:
             step_size = np.exp(self.log_step_size_bar)

-        u = floatX(nr.uniform())
+        u = np.typeDict[theano.config.floatX](nr.uniform())
```
Why this change?
hmm... could have sworn there was a reasonable speedup from this, but can't see it any more. I'll revert these two (there's another in the buildtree).
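For readers unfamiliar with the cast being debated above: both versions convert a Python-float uniform draw to Theano's configured float width (`theano.config.floatX`, a dtype name such as `"float32"` or `"float64"`). A minimal sketch using only NumPy, since Theano itself isn't needed to show the cast (the `floatX` name here is a stand-in for the config value):

```python
import numpy as np

# Stand-in for theano.config.floatX, which is a dtype name string.
floatX = "float64"

u = np.random.uniform()  # plain Python float, i.e. float64

# The PR used np.typeDict[floatX](u); np.typeDict mapped a dtype name
# to its NumPy scalar type and has since been removed from NumPy.
# np.dtype(floatX).type does the same lookup and still works.
u_cast = np.dtype(floatX).type(u)
assert isinstance(u_cast, np.floating)
```

With `floatX = "float32"` the same line would downcast the draw, which is the whole point of routing the cast through the config value.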
```diff
         leaf_size = int(np.log(u) + energy_change <= 0)
         is_valid_sample = (np.log(u) + energy_change < Emax)
-        return q_edge, p_edge, q_edge, leaf_size, is_valid_sample, min(1, np.exp(-energy_change)), 1
+        p_accept = min(1, np.exp(-energy_change))
+        return BinaryTree(q, p, q_grad, q, leaf_size, is_valid_sample, p_accept, 1)
```
This use of namedtuple definitely cleans things up a bit 👍.
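A sketch of what the `BinaryTree` namedtuple in the diff might look like. The field names are guessed from the positional arguments in the return statement above; the actual definition in the PR may differ.

```python
from collections import namedtuple
import numpy as np

# Hypothetical field names, inferred from the diff's return statement.
BinaryTree = namedtuple(
    "BinaryTree",
    ["q", "p", "q_grad", "proposal",
     "leaf_size", "is_valid_sample", "p_accept", "n_steps"],
)

q = np.zeros(3)
p = np.ones(3)
tree = BinaryTree(q, p, -q, q, 1, True, 0.9, 1)

# Named access is what "cleans things up": tree.p_accept reads far
# better than tree[6] did with a bare 8-element tuple.
print(tree.p_accept, tree.leaf_size)
```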
Very nice. It's a bit unfortunate for the code that we have to carry the dlogp around all the time, but the speed-up is definitely worth it. Also another example of where the regression tests come in really handy.
This is an attempt at addressing #1693. Local benchmarks put it marginally slower than NUTS on current master. My two explanations are:

- The grad calculation is already memoized
- I did a bad job factoring out the dlogp function

Wanted to make this PR in case @aseyboldt or someone else wanted to take a look, but will close again if there is no progress soon.