
WIP Samplers should keep the same dtype as provided by theano. #1253

Merged: 3 commits merged into master from use_theano_float_type on Aug 9, 2016

Conversation

@twiecki (Member) commented Jul 22, 2016

Related: #1246

@twiecki mentioned this pull request Jul 22, 2016
Diff under review:

     else:
    -    q = q0 + delta
    +    q0 = q0.astype(theano.config.floatX)
    +    q = (q0 + delta).astype(theano.config.floatX)
Review comment:

Does this affect numerical precision when theano is not actually needed? Can we use model.dtype instead of theano.config.floatX? That way, if the model does not actually need theano or to run on the GPU, the sampling can still run in float64 even if theanorc specifies float32.

@twiecki (Member Author) replied:

How do you mean? Theano is always needed and I would expect that model.dtype must match theano.config.floatX.

Review comment:

Okay. Also, the downcast only needs to happen at the very last step, right before the theano function is called. This is why I prefer the allow_input_downcast=True setting of theano.function: the cast happens quietly and only when needed.
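For reference, a minimal sketch of what allow_input_downcast does (illustrative, not this PR's code): with it set, theano.function quietly casts float64 inputs down to floatX only when the compiled graph needs it.

    import numpy as np
    import theano
    import theano.tensor as tt

    x = tt.vector('x')  # dtype follows theano.config.floatX
    f = theano.function([x], x * 2, allow_input_downcast=True)

    # A float64 NumPy array is silently downcast at call time when floatX is float32.
    print(f(np.arange(3, dtype='float64')))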

@twiecki (Member Author) replied:

I tried this in #1265, but it doesn't seem to work.

Diff under review (same hunk, with context):

     else:
    -    q = q0 + delta
    +    q0 = q0.astype(theano.config.floatX)
    +    q = (q0 + delta).astype(theano.config.floatX)

         q_new = metrop_select(self.delta_logp(q, q0), q, q0)
@fhuszar commented Jul 22, 2016:

Hey, is a theano function involving logp recompiled every time a Metropolis-Hastings accept/reject happens? That would be painfully slow (although theano does cache some of the optimisations, you still only want to compile a function once).

@twiecki (Member Author) replied:

Actually, this is just a Python function. (Contrary to what I said earlier; I misremembered, because we did make delta_logp a theano function, and that function is only compiled once.)
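To illustrate the compile-once point (a hedged sketch with a stand-in log-density, not PyMC3's actual delta_logp): the theano function is built once, outside the sampling loop, and every step only calls it.

    import numpy as np
    import theano
    import theano.tensor as tt

    q = tt.vector('q')    # proposed point
    q0 = tt.vector('q0')  # current point
    logp = lambda v: -0.5 * tt.sum(v ** 2)  # stand-in standard-normal log-density

    # Compiled once; every Metropolis step just calls it, nothing is recompiled.
    delta_logp = theano.function([q, q0], logp(q) - logp(q0))

    floatX = theano.config.floatX
    print(delta_logp(np.zeros(3, dtype=floatX), np.ones(3, dtype=floatX)))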

@twiecki (Member Author) added:

We could probably turn metrop_select into a theano function using ifelse, and then move the proposal generation over to theano's random streams as well.
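Roughly what that could look like (a hedged sketch of the idea, not PyMC3's implementation; for simplicity the uniform draw is passed in as an input rather than generated with theano's random streams):

    import numpy as np
    import theano
    import theano.tensor as tt
    from theano.ifelse import ifelse

    delta_logp = tt.scalar('delta_logp')  # precomputed logp(q) - logp(q0)
    q = tt.vector('q')      # proposed point
    q0 = tt.vector('q0')    # current point
    log_u = tt.scalar('log_u')

    # Accept the proposal q when log(u) < delta_logp, otherwise keep q0.
    q_new = ifelse(log_u < delta_logp, q, q0)
    metrop_select = theano.function([delta_logp, q, q0, log_u], q_new)

    floatX = theano.config.floatX
    print(metrop_select(0.5, np.ones(2, dtype=floatX), np.zeros(2, dtype=floatX),
                        float(np.log(np.random.uniform()))))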

@twiecki (Member Author) commented Jul 28, 2016

Merged in #1265 in case it helps. It now seems this is the only way to enforce correct updating of shared variables.

@twiecki (Member Author) commented Jul 30, 2016

Can someone check if we can sample on the GPU now? @fhuszar

@cynddl commented Aug 9, 2016

It seems to work on my laptop (Python 3.5.1, Theano 0.8.2, PyMC3 from the branch use_theano_float_type). I tried the sample code from #1246 with both CPU and GPU. It returns 10k samples in 1.3s on CPU and 3.2s on GPU.

@twiecki (Member Author) commented Aug 9, 2016

@cynddl Awesome, thanks for checking!

@twiecki merged commit 680a8eb into master on Aug 9, 2016
@twiecki deleted the use_theano_float_type branch on Aug 9, 2016 at 19:12
@twiecki (Member Author) commented Aug 9, 2016

@cynddl A really good test would be https://github.com/twiecki/WhileMyMCMCGentlySamples/blob/master/content/downloads/notebooks/bayesian_neural_network_lasagne.ipynb (specifically the convolutional net at the bottom). Want to test with that?

@cynddl commented Aug 9, 2016

I've been trying with the use_theano_float_type branch of PyMC3 and the bleeding-edge version of lasagne (I get a RuntimeError when defining models with the stable version). The sampling gets stuck at:

Iteration 0 [0%]: ELBO = -17251256.38

Same behavior on CPU or GPU. Maybe the problem is too large for my computer. Any idea?

@twiecki (Member Author) commented Aug 10, 2016

Yeah, possibly; it's definitely slow for such a big model. You could also try the smaller https://github.com/pymc-devs/pymc3/blob/master/docs/source/notebooks/bayesian_neural_network_advi.ipynb

@springcoil (Contributor) commented:
What's the status on this? Are we working on GPU support? I need to get a GPU for home so I can check this out. It looks like some cool work, and it looks like a lot of the errors are fixed. I've not had the time to look into this yet :)

@twiecki (Member Author) commented Sep 7, 2016

Yeah, I'm not quite sure what the best way to go about this is. If we enforce the same dtype everywhere, model creation suffers. Maybe there is a way to only switch it on explicitly, but that seems a bit cumbersome.

@Spaak (Member) commented Dec 16, 2016

@twiecki I'm somewhat confused. I think I'm running into this same error (TypeError: expected type_num 11 (NPY_FLOAT32) got 12) using either NUTS or Metropolis. When I look in the repository at metropolis.py, there is no code for dtype handling. Yet PR #1253 is listed as merged into master, which should have added the relevant code.

Were these commits (inadvertently) reverted? Apologies if I'm missing something obvious.

(PS: I'm on CPU, not GPU, but with theano.config.floatX = 'float32'.)
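For reference, a minimal sketch of the general failure mode (illustrative, not the exact code path in metropolis.py; with trust_input the check is skipped, so the traceback looks different, but the underlying mismatch is the same): with floatX = 'float32' a compiled function expects float32 inputs, and float64 arrays are rejected unless a downcast is allowed.

    import numpy as np
    import theano
    import theano.tensor as tt

    theano.config.floatX = 'float32'  # as set via .theanorc in the report above

    x = tt.vector('x')                # float32 under this floatX
    f = theano.function([x], x.sum())

    try:
        f(np.zeros(3, dtype='float64'))  # float64 handed to a float32 input
    except TypeError as err:
        print('TypeError:', err)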

@twiecki (Member Author) commented Dec 16, 2016

I did: 1c9adc6

I thought the PR fixed the problem even with trust_input. Apparently I was wrong. Can you check whether removing that line fixes your problem?

@Spaak (Member) commented Dec 16, 2016

Like I wrote, I think the commits in this PR (at least beb720e) have somehow disappeared from master. I'm not enough of a git wizard to determine how, but this seems to be the case. Because of that, delta.dtype here will be float64 regardless of the floatX setting, and thus q.dtype will be float64 as well.

Commenting out the trust_input line gives me, as expected, an error about Theano not wanting to downcast the input (upon calling self.delta_logp with q as input here). If I then add allow_input_downcast=True it all works, but that is suboptimal, because we'd rather feed in float32s in the first place.
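A small sketch of that preferred approach (illustrative; delta_logp refers to the compiled function discussed above): generate the proposal in NumPy, then cast it to floatX once, right before the theano call, so neither trust_input nor allow_input_downcast is needed.

    import numpy as np
    import theano

    floatX = theano.config.floatX

    q0 = np.zeros(3, dtype=floatX)
    delta = np.random.normal(size=q0.shape)  # NumPy proposals are float64 by default
    q = (q0 + delta).astype(floatX)          # cast once, before calling delta_logp(q, q0)

    assert q.dtype == np.dtype(floatX)       # inputs already match, no silent downcast needed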

@twiecki (Member Author) commented Dec 16, 2016

Hm, I reverted that merge because it caused some other unanticipated problems. Ideally we'd have dtypes consistent with floatX.

Just to make sure, you're saying that the model works with that commit in place?

@Spaak (Member) commented Dec 16, 2016

Yes, with that commit in place (or actually this one: a171012, because the other one is no longer reachable in the repository), the model works, even when 1c9adc6 is also in place.

@twiecki (Member Author) commented Dec 16, 2016

Interesting. I suppose we should revisit why other people were seeing problems with that in place.

@twiecki (Member Author) commented Dec 16, 2016

@Spaak Can you try this branch: #1338?

@Spaak (Member) commented Dec 16, 2016

The model works, with both Metropolis and NUTS.

Edit: just for completeness, I'm running on CPU with floatX=float32, not on GPU. I'm emphasizing this again because I saw an issue where CPU vs. GPU was relevant. I'll get a GPU soon to test further.

Edit 2: that branch gives me massive dtype-related errors with ADVI; plain sampling now works well.
