
WIP: fix bound evaluation in dist_math #1591

Closed
wants to merge 13 commits into from

Conversation

@fonnesbeck (Member)

This is a first pass at fixing the bug in bound that manifests itself in #1579. I've used tt.stack to get around the heterogeneous arguments, but this results in dimension-matching problems for some models that I don't yet understand.

Also added the tanks example from #1579 to test_examples.

Closes #1579

@twiecki (Member) commented Dec 10, 2016

Does the original code not work?

@fonnesbeck (Member, Author)

Original code?

@twiecki (Member) commented Dec 12, 2016

    ret = 1
    for c in vals:
        ret = ret * (1 * c)  # elementwise product; conditions broadcast against each other
    return ret
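For context, the key property of this loop is that the elementwise product broadcasts: a scalar condition spreads across a vector one. A minimal NumPy sketch of that behavior, with made-up conditions standing in for the symbolic tensors:

    import numpy as np

    # Hypothetical conditions of mixed shape, as bound() might receive them:
    vals = [np.array([True, True, False]),  # elementwise condition, shape (3,)
            np.array(True)]                 # scalar condition

    ret = 1
    for c in vals:
        ret = ret * (1 * c)  # the scalar condition broadcasts over shape (3,)

    print(ret)  # [1 1 0] -- an elementwise mask, not a single boolean

This is the broadcasting that #1449 ran into, and that the Multinomial use case relies on.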

@twiecki twiecki added the WIP label Dec 12, 2016
@twiecki (Member) commented Dec 12, 2016

1b7b4bb

@fonnesbeck (Member, Author)

I assumed it would reintroduce #1449.

@twiecki (Member) commented Dec 12, 2016

Oh, that was the original issue.

I think that bears taking another look: either we have broadcasting in bound, which can sometimes lead to odd behavior as in #1449, or we don't. I don't think there's a middle ground.

@twiecki (Member) commented Dec 12, 2016

Perhaps we should have a kwarg broadcast=False that can turn broadcasting on or off. Both use cases (DiscreteUniform and Multinomial) are valid, I think.
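A rough sketch of that idea (hypothetical signature, not actual pymc3 API; assuming the usual np/tt aliases):

    import numpy as np
    import theano.tensor as tt

    def bound(logp, *conditions, **kwargs):
        # Hypothetical broadcast kwarg, per the suggestion above.
        broadcast = kwargs.pop('broadcast', False)
        if broadcast:
            ok = 1
            for c in conditions:
                ok = ok * (1 * c)  # elementwise; broadcasting allowed (Multinomial-style)
        else:
            # reduce every condition to a single scalar (DiscreteUniform-style)
            ok = tt.all([tt.all(1 * c) for c in conditions])
        return tt.switch(ok, logp, -np.inf)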

@twiecki (Member) commented Dec 12, 2016

How does this handle the #1449 case?

@fonnesbeck (Member, Author)

It does fix it, at least based on the example @AustinRochford used in the issue. Perhaps I will include that as a test, in case of regression.

@fonnesbeck (Member, Author)

Please have a peek at this, @ferrine and/or @AustinRochford.

@ferrine (Member) commented Dec 12, 2016

I'm a bit sick and have a deadline tomorrow :( I'll review the PR in a few days.

@fonnesbeck (Member, Author)

@ferrine no big deal, thanks. Get well.

@AustinRochford (Member)

This looks like a reasonable compromise to me. 👍

try:
    # Stack the conditions and reduce across them, keeping the result elementwise.
    return tt.all(tt.stack([1 * val for val in vals]), axis=0)
except (TypeError, IndexError):
    # Fallback when the conditions cannot be stacked (heterogeneous
    # shapes): reduce each condition to a scalar first.
    return tt.all([tt.all(1 * val) for val in vals])
Member

When is it right to do this second version?

Member Author

Sometimes tt.stack will fail because the elements of vals have different lengths (the TypeError; this happens with the Poisson mixture), and sometimes because there is no axis to iterate over (the IndexError).
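A NumPy analogue of the first failure mode, with made-up conditions:

    import numpy as np

    vals = [np.array([True, True]),          # shape (2,)
            np.array([True, False, True])]   # shape (3,)

    # Conditions of unequal length cannot be stacked into one rectangular
    # array, so the stacking path fails and the fallback reduces each
    # condition to a scalar separately:
    result = all(np.all(1 * v) for v in vals)
    print(result)  # False -- the second condition has a failing element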

@ferrine (Member) commented Dec 12, 2016

You could ravel all the variables after safely converting them in Theano, then concatenate them, and finally use tt.all.

Member Author

I thought stack did most of that (including the conversion) for me automatically?

Member Author

You run into situations where vals is not only of different types but also of different lengths, so we need to deal with those potential combinations; tt.stack doesn't like heterogeneous dimensions. I haven't seen a case that trips up both paths yet, however.

Member

Different dims can be eliminated by ravel.
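A NumPy stand-in for the raveling idea (the symbolic version would use the tt equivalents after converting each input):

    import numpy as np

    def alltrue_ravel(vals):
        # np.ravel turns scalars into shape-(1,) arrays, so dimension
        # mismatches disappear before the conditions are combined.
        flat = [np.ravel(1 * v) for v in vals]
        return np.concatenate(flat).all()

    print(alltrue_ravel([np.array([True, True]), True]))  # True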

@jsalvatier (Member) commented Dec 12, 2016 via email

@fonnesbeck (Member, Author)

The original problem is #1449, which had some unexpected broadcasting behavior.

@jsalvatier (Member)

Right, okay. Then I suggest we make bound_elemwise and bound, and alltrue_elemwise and alltrue, or something similar.

@fonnesbeck (Member, Author) commented Dec 12, 2016

Perhaps it's just simpler (or at least clearer, if not simpler) to call all explicitly in each distribution as needed and pass that to bound, e.g. tt.all(value >= 0), tt.all(0 <= p), etc. That way, each logp call would always pass bound a 1-D set of booleans (one for each condition).
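A sketch of how that might look (a simplified stand-in for dist_math.bound; raw_logp, value, and mu are placeholders):

    import numpy as np
    import theano.tensor as tt

    def bound(logp, *conditions):
        # Assumes callers have already reduced each condition with
        # tt.all(), so every condition is a scalar boolean.
        ok = 1
        for c in conditions:
            ok = ok * c
        return tt.switch(ok, logp, -np.inf)

    # Inside a distribution's logp, e.g. something Poisson-like:
    # return bound(raw_logp, tt.all(value >= 0), tt.all(mu > 0))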

@jsalvatier (Member)

For multivariate distributions? That would be okay. Though I think we should still rename bound to elemwise_bound or similar, just to be less confusing.

We should keep the elemwise bound completely elemwise so that the logps are elemwise (and individual elements can be NaNs). It's useful for debugging.

@fonnesbeck (Member, Author)

No, I mean for all the distributions (multivariates already do this). Unless I am missing something, if any condition for any element of a call to logp fails, the logp should return -inf. You can still debug inside the logp if you need to.

So, if you have a vector-valued Gamma distribution, for example, you would return -inf on the call to logp on any violation of the bound by any element, be it a bad parameter element or a bad value element.

@jsalvatier (Member) commented Dec 13, 2016

Gotcha. I think that seems bad to me: elementwise distributions should be as elementwise as possible, and this change would make them partially a single distribution. Keeping them fully elementwise is more conceptually coherent.

What do you mean you can still debug inside the logp?

The problems I'm thinking of are things like: if you pass bad initial values to an elemwise dist, it's better if you can look at the logp to tell which ones are bad. Or if find_MAP messes up. Or if you write a bad sampler or distribution.

You can also imagine custom samplers using the -inf information to adjust the scale for particular elements.

I also don't really see the benefit. We can just have two functions with clearer names.
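A NumPy illustration of the debugging point, with made-up values:

    import numpy as np

    # With a fully elementwise bound, a bad initial value shows up as
    # -inf only in its own slot:
    value = np.array([2.0, -1.0, 3.0])            # -1.0 violates value >= 0
    logp = np.where(value >= 0, -value, -np.inf)  # stand-in elemwise logp
    print(logp)  # [ -2. -inf  -3.] -> element 1 is the offender

    # A scalar bound would collapse all of this to a single -inf.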

@fonnesbeck (Member, Author)

I think I've lost the plot on this one. I will close this and let someone else take a shot at it. I'm sure the answer is easy and I'm just missing it.

@fonnesbeck fonnesbeck closed this Dec 13, 2016
@fonnesbeck fonnesbeck deleted the alltrue_fix branch December 13, 2016 01:13
Merging this pull request may close: Error when using DiscreteUniform