
WIP Mixture Models #1437

Merged: 17 commits merged into pymc-devs:master on Oct 18, 2016

Conversation

AustinRochford (Member) commented Oct 10, 2016

Re: #1401

This is my first pass at a flexible mixture model class, and it definitely needs a lot of work. As this notebook shows, the code supports two use cases:

  1. comp_dists is a PyMC3 distribution; that is, all of the mixture components come from the same distributional family and differ only in their parameters (e.g. a mixture of normals)
  2. comp_dists is an iterable of PyMC3 distributions (e.g. a zero-inflated Poisson)
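To make the two use cases concrete, here is a rough sketch of how each could be set up. This is not code from the linked notebook; the priors, data, and weight parameterisation are made up for illustration and assume the Mixture API proposed in this PR:

import numpy as np
import pymc3 as pm

x = np.random.randn(1000)                 # placeholder continuous data
y = np.random.poisson(2., size=1000)      # placeholder count data

# Use case 1: comp_dists is a single PyMC3 distribution whose parameters
# carry one entry per component (here, a mixture of 3 normals).
with pm.Model():
    w = pm.Dirichlet('w', a=np.ones(3))
    mu = pm.Normal('mu', mu=0., sd=10., shape=3)
    pm.Mixture('obs', w, pm.Normal.dist(mu, sd=1.),
               observed=x[:, np.newaxis])

# Use case 2: comp_dists is an iterable of PyMC3 distributions
# (here, a zero-inflated Poisson built from a point mass at 0 and a Poisson).
with pm.Model():
    w = pm.Dirichlet('w', a=np.ones(2))
    lam = pm.Exponential('lam', 1.)
    pm.Mixture('obs', w, [pm.ConstantDist.dist(0), pm.Poisson.dist(lam)],
               observed=y)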

There are a few issues to address here that I'd love to get feedback on:

  • How to subclass Discrete or Continuous as appropriate, based on the component distributions (it seems like there is some Python metaprogramming magic that should work here, but I am not very well versed in that sort of thing)
  • Intuitive broadcasting
  • NUTS seems slow for these mixture models; this may be an initialization/scaling problem

@twiecki @springcoil @fonnesbeck any feedback/guidance you could give would be much appreciated :)

twiecki (Member) commented Oct 11, 2016

Nice, this is a really clear implementation.

How to subclass Discrete or Continuous as appropriate, based on the component distributions (it seems like there is some Python metaprogramming magic that should work here, but I am not very well versed in that sort of thing)

I assume you mean for discrete mixtures? If you look at the code, it's mostly about default dtypes:

class Discrete(Distribution):
    """Base class for discrete distributions"""

    def __init__(self, shape=(), dtype='int64', defaults=['mode'], *args, **kwargs):
        super(Discrete, self).__init__(
            shape, dtype, defaults=defaults, *args, **kwargs)


class Continuous(Distribution):
    """Base class for continuous distributions"""

    def __init__(self, shape=(), dtype='float64', defaults=['median', 'mean', 'mode'], *args, **kwargs):
        super(Continuous, self).__init__(
            shape, dtype, defaults=defaults, *args, **kwargs)

As such, maybe we could inherit from Distribution instead and take the dtype from the mixture?
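One possible shape of that idea, as a purely illustrative sketch (the helper name and the exact mechanism are assumptions, not the code that ended up in this PR):

def _infer_dtype(comp_dists):
    """Illustrative helper: take the dtype from the mixture components, so a
    mixture of discrete components ends up 'int64' and a mixture of
    continuous components ends up 'float64'."""
    try:
        return comp_dists.dtype                    # single distribution
    except AttributeError:
        dtypes = set(d.dtype for d in comp_dists)  # iterable of distributions
        if len(dtypes) != 1:
            raise ValueError('All component distributions must share a dtype.')
        return dtypes.pop()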

Intuitive broadcasting

Can you specify an example case?

NUTS seems slow for these mixture models; this may be an initialization/scaling problem

Yes, that's the most common reason. If you post an example, it would help with testing.

AustinRochford (Member, Author) commented:

@twiecki thanks for the feedback.

The Continuous vs. Discrete approach makes complete sense.

For broadcasting, the notebook I posted shows two different cases. As the code stands, when the component distributions all have the same type (as in the normal mixture case, comp_dists=pm.Normal.dist(mu, sd)), the observations for the Mixture need to be broadcastable against mu (hence observed=x[:, np.newaxis]). When the components are specified as a list of distributions (as in the zero-inflated Poisson case, comp_dists=[pm.ConstantDist.dist(0), pm.Poisson.dist(lam)]), the observations should be one-dimensional. As I write this out, I think this is easier to handle than I originally thought.
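In plain NumPy terms, the broadcasting difference looks roughly like this (the shapes are assumptions based on the description above):

import numpy as np

N, K = 1000, 3
x = np.random.randn(N)      # raw observations, shape (N,)
mu = np.zeros(K)            # one location per component, shape (K,)

# Use case 1: a single parameterised comp_dists. Adding a trailing axis lets
# (N, 1) broadcast against (K,), giving an (N, K) array of per-component values.
assert (x[:, np.newaxis] - mu).shape == (N, K)

# Use case 2: a list of component distributions. Each component's logp is
# evaluated on the raw (N,) observations, so no extra axis is needed.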

AustinRochford (Member, Author) commented:

I added a NUTS example to the linked notebook; 6 samples/sec seems quite low in my experience.

twiecki (Member) commented Oct 11, 2016

I think this looks good. I bet the NUTS issue is initialization; can you try with ADVI init (https://gist.github.com/jonsedar/cd4985bbfafdba61b3c8d077dd91f237)?
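For reference, the ADVI-initialisation pattern of that era looked roughly like the sketch below (API names are from PyMC3 circa 2016, and this is not the gist's exact code):

import numpy as np
import pymc3 as pm

with pm.Model() as model:
    # Stand-in model; in practice this would be the mixture model above.
    mu = pm.Normal('mu', mu=0., sd=10.)
    pm.Normal('obs', mu=mu, sd=1., observed=np.random.randn(100))

with model:
    # Fit ADVI to get approximate posterior means and standard deviations...
    v_params = pm.variational.advi(n=50000)
    # ...then use them to initialise and scale NUTS.
    step = pm.NUTS(scaling=np.power(model.dict_to_array(v_params.stds), 2),
                   is_cov=True)
    trace = pm.sample(2000, step=step, start=v_params.means)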

In any case, I wouldn't block on the NUTS issue, we can resolve it later. Instead, I would focus on:

  • tests
  • docs (the example notebook you have is close; it just needs a bit of text).

AustinRochford (Member, Author) commented:

@twiecki yup, those are my priorities. I would also add random value generation (to support posterior predictive sampling) to the list of necessary additions.

One more question: do we want to replace the current ZeroInflated* implementations with mixtures?

twiecki (Member) commented Oct 12, 2016

One more question: do we want to replace the current ZeroInflated* implementations with mixtures?

Yes, that's probably cleaner.
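A hypothetical sketch of what that could look like (not the code that eventually landed; the constructor arguments follow the Mixture API discussed above):

import theano.tensor as tt
import pymc3 as pm


class ZeroInflatedPoisson(pm.Mixture):
    """Illustrative only: a zero-inflated Poisson expressed as a two-component
    mixture of a point mass at zero and a Poisson(theta)."""

    def __init__(self, psi, theta, *args, **kwargs):
        # psi is the probability of the Poisson component, 1 - psi the
        # probability of a structural zero.
        w = tt.stack([1. - psi, psi])
        comp_dists = [pm.ConstantDist.dist(0), pm.Poisson.dist(theta)]
        super(ZeroInflatedPoisson, self).__init__(w, comp_dists,
                                                  *args, **kwargs)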

AustinRochford (Member, Author) commented:

I also need to better understand/resolve the issues in #1449 before I can make this code work with dependent weights.

AustinRochford (Member, Author) commented on this diff:

comp_dists = self.comp_dists

try:
    value_ = value if value.ndim > 1 else value[:, np.newaxis]

This should perhaps use tt.shape_padright instead of np.newaxis.
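For reference, the tt.shape_padright variant would look something like this (a sketch only; value is the logp argument from the snippet above):

import theano.tensor as tt

# tt.shape_padright appends a broadcastable dimension on the right, so a 1-d
# value of shape (N,) becomes (N, 1) symbolically instead of via
# numpy-style indexing.
value_ = value if value.ndim > 1 else tt.shape_padright(value)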

AustinRochford (Member, Author) commented Oct 14, 2016

I am not going to port the ZeroInflated* models to be Mixture subclasses in this pull request. Due to #1452, we no longer broadcast a certain way in ConstantDist's logp, which makes that a bit trickier. To keep things simple, I will separate it into a subsequent PR unless anyone has objections.

twiecki (Member) commented Oct 14, 2016

That sounds right. Let me know when you want us to take another look.

AustinRochford (Member, Author) commented:

@twiecki I think I have gotten this into a pretty good state and would love your feedback on it again. Happy to make any changes you think are necessary to merge.

A reviewer (Member) commented on this diff:

_, sd = get_tau_sd(tau=kwargs.pop('tau', None),
                   sd=kwargs.pop('sd', None))

super(NormalMixture, self).__init__(w, Normal.dist(mu, sd=sd),

The simplicity of creating a NormalMixture here is really validating, nicely done.
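As a usage example (values and priors are illustrative; the keyword names follow the snippet above, and depending on the broadcasting behaviour discussed earlier the observations may need a trailing axis):

import numpy as np
import pymc3 as pm

# Simulated data from two normal components.
x = np.concatenate([np.random.normal(-2., 1., size=500),
                    np.random.normal(3., 1., size=500)])

with pm.Model():
    w = pm.Dirichlet('w', a=np.ones(2))
    mu = pm.Normal('mu', mu=0., sd=10., shape=2)
    sd = pm.Uniform('sd', lower=0., upper=10., shape=2)
    pm.NormalMixture('obs', w, mu, sd=sd, observed=x)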

twiecki (Member) commented Oct 18, 2016

This is a really high-quality PR, thanks!

twiecki merged commit 2572852 into pymc-devs:master on Oct 18, 2016.
AustinRochford deleted the WIP-mixture-model branch on October 18, 2016 at 11:45.
ColCarroll pushed a commit to ColCarroll/pymc3 that referenced this pull request on Dec 2, 2016:
* First pass at mixture modelling

* No longer necessary to reference self.comp_dists directly in logp

* Add dimension internally (when necessary)

* Import get_tau_sd

* Misc bugfixes

* Add sampling to Mixtures

* Differentiate between Discrete and Continuous mixtures when possible

* Add support for 2D weights

* Gracefully try to calculate mean and mode defaults

* Add docstrings for Mixture classes

* Export mixture models

* Reference self.comp_dists

* Remove unnecessary pm.

* Add Mixture tests

* Add missing imports

* Add marginalized Gaussian mixture model example

* Calculate the mode of the mixture distribution correctly