ENH: Allow specyfing inverse covariance of a multivariate normal distribution #16002

siddhantwahal · 2022-04-17T21:37:09Z

Reference issue

What does this implement/fix?

This PR allows specifying the inverse covariance of a multivariate normal in addition or instead of the covariance by adding the inverse_cov keyword argument.

To provide functionality with the covariance or its inverse, essential information from the supplied matrix is extracted into a _CovInfo object. This object is then passed to private helper methods such as multivariate_normal_gen._logpdf.

This PR also introduces the private multivariate_normal_gen._rvs method. This introduction is only needed to solve #11772 and is a little orthogonal to the changes needed to solve #11053. However, the introduction of the _CovInfo allows solving #11772 in just a few lines, so I've gone ahead and included them here.

If desired, the _CovInfo class could be included in the public API so that advanced users can instantiate it and pass it as a keyword argument instead. For example, instead of only being restricted to supplying the inverse

multivariate_normal(mean=0, inverse_cov=1)

users would get complete control over the description of the covariance matrix through its square root factors:

multivariante_normal(mean=0, cov_info=CovInfo(...)

Thus, this PR also makes progress towards #15675.

Additional information

This PR supersedes #12184 which should be closed if this is merged.

mdhaber · 2022-05-16T23:30:21Z

@siddhantwahal thanks for your patience here. Personally, I've been focusing on fixing bugs... When that's a little more under control, I know there are a lot of enhancements waiting!
Could you take a look at gh-15675? I wouldn't ask you to include that in this PR, but can you envision what that might look like as a follow-up to this one?

siddhantwahal · 2022-05-17T02:42:44Z

@mdhaber no worries!

I think gh-15675 can be broken into two tasks:

allow specifying the eigendecomposition of the covariance if known (so that the covariance isn't factorized again)
allow specifying the covariance as a composition of rotation angles and variances

To enable (1) after this PR:

make the CovInfo class public
swap the inverse_cov=None keyword argument in multivariate_normal with cov_info=None
- this will require some care when setting defaults in _process_parameters
- don't create the CovInfo object if one is passed as a keyword argument, maybe this could just be done in _process_parameters?

I think that's it.

One possible UX is (assuming the eigen decomposition is u @ d @ u.T:

[nav] In [3]: cov_info = CovInfo(
         ...:     sqrt_cov=u @ np.sqrt(d),
         ...:     sqrt_inv_cov=u @ np.sqrt(1 / d),
         ...:     log_det_cov=np.sum(np.log(d)),
         ...:     rank=len(d)
         ...:   )
[nav] In [4]: mvn = multivariate_normal(mean=np.zeros(10_000), cov_info=cov_info)

The UX could be enhanced by adding factory methods to create CovInfo from the eigendecomposition or a combination of rotations and variances. For example, CovInfo.from_eigendecomp(eigvals=u, eigvecs=d) or CovInfo.from_rotation(angles=angles, variances=variances). That would accomplish (2).

mdhaber · 2022-05-23T00:13:17Z

Thanks @siddhantwahal for your patience. I'll take a look in June if nobody gets to this first.

mdhaber · 2022-06-26T19:02:48Z

scipy/stats/_multivariate.py

+        self._cov = cov
+        self._inverse_cov = inverse_cov
+
+    @cached_property


I like the idea of this class. I wonder why it doesn't go a little further. Why not accept all of the information the user can provide in the initializer and store all of it as private attributes in the initializer (e.g. self._sqrt_cov = _sqrt_cov), then have public cached_propertys for all of these attributes that computes them (lazily, in effect) in the most efficient way, given whatever information is available?

mdhaber

Before I go much further with this review, I want to check what you think of the idea of refactoring this so that all the logic about the covariance matrix (including the stuff in process_parameters, _process_cov_shaped) goes in the _CovInfo class, and _process_parameters would instantiate the _CovInfo object rather than doing that separately in every method.

Or really, what is the distinction between _CovInfo and _PSD / what should it be? Maybe I haven't looked deeply enough, but it seems like it would be nice to have one class that accepts all sorts of different representations of a covariance matrix and has methods that do all the things you want to do with a covariance matrix. If we combined the things we like from this PR and the existing _PSD class to make a nice class representing a covariance matrix, maybe we could also endow other multivariate distributions (e.g. multivariate_t) with these same features, but with minimal changes to existing code?

mdhaber · 2022-06-26T19:42:35Z

scipy/stats/_multivariate.py

@@ -437,7 +537,31 @@ def _process_quantiles(self, x, dim):

        return x

-    def _logpdf(self, x, mean, prec_U, log_det_cov, rank):
+    def _create_cov_info(self, cov, inverse_cov, allow_singular=True):


Along the lines of https://github.com/scipy/scipy/pull/16002/files#r906859132, I think this logic could go in _CovInfo. Similar logic could be added for eigendecomposition or variances and rotation angles.

mdhaber · 2022-07-15T05:25:52Z

What do you think about this comment #16002 (review) @siddhantwahal? (Sorry it took me a while to get to this!)

siddhantwahal · 2022-07-15T13:33:54Z

What do you think about this comment #16002 (review) @siddhantwahal? (Sorry it took me a while to get to this!)

Thanks a lot for the thoughtful review @mdhaber. I was swamped with preparing for our Scipy 2022 tutorial so haven't had a chance to consider or respond to your suggestion. I plan on resuming work on this in the next few weeks!

mdhaber · 2022-07-15T16:52:38Z

@siddhantwahal oh, are you here? Hope to see you at the sprint tomorrow!

siddhantwahal · 2022-08-14T23:47:04Z

Apologies for the delay here @mdhaber.

Or really, what is the distinction between _CovInfo and _PSD / what should it be?

I was envisioning _CovInfo to be a simple container to hold properties (sqrt_cov, log_det_cov etc.) of the covariance matrix. How this container was created could be left unspecified. In effect,_PSD was an algorithm or policy for creating a _CovInfo instance. This separation made sense because the various multivariate distributions only care about what the various properties of the covariance are, not how they're created (strictly mathematically speaking; from the software perspective we need to maintain backwards compatibility and numerical stability).

However, this separation forces us to specify the _CovInfo object in its entirety and prevents lazy computation of properties. A compromise here is to introduce different implementations for different algorithms or representations. 7e75eeb is an attempt towards this. After this commit, _PSD could be entirely replaced with EigSvdCovInfo in other distributions, but that's best done in a separate PR.

Maybe I haven't looked deeply enough, but it seems like it would be nice to have one class that accepts all sorts of different representations of a covariance matrix and has methods that do all the things you want to do with a covariance matrix.

It's not yet clear to me whether we could or even should have one class to do this. For example, multivariate_normal and multivariate_t both use the Eigenvalue and SVD based algorithm (implemented in _PSD and the implementation of multivariate_normal.rvs in master) to compute covariance properties, but the Wishart distribution uses the Cholesky factorization. There could be several other ways to represent the covariance matrix. We could end up with a very long (and therefore difficult to maintain) class if we try to account for all of these possibilities in one class.

If we combined the things we like from this PR and the existing _PSD class to make a nice class representing a covariance matrix, maybe we could also endow other multivariate distributions (e.g. multivariate_t) with these same features, but with minimal changes to existing code?

That should be possible with the latest implementation too, EigSvdCovInfo should plug in directly into multivariate_t.

Before I go much further with this review, I want to check what you think of the idea of refactoring this so that all the logic about the covariance matrix (including the stuff in process_parameters, _process_cov_shaped) goes in the _CovInfo class, and _process_parameters would instantiate the _CovInfo object rather than doing that separately in every method.

_process_parameters is parsing mean, cov, andinverse_cov, inferring dimensionality, assigning suitable defaults (like mean=0, cov=1 if either is None), ensuring that they have a suitable data type. I find the current implementation appropriate -- the distributions remain responsible for parsing user input for mean and covariances, while CovInfo is concerned with further breaking down the parsed covariance input into properties commonly accessed by the distribution. Same for _process_cov_shaped, it's just checking whether the dimensions of the covariance match up with the mean.

Thanks for the suggestion of computing CovInfo in process_parameters instead of doing it in every method, that is now implemented in 39cbaed. Since all distributions need now is a CovInfo object, it's really easy to expose a cov_info=None optional argument to all methods so that users can supply their own CovInfo object.

[skip azp] [skip actions]

mdhaber · 2022-09-13T06:22:32Z

I was hoping Sphinx would automatically document the attributes based on the docstrings of the properties, but that's not happening. Direct link to documentation:
https://output.circle-artifacts.com/output/job/70f59c8e-3b9b-47b1-aacc-dfdbfa2b71d7/artifacts/0/html/reference/stats.html

Thoughts for how the properties should be documented (so that they are rendered)?

mdhaber · 2022-09-14T14:26:50Z

@tirthasheshpatel I thought it would be good to get your eyes on this. IIRC, you were interested in vectorization over many covariance matrices for the multivariate_t distribution, and this is setting us down that road.
We're not taking the straightest path. In addition, users will get several options for computing the tricky part of several multivariate distributions (inverse matrix vector product) and much greater control over what to do with singular covariance matrices. But I think that's important, too. If you want to see a little further than what we have here, please take a look at mdhaber#88, which has four of these Covariance classes (and notes about a fifth one using LDL), all vectorized to support broadcasting over ND shape parameters for the MVN pdf and entropy methods (for starters). I think it would take just a few lines to go from there to multivariate_t.

siddhantwahal · 2022-09-15T02:42:42Z

Thoughts for how the properties should be documented (so that they are rendered)?

I started to look into this but didn't get very far. Sphinx is picking up the properties, (see the log_pdet docs, for example).

My working hypothesis is that inherited-members needs to be explicitly activated to render properties from the parent class, but haven't gotten round to testing it (dev env broke again).

tirthasheshpatel

Sorry for the late review! Just a few nitpicky comments otherwise the implementation looks good.

scipy/stats/_covariance.py

tirthasheshpatel · 2022-09-24T00:46:57Z

About cache_propertys not rendering in the docs: since only covariance needs to be cached, we can make others normal properties, and instead of making covariance a cache_property, we can instead make it a normal property and initialize _covariance to None in the __init__ method and lazily compute it (only once) when covariance is accessed. What do you think?

mdhaber · 2022-09-24T18:36:55Z

Thanks @tirthasheshpatel! I implemented these suggestions. As it turns out, we don't need to do anything special with covariance because the _covariance of the subclass is a cached_property, so it will only ever be calculated once.

mdhaber · 2022-09-24T19:25:09Z

Failures appear to be unrelated.

scipy/stats/_multivariate.py

scipy/stats/tests/test_multivariate.py

tirthasheshpatel

The tests look very strong! Just one nitpick and I think this PR is good to go. Thanks @mdhaber, @siddhantwahal!!

… test

mdhaber · 2022-09-27T05:05:21Z

Thanks @tirthasheshpatel! Made the change. Please feel free to squash-merge with me as a co-author.

What would you be most interested in reviewing next? We can specify covariance matrices using:

diagonals (diagonal matrices only, of course)
cholesky decomposition
eigenvalue decomposition

And I can add LDL after we add these. Presumably, I should add these in short, separate PRs to keep things simple?

tirthasheshpatel · 2022-09-27T05:14:28Z

What would you be most interested in reviewing next?

I think the diagonal one should be a bit easier. We can start with that!

Presumably, I should add these in short, separate PRs to keep things simple?

That sounds good!

tirthasheshpatel · 2022-09-27T06:02:25Z

Merged! Thanks @mdhaber @siddhantwahal!

mdhaber · 2022-09-27T14:37:43Z

Thanks, Tirth!
And @siddhantwahal - we can also change the rvs method shortly.
This merged sooner than expected, so to anyone coming by - please feel free to comment here. This is still a work in progress and not yet released, so we can still make changes in a separate PR.

tupui · 2022-09-28T10:14:11Z

scipy/stats/_covariance.py

+
+class Covariance:
+    """
+    Representation of a covariance matrix


This is really missing some documentation IMHO.

Also, from the naming of the subclasses, it seems that classmethods would be more appropriate. Then you would have Covariance.from_precision instead of CovViaPrecision.

It wasn't public before. The subclasses had more information about how they were working. When we go to the classmethod idea, I can add some more information to the Covariance class itself.

siddhantwahal added 17 commits March 20, 2022 18:29

Add sqrt computation to _PSD

459abd4

Allow specifying inverse covariance in multivariate normal

ba868de

Update tests

44c75e5

Update test for degenerate distributions

209f2dd

Add an explanatory comment

8a9a114

Update documentation

41328f5

Fix typos

012f2a0

Refactor to improve readability

c730f0f

Refactor tests

f8f7841

Fix indentation

36c183a

Flake8 fixes

8a205c1

Minor style fixes

0695987

Add missing line back

cecb09b

Add tests for non-numerical matrices

3cc126c

Add explanatory comments

12c04f6

Fix flake8

94c5383

Always squeeze output

9ca2628

tylerjereddy added scipy.stats enhancement A new feature or improvement labels Apr 18, 2022

mdhaber reviewed Jun 26, 2022

View reviewed changes

siddhantwahal added 2 commits August 11, 2022 20:26

Introduce fuller implementations of CovInfo

7e75eeb

Create covariance info in _process_parameters

39cbaed

Update scipy/stats/__init__.py

8110cb3

[skip azp] [skip actions]

mdhaber requested a review from tirthasheshpatel September 14, 2022 14:37

tirthasheshpatel reviewed Sep 23, 2022

View reviewed changes

scipy/stats/_covariance.py Outdated Show resolved Hide resolved

scipy/stats/_covariance.py Outdated Show resolved Hide resolved

scipy/stats/_covariance.py Outdated Show resolved Hide resolved

tirthasheshpatel mentioned this pull request Sep 24, 2022

BUG: Numpydoc doesn't render attributes decorated with cached_property in the Attributes section numpy/numpydoc#432

Closed

mdhaber added 2 commits September 24, 2022 11:34

MAINT: stats.Covariance: adjustements per review

fc2ec3f

Merge remote-tracking branch 'upstream/main' into enh/mvn-normal-inv-cov

a23e2e2

tirthasheshpatel reviewed Sep 27, 2022

View reviewed changes

scipy/stats/_multivariate.py Show resolved Hide resolved

tirthasheshpatel reviewed Sep 27, 2022

View reviewed changes

scipy/stats/_multivariate.py Show resolved Hide resolved

tirthasheshpatel reviewed Sep 27, 2022

View reviewed changes

scipy/stats/_multivariate.py Show resolved Hide resolved

tirthasheshpatel reviewed Sep 27, 2022

View reviewed changes

scipy/stats/tests/test_multivariate.py Outdated Show resolved Hide resolved

tirthasheshpatel approved these changes Sep 27, 2022

View reviewed changes

mdhaber added 2 commits September 26, 2022 22:04

TST: stats.multivariate_normal: remove unnecesary cdf call to clarify…

f3c89ba

… test

Merge remote-tracking branch 'upstream/main' into enh/mvn-normal-inv-cov

be4fae7

tirthasheshpatel merged commit e13d4f3 into scipy:main Sep 27, 2022

mdhaber added this to the 1.10.0 milestone Sep 27, 2022

mdhaber mentioned this pull request Sep 27, 2022

ENH: stats.Covariance: add CovViaDiagonal #17103

Merged

tupui reviewed Sep 28, 2022

View reviewed changes

This was referenced Oct 1, 2022

ENH: stats.covariance: add CovViaCholesky #17128

Merged

ENH: stats.Covariance: specifying covariance matrix by its eigendecomposition #17135

Merged

mdhaber mentioned this pull request Jun 6, 2023

ENH: multivariate_normal.rvs extremely slow #18639

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Allow specyfing inverse covariance of a multivariate normal distribution #16002

ENH: Allow specyfing inverse covariance of a multivariate normal distribution #16002

siddhantwahal commented Apr 17, 2022

mdhaber commented May 16, 2022

siddhantwahal commented May 17, 2022 •

edited

Loading

mdhaber commented May 23, 2022

mdhaber Jun 26, 2022 •

edited

Loading

mdhaber left a comment

mdhaber Jun 26, 2022

mdhaber commented Jul 15, 2022

siddhantwahal commented Jul 15, 2022

mdhaber commented Jul 15, 2022

siddhantwahal commented Aug 14, 2022 •

edited

Loading

mdhaber commented Sep 13, 2022

mdhaber commented Sep 14, 2022 •

edited

Loading

siddhantwahal commented Sep 15, 2022

tirthasheshpatel left a comment

tirthasheshpatel commented Sep 24, 2022

mdhaber commented Sep 24, 2022

mdhaber commented Sep 24, 2022

tirthasheshpatel left a comment

mdhaber commented Sep 27, 2022 •

edited

Loading

tirthasheshpatel commented Sep 27, 2022

tirthasheshpatel commented Sep 27, 2022

mdhaber commented Sep 27, 2022 •

edited

Loading

tupui Sep 28, 2022

mdhaber Sep 29, 2022

ENH: Allow specyfing inverse covariance of a multivariate normal distribution #16002

ENH: Allow specyfing inverse covariance of a multivariate normal distribution #16002

Conversation

siddhantwahal commented Apr 17, 2022

Reference issue

What does this implement/fix?

Additional information

mdhaber commented May 16, 2022

siddhantwahal commented May 17, 2022 • edited Loading

mdhaber commented May 23, 2022

mdhaber Jun 26, 2022 • edited Loading

Choose a reason for hiding this comment

mdhaber left a comment

Choose a reason for hiding this comment

mdhaber Jun 26, 2022

Choose a reason for hiding this comment

mdhaber commented Jul 15, 2022

siddhantwahal commented Jul 15, 2022

mdhaber commented Jul 15, 2022

siddhantwahal commented Aug 14, 2022 • edited Loading

mdhaber commented Sep 13, 2022

mdhaber commented Sep 14, 2022 • edited Loading

siddhantwahal commented Sep 15, 2022

tirthasheshpatel left a comment

Choose a reason for hiding this comment

tirthasheshpatel commented Sep 24, 2022

mdhaber commented Sep 24, 2022

mdhaber commented Sep 24, 2022

tirthasheshpatel left a comment

Choose a reason for hiding this comment

mdhaber commented Sep 27, 2022 • edited Loading

tirthasheshpatel commented Sep 27, 2022

tirthasheshpatel commented Sep 27, 2022

mdhaber commented Sep 27, 2022 • edited Loading

tupui Sep 28, 2022

Choose a reason for hiding this comment

mdhaber Sep 29, 2022

Choose a reason for hiding this comment

siddhantwahal commented May 17, 2022 •

edited

Loading

mdhaber Jun 26, 2022 •

edited

Loading

siddhantwahal commented Aug 14, 2022 •

edited

Loading

mdhaber commented Sep 14, 2022 •

edited

Loading

mdhaber commented Sep 27, 2022 •

edited

Loading

mdhaber commented Sep 27, 2022 •

edited

Loading