Documentation error in GaussianMixture #10141

rdturnermtl · 2017-11-14T22:26:17Z

In the documentation of sklearn.mixture.GaussianMixture, it says that precisions_cholesky_ is:
"The cholesky decomposition of the precision matrices of each mixture component."
However, in lines 317--321 of sklearn.mixture.gaussian_mixture.py it is clearly computing the inverse of the Cholesky factor rather than the Cholesky factor of the inverse. These are not the same:

import numpy as np

np.random.seed(0)
S = np.random.randn(5, 5)
S = np.dot(S, S.T)
IC = np.linalg.inv(np.linalg.cholesky(S).T)
CI = np.linalg.cholesky(np.linalg.inv(S)).T

print(IC)
print(CI)

gives

[[ 0.2801735   0.05324512 -0.24035224  0.15420992  0.49081071]
 [ 0.          0.70922139 -0.67751324  0.23295284 -6.89597681]
 [ 0.          0.          0.89988635 -0.67437444 -3.85334538]
 [ 0.          0.          0.          0.72994649  6.24915248]
 [ 0.          0.          0.          0.          3.76407547]]
[[ 0.63543472 -4.9542323  -3.48037026  5.00400095  2.90737741]
 [ 0.          4.90166957  1.74700377 -3.69934856 -2.35698345]
 [ 0.          0.          0.97357464 -0.71268093 -0.27514477]
 [ 0.          0.          0.          0.5929664   0.09843469]
 [ 0.          0.          0.          0.          0.27323201]]

I think the code is correct, but the documentation needs to be corrected. The same issue applies to BayesianGaussianMixture.


amueller · 2017-11-14T22:41:32Z

My matrix algebra seems very rusty :-/

np.linalg.inv(S)
np.dot(IC, IC.T)
np.dot(CI.T, CI)

all produce the same result, which makes me think there's just some transpose somewhere?

amueller · 2017-11-14T22:50:32Z

Ah, never mind. IC is not a cholesky because it's upper triangular times lower triangular. So yes, seems like an issue with the docs.
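The point above can be checked numerically. The following is a minimal sketch that reuses the matrices from the original post (the names `L`, `P`, `same_ic`, `same_ci` and `factors_equal` are introduced here for illustration only): both factors reproduce the precision matrix, but through different products, so they are different matrices.

```python
import numpy as np

rng = np.random.RandomState(0)
S = rng.randn(5, 5)
S = S @ S.T  # symmetric positive definite covariance

L = np.linalg.cholesky(S)                    # lower triangular, S = L @ L.T
IC = np.linalg.inv(L.T)                      # inverse of the Cholesky factor (upper triangular)
CI = np.linalg.cholesky(np.linalg.inv(S)).T  # transposed Cholesky factor of the inverse (upper triangular)

P = np.linalg.inv(S)                         # precision matrix

# IC is upper triangular and satisfies IC @ IC.T == P (upper times lower).
same_ic = np.allclose(IC @ IC.T, P)
# CI is upper triangular and satisfies CI.T @ CI == P (lower times upper).
same_ci = np.allclose(CI.T @ CI, P)
# The two factors themselves are not equal.
factors_equal = np.allclose(IC, CI)

print(same_ic, same_ci, factors_equal)
```

Both matrices are upper triangular, so the difference lies only in the order of the product that recovers the precision matrix.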

jnothman · 2017-11-14T23:11:20Z

Please feel free to offer a PR with a change in the docs.

FarahSaeed · 2017-11-16T05:00:45Z

Hi. In the _initialize method, lines 644 - 647 of gaussian_mixture.py compute the Cholesky decomposition of the precision matrices (which I think is the Cholesky of the inverse). Isn't that different from the implementation in _compute_precision_cholesky, which takes the inverse of the Cholesky?

lesteve · 2017-11-16T14:50:55Z

@FarahSaeed friendly advice: for clarity's sake link to code like this.

In your particular example, I guess you mean

scikit-learn/sklearn/mixture/gaussian_mixture.py

Lines 644 to 647 in e260119

    
        elif self.covariance_type == 'full':
            self.precisions_cholesky_ = np.array(
                [linalg.cholesky(prec_init, lower=True)
                 for prec_init in self.precisions_init])

?

FarahSaeed · 2017-11-16T15:30:47Z

@lesteve yeah exactly.

rdturnermtl · 2017-11-16T16:13:57Z

Yeah that definitely looks like an inconsistency in the code. Maybe it would make more sense to take the Cholesky than the precision for the initialization.

Is there a good way to fix this without creating backwards compatibility issues?
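To make the two paths discussed in this thread concrete, here is a rough NumPy sketch. It is not the scikit-learn code: `factor_from_covariance` and `factor_from_precision` are hypothetical helpers that only imitate what the thread describes (the inverse of the covariance's Cholesky factor versus the lower Cholesky factor of a user-supplied precision matrix). Under these assumptions the two factors differ as matrices, yet each satisfies F @ F.T == precision.

```python
import numpy as np


def factor_from_covariance(cov):
    """Hypothetical helper: transposed inverse of the covariance's Cholesky factor."""
    chol = np.linalg.cholesky(cov)   # lower triangular, cov = chol @ chol.T
    return np.linalg.inv(chol).T     # upper triangular


def factor_from_precision(prec):
    """Hypothetical helper: lower Cholesky factor of the precision matrix."""
    return np.linalg.cholesky(prec)  # lower triangular, prec = F @ F.T


rng = np.random.RandomState(0)
A = rng.randn(4, 4)
cov = A @ A.T + 4 * np.eye(4)        # well-conditioned positive definite covariance
prec = np.linalg.inv(cov)

F_cov = factor_from_covariance(cov)
F_prec = factor_from_precision(prec)

print(np.allclose(F_cov, F_prec))             # are the factors the same matrix?
print(np.allclose(F_cov @ F_cov.T, prec))     # does the first reproduce the precision?
print(np.allclose(F_prec @ F_prec.T, prec))   # does the second reproduce the precision?
```

If downstream code only ever uses the factor through the product F @ F.T (or through squared norms of the form `(x - mu) @ F`), either factor gives the same result. Whether that holds for every use in the library is not something this sketch establishes.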

aby0 · 2017-12-05T10:56:06Z

@jnothman As far as I understand, we generally use the Cholesky decomposition to compute the inverse of a positive definite matrix efficiently (better than an LU decomposition), so taking the Cholesky of the covariance makes sense in order to eventually compute the precision matrix. So it might just be a documentation error. If so, can I go ahead and make the changes in the documentation? Thanks
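As an illustration of that efficiency argument, here is a minimal sketch, assuming a single Gaussian component with made-up `cov`, `mu` and `x`. It shows that the Cholesky factor of the covariance gives both the Mahalanobis distance and the log-determinant without ever forming the precision matrix explicitly. `np.linalg.solve` is a general solver used here for simplicity; a dedicated triangular solver would exploit the structure of `L`.

```python
import numpy as np

rng = np.random.RandomState(0)
A = rng.randn(4, 4)
cov = A @ A.T + 4 * np.eye(4)   # positive definite covariance (illustrative data)
mu = rng.randn(4)
x = rng.randn(4)
d = x - mu

L = np.linalg.cholesky(cov)     # cov = L @ L.T

# Mahalanobis distance: d^T cov^{-1} d = ||L^{-1} d||^2
z = np.linalg.solve(L, d)
maha_chol = z @ z
maha_direct = d @ np.linalg.inv(cov) @ d

# Log-determinant: log det(cov) = 2 * sum(log(diag(L)))
logdet_chol = 2.0 * np.sum(np.log(np.diag(L)))
sign, logdet_direct = np.linalg.slogdet(cov)

print(np.isclose(maha_chol, maha_direct), np.isclose(logdet_chol, logdet_direct))
```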

jnothman · 2017-12-06T10:32:21Z

This isn't my expertise, @aby0, but submit a PR and the change will certainly be considered!

jnothman added Documentation help wanted labels Nov 14, 2017

cmarmo added the module:mixture label Nov 30, 2021
