correcting information criterion calculation in least_angle.py #6080
Conversation
The information criterion calculation is not compatible with the original paper: Zou, Hui, Trevor Hastie, and Robert Tibshirani. "On the 'degrees of freedom' of the lasso." The Annals of Statistics 35.5 (2007): 2173-2192.
@mehmetbasbug sorry for the slow reply. @agramfort @GaelVaroquaux do you have any input on this?
Sorry @mehmetbasbug for the slow reaction. Can you point me to the equation you refer to?
Eq. 2.15 and Eq. 2.16
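For readers following along, Eq. 2.15 and 2.16 of the paper define AIC and BIC in their per-sample form. A minimal sketch of those formulas (illustrative only, not scikit-learn code; `aic_bic_lasso` is a hypothetical helper name, and the noise variance `sigma2` is assumed known or estimated separately):

```python
import numpy as np

def aic_bic_lasso(y, y_hat, df, sigma2):
    """AIC/BIC as in Zou, Hastie & Tibshirani (2007), Eq. 2.15-2.16.

    df is the estimated degrees of freedom (for the lasso, the number
    of nonzero coefficients); sigma2 is the noise variance.
    """
    n = len(y)
    rss = np.sum((y - y_hat) ** 2)
    aic = rss / (n * sigma2) + (2.0 / n) * df
    bic = rss / (n * sigma2) + (np.log(n) / n) * df
    return aic, bic

# toy usage: null fit, df=0, so AIC and BIC coincide
rng = np.random.default_rng(0)
y = rng.normal(size=50)
aic, bic = aic_bic_lasso(y, np.zeros(50), df=0, sigma2=1.0)
```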
@@ -1424,6 +1424,7 @@ def fit(self, X, y, copy_X=True):
        R = y[:, np.newaxis] - np.dot(X, coef_path_)  # residuals
        mean_squared_error = np.mean(R ** 2, axis=0)
+       sigma = np.var(y)
call it sigma2
@@ -1437,7 +1438,7 @@ def fit(self, X, y, copy_X=True):
        self.alphas_ = alphas_
        with np.errstate(divide='ignore'):
-           self.criterion_ = n_samples * np.log(mean_squared_error) + K * df
+           self.criterion_ = n_samples * mean_squared_error / sigma + K * df
To me it should be

mean_squared_error / sigma2 + K * df

given that K is 2 or log(n). Ideally everything should be divided by n.
Did you check how it affects the example using BIC and AIC?
If you want to divide everything by n, then we should have

mean_squared_error / sigma2 + K * df / n_samples
It was a long while ago but I remember getting the same results as the R package from the authors when I used the corrected version.
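As a side note, the "total" and "per-sample" forms being discussed differ only by a factor of n, so both select the same model along the path. A quick sanity check with toy values (the arrays below are illustrative, not scikit-learn internals):

```python
import numpy as np

n_samples = 100
K = 2.0  # 2 for AIC, log(n_samples) for BIC
sigma2 = 1.0
# toy path: shrinking error as degrees of freedom grow
mean_squared_error = np.array([1.50, 1.10, 0.95, 0.94])
df = np.array([0, 1, 2, 3])

total = n_samples * mean_squared_error / sigma2 + K * df
per_sample = mean_squared_error / sigma2 + K * df / n_samples

# the two forms differ by the constant factor n_samples,
# so they pick the same model
assert np.allclose(total, n_samples * per_sample)
assert np.argmin(total) == np.argmin(per_sample)
```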
OK, agreed. Can you see why Travis is not happy, and add a test that compares with the R solution? Thanks.
Thanks for your reply to #7692. In general, it is hard to estimate sigma2 with a single formula for the lasso. I looked at the R implementation of selectiveInference (page 12 of its manual):

    Estimate of error standard deviation. If NULL (default), this is estimated using
    the mean squared residual of the full least squares fit when n >= 2p, and using
    the standard deviation of y when n < 2p.

So if n >= 2p, it should be mean_squared_error * n_samples / (n_samples - df - 1); if n < 2p, it should be var(y).
For the diabetes data in the Travis test, n >= 2p, so the first formula should be used. This is probably why Travis is not happy.
Also, sigma2 should be moved to the second term as a multiplicative factor; dividing by sigma2 is numerically unstable.
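The rule quoted from the selectiveInference manual could be sketched as follows (an assumption-laden sketch; `estimate_sigma2` is a hypothetical helper, not part of scikit-learn or selectiveInference, and the residual degrees of freedom n - p - 1 follow the formula above with df = p for the full fit):

```python
import numpy as np

def estimate_sigma2(X, y):
    """Noise-variance estimate per the selectiveInference rule:
    mean squared residual of the full least-squares fit when
    n >= 2p, the variance of y otherwise."""
    n, p = X.shape
    if n >= 2 * p:
        coef, *_ = np.linalg.lstsq(X, y, rcond=None)
        resid = y - X @ coef
        # divide by the residual degrees of freedom, n - p - 1
        return np.sum(resid ** 2) / (n - p - 1)
    return np.var(y)
```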
Also, it breaks the tests. Please see the Travis errors.
@mehmetbasbug @yuachen follow up on #9022 please review if you have time, thanks