[MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) #7849

jmontoyam · 2016-11-09T18:38:57Z

This pull request fixes issue #7778: sklearn LassoLars implementation does not give the same result that the LassoLars implementation available in R (lars library).

What does this implement/fix? Explain your changes.

This mismatch is due to a bug in the file least_angle.py, in lines 406 and 407 (lars_path method).
The bug is related to the way these two lines use the np.resize function.
According to the docs of the np.resize function, If the new array is larger than the original array, then the new array is filled with repeated copies of a, which causes a subtle error, because in the lars_path implementation those new values are used as if they were equal to zero.

jmontoyam · 2016-11-09T18:53:41Z

@agramfort, @tguillemot, thank you very much for all your help and advices. This is my first pull request to sklearn, I'm very happy to start contributing to this amazing project!

raghavrv · 2016-11-09T19:23:20Z

@jmontoyam Yohooo! Looking forward to more such awesome PRs :)

raghavrv · 2016-11-09T19:23:42Z

sklearn/linear_model/least_angle.py

-                coefs = np.resize(coefs, (n_iter + add_features, n_features))
-                alphas = np.resize(alphas, n_iter + add_features)
+                coefs = np.concatenate((coefs,
+                                       np.zeros((add_features, n_features))),


Could you add a space before np... to match the indentation

@raghavrv Thank you very much!, I have added the space you suggested me ;)

jmontoyam

@raghavrv Thank you very much!, I have added the space you suggested me ;)

jmontoyam · 2016-11-09T19:35:47Z

sklearn/linear_model/least_angle.py

-                coefs = np.resize(coefs, (n_iter + add_features, n_features))
-                alphas = np.resize(alphas, n_iter + add_features)
+                coefs = np.concatenate((coefs,
+                                       np.zeros((add_features, n_features))),


@raghavrv Thank you very much!, I have added the space you suggested me ;)

agramfort · 2016-11-10T09:45:05Z

sklearn/linear_model/least_angle.py

-                alphas = np.resize(alphas, n_iter + add_features)
+                coefs = np.concatenate((coefs,
+                                        np.zeros((add_features, n_features))),
+                                       axis=0)


can you bench if this is faster:

coefs = np.resize(coefs, (n_iter + add_features, n_features))
coefs[:-add_features, :] = 0.

I think you mean:
coefs[-add_features:] = 0

The version you propose is a little bit faster (but the difference is very tiny, it is approx. 0.2ms faster according to my toy benchmark).

I will modify the code following your advice ;)

Thanks!

agramfort · 2016-11-10T09:46:07Z

thx @jmontoyam

can you check why the tests don't pass?

agramfort · 2016-11-10T20:20:31Z

LGTM

can you just update what's new page to document the bug fix?

thanks heaps !

srmcc · 2016-11-10T21:16:38Z

Thank you for figuring this out! It's so much help...

jmontoyam · 2016-11-11T13:21:27Z

@agramfort @raghavrv
Please help me:
Conflicting files: doc/whats_new.rst is
This is because this file has changed since the time I made the PR.
Please excuse me!, I know this is a very beginner question: what do you suggest me to do to solve this error?
If I update the upstream and at the same time some else modify again this file, there will be a conflict again, right?

jmontoyam · 2016-11-11T13:42:02Z

@agramfort @raghavrv
I think I have solved the conflict...but I don't know if my solution was the cleanest one :)

Review is out of date.

NelleV · 2016-11-12T03:01:36Z

Thanks!

raghavrv · 2016-11-12T03:07:56Z

@amueller Maybe you can sqeeze this into 0.18.1

tguillemot · 2016-11-13T10:34:53Z

@jmontoyam Congrats for your first PR !!!

jmontoyam · 2016-11-13T17:33:51Z

@tguillemot , thank you very much for all the advices and suggestions!, and for answering very kindly all the beginner questions that I asked you :)

…-learn#7778) (scikit-learn#7849) * Fix bug 7778 * Add test_lasso_lars_vs_R_implementation * Add a space to match the indentation * Solve E501 line too long (80 > 79 characters) * assert_array_almost_equal up to 12 decimals * Tiny modification for increasing performance * Update what's new page * Trying to solve conflicts * Solve conflict in doc/whats_new.rst

jmontoyam added 2 commits November 9, 2016 15:14

Fix bug 7778

d3bbe03

Add test_lasso_lars_vs_R_implementation

2c96607

raghavrv changed the title ~~Fix bug 7778~~ [MRG] Fix numerical instability in LassoLars when alpha=0 (#7778) Nov 9, 2016

raghavrv added this to the 0.19 milestone Nov 9, 2016

raghavrv added the Bug label Nov 9, 2016

jmontoyam mentioned this pull request Nov 9, 2016

linear_model.LassoLars algorithm/documentation is incomplete for alpha=0 #7778

Closed

raghavrv previously requested changes Nov 9, 2016

View reviewed changes

Add a space to match the indentation

73524d8

jmontoyam commented Nov 9, 2016

View reviewed changes

agramfort reviewed Nov 10, 2016

View reviewed changes

jmontoyam added 3 commits November 10, 2016 10:54

Solve E501 line too long (80 > 79 characters)

c0b4e81

assert_array_almost_equal up to 12 decimals

88792ec

Tiny modification for increasing performance

60bdf50

raghavrv changed the title ~~[MRG] Fix numerical instability in LassoLars when alpha=0 (#7778)~~ [MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) Nov 10, 2016

Update what's new page

04cba8d

jmontoyam added 3 commits November 11, 2016 14:30

Trying to solve conflicts

9d979bf

Merge branch 'master' into fix_bug_7778

fdad17a

Solve conflict in doc/whats_new.rst

8284fe7

NelleV merged commit 5230382 into scikit-learn:master Nov 12, 2016

raghavrv modified the milestones: 0.18.1, 0.19 Nov 12, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) #7849

[MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) #7849

jmontoyam commented Nov 9, 2016

jmontoyam commented Nov 9, 2016

raghavrv commented Nov 9, 2016

raghavrv Nov 9, 2016

jmontoyam Nov 9, 2016

jmontoyam left a comment

jmontoyam Nov 9, 2016

agramfort Nov 10, 2016

jmontoyam Nov 10, 2016

agramfort commented Nov 10, 2016

agramfort commented Nov 10, 2016

srmcc commented Nov 10, 2016

jmontoyam commented Nov 11, 2016

jmontoyam commented Nov 11, 2016

NelleV commented Nov 12, 2016

raghavrv commented Nov 12, 2016

tguillemot commented Nov 13, 2016

jmontoyam commented Nov 13, 2016

[MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) #7849

[MRG + 1] Fix numerical instability in LassoLars when alpha=0 (#7778) #7849

Conversation

jmontoyam commented Nov 9, 2016

What does this implement/fix? Explain your changes.

jmontoyam commented Nov 9, 2016

raghavrv commented Nov 9, 2016

raghavrv Nov 9, 2016

Choose a reason for hiding this comment

jmontoyam Nov 9, 2016

Choose a reason for hiding this comment

jmontoyam left a comment

Choose a reason for hiding this comment

jmontoyam Nov 9, 2016

Choose a reason for hiding this comment

agramfort Nov 10, 2016

Choose a reason for hiding this comment

jmontoyam Nov 10, 2016

Choose a reason for hiding this comment

agramfort commented Nov 10, 2016

agramfort commented Nov 10, 2016

srmcc commented Nov 10, 2016

jmontoyam commented Nov 11, 2016

jmontoyam commented Nov 11, 2016

NelleV commented Nov 12, 2016

raghavrv commented Nov 12, 2016

tguillemot commented Nov 13, 2016

jmontoyam commented Nov 13, 2016