LARS fixes: intercept, normalization, bugfix #3493
Conversation
```diff
   gamma = val1;
-  if ((val2 > 0.0) && (val2 < gamma))
+  if ((val2 >= 0.0) && (val2 < gamma))
```
This is the tiny relaxation that is the bugfix part of this PR.
```c++
  gamma = val1;
  if ((val2 >= 0.0) && (val2 < gamma))
    gamma = val2;
}
```
It turns out the "tiny relaxation" is not so tiny: on LARS iterations where we are using the LASSO modification, we sometimes remove a feature because its sign changes (this is Eq. 3.6). However, we need to avoid considering that feature again on the next iteration, so we can't use the relaxation that allows gamma (the step size) to become 0, since the removed feature's correlation should be exactly zero.

I also had one more thing to fix here:
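A minimal sketch of the condition being discussed (the names `UpdateGamma`, `val1`, `val2`, and `justDropped` are hypothetical, not mlpack's actual code): `val1` and `val2` are step-size candidates for one feature, and the relaxed `>= 0.0` test must be suppressed on the iteration right after a feature is dropped, because the dropped feature's candidate is exactly zero.

```cpp
// Sketch, under the assumptions above: update the running minimum step size
// gamma given two candidates for one feature. When a feature was just
// dropped by the LASSO modification (Eq. 3.6), a zero candidate must be
// rejected, or the dropped feature would rejoin with a zero step.
double UpdateGamma(double gamma, double val1, double val2, bool justDropped)
{
  if ((val1 > 0.0) && (val1 < gamma))
    gamma = val1;

  // The relaxation from this PR: allow val2 == 0 so a tied feature can
  // enter with a zero step, but not right after a drop.
  const bool valid = justDropped ? (val2 > 0.0) : (val2 >= 0.0);
  if (valid && (val2 < gamma))
    gamma = val2;

  return gamma;
}
```

With `justDropped == false` a zero candidate wins (zero step, tied feature enters immediately); with `justDropped == true` it is skipped and the next smallest positive candidate is used.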
Second approval provided automatically after 24 hours. 👍
This is the end of a very deep rabbit hole. I thought that #3442 was a result of LARS not supporting fitting an intercept or normalizing data to unit variance. That's actually not the case, but I only found that out after writing the support, so here it is.
Deep inside the changes is a minor bugfix that takes a little while to explain:
LARS is an iterative algorithm that adds one feature to the model in each iteration. It selects the best feature to add based on the residuals, and then selects a step size for how much of that feature to add (so to speak). The paper's rule for choosing the step size, Equation 2.13, forces a strictly positive (nonzero) step size. The step size is chosen such that we step exactly to the point where the next feature should be added.
However, it can sometimes happen that two different features have the exact same residual value, and could equally be chosen as the next feature to add to the model. In this case, the step size should technically be 0: we should add both features at once. I say "technically" because the paper does not actually clarify this, so my claim there is somewhat rooted in opinion, and I think it's possible to disagree on reasonable grounds.
Anyway, the fix implied by my opinion is to relax Equation 2.13 to allow a zero step size, and thus what effectively happens is that the model can add two features at once.
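To illustrate why the relaxation matters, here is a hedged sketch of one of the two Equation 2.13 candidate families, (C − c_j) / (A − a_j), where `maxCorr` is the current maximum absolute correlation, `corrs[j]` is an inactive feature's correlation, and `dirCorr`/`dirCorrs[j]` are the corresponding equiangular-direction values. The function name and signature are illustrative, not mlpack's API. A feature tied with the maximum correlation yields a candidate of exactly 0, which the strict "min over positive candidates" rule skips but the relaxed rule accepts.

```cpp
#include <cstddef>
#include <limits>
#include <vector>

// Illustrative sketch: compute the smallest valid Eq. 2.13 step-size
// candidate over the inactive features. With allowZero == false this is
// the paper's strict min+ rule; with allowZero == true it is the relaxed
// rule from this PR, which lets a tied feature force a zero step.
double Gamma(double maxCorr,
             double dirCorr,
             const std::vector<double>& corrs,
             const std::vector<double>& dirCorrs,
             bool allowZero)
{
  double gamma = std::numeric_limits<double>::max();
  for (size_t j = 0; j < corrs.size(); ++j)
  {
    const double cand = (maxCorr - corrs[j]) / (dirCorr - dirCorrs[j]);
    // A feature with corrs[j] == maxCorr gives cand == 0 exactly.
    if (((cand > 0.0) || (allowZero && cand == 0.0)) && (cand < gamma))
      gamma = cand;
  }
  return gamma;
}
```

For example, with a tied feature (candidate 0) and another feature at candidate 0.5, the strict rule returns 0.5 and steps past the tie, while the relaxed rule returns 0 so both tied features enter at once.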
This fixes the newly-added KKT checks from @gscarella, which I adapted to add to the LARS test suite.
About the intercept and normalization support: I enable them by default since the paper's theory assumes that. There's a little bit of extra accounting to make sure predictions are still right when this happens.
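The "extra accounting" can be sketched as follows (all names here are hypothetical, not the PR's actual code): coefficients learned on centered, scaled features have to be mapped back to the original data scale, and the intercept recovered from the means, so that predictions on unstandardized inputs are unchanged.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Sketch, under the assumptions above: given coefficients fit on data where
// each feature was centered by colMeans[i] and divided by colScales[i], and
// the responses were centered by yMean, recover original-scale coefficients
// and the intercept.
std::pair<std::vector<double>, double> Unstandardize(
    const std::vector<double>& betaStd,   // coefficients on scaled data
    const std::vector<double>& colMeans,  // per-feature means
    const std::vector<double>& colScales, // per-feature scale factors
    double yMean)                         // mean of the responses
{
  std::vector<double> beta(betaStd.size());
  double intercept = yMean;
  for (size_t i = 0; i < beta.size(); ++i)
  {
    beta[i] = betaStd[i] / colScales[i]; // undo the feature scaling
    intercept -= colMeans[i] * beta[i];  // undo the centering
  }
  return { beta, intercept };
}
```

The recovered model predicts `intercept + beta . x` on raw inputs, matching `yMean + betaStd . xStandardized` on the transformed ones.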