
Modifications to PCovR #98

Merged

Conversation

bhelfrecht
Contributor

This PR makes a few tweaks to PCovR to address issues #83 and #81 and to clean up the fitting.

  • The PCovR instantiation has been changed to do away with the alpha parameter, since it never appeared to be passed to the regressor. Instead, one simply instantiates PCovR with a scikit-learn LinearRegression or Ridge object.
  • The Yhat and W parameters have been removed from fit, since the same effect can be achieved by instantiating PCovR with a fitted regressor. This also guarantees that Yhat is consistent with W, which was not the case before.
  • The handling of the singular values in _fit_sample_space has been made consistent with that in _fit_feature_space: the inverse singular values are set to zero wherever the corresponding singular values are less than tol.
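
The tol handling in the last bullet amounts to a truncated inverse of the singular spectrum. A minimal NumPy sketch of the idea (illustrative only; tol and the variable names are placeholders, not the actual skcosmo code):

```python
import numpy as np

# Toy matrix whose two columns are nearly identical, so one singular
# value is effectively zero.
X = np.array([[1.0, 1.0], [1.0, 1.0 + 1e-12]])

U, S, Vt = np.linalg.svd(X, full_matrices=False)

tol = 1e-8
# Invert only the singular values above tol; set the rest to zero
# instead of producing huge, noise-dominated values.
S_inv = np.zeros_like(S)
S_inv[S > tol] = 1.0 / S[S > tol]

# S_inv then acts like a truncated pseudo-inverse of the spectrum.
X_pinv = Vt.T @ np.diag(S_inv) @ U.T
```

This is the same cutoff behavior that numpy.linalg.pinv implements through its rcond argument.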

KPCovR should also be modified to use this "bring your own regressor" initialization, and all fit calls for PCovR-related objects should probably use a lowercase y to be consistent with sklearn. These future improvements could be rolled into this PR or given their own PRs.
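The consistency argument behind removing Yhat and W can be illustrated with scikit-learn alone (a minimal sketch; PCovR itself is not imported, and the variable names are illustrative):

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.RandomState(0)
X = rng.rand(20, 5)
Y = rng.rand(20, 2)

# "Bring your own regressor": both W and Yhat are derived from one fitted
# regressor, so they cannot drift apart the way independently supplied
# Yhat and W arguments could.
regressor = Ridge(alpha=1e-3, fit_intercept=False).fit(X, Y)
W = regressor.coef_.T        # weights, shape (n_features, n_targets)
Yhat = regressor.predict(X)  # predictions implied by those same weights
```

For a linear model without an intercept, Yhat == X @ W holds by construction, which is exactly the consistency the old two-argument interface could not enforce.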

@bhelfrecht bhelfrecht marked this pull request as ready for review May 3, 2021 10:17
@bhelfrecht bhelfrecht requested a review from rosecers May 3, 2021 10:18
Collaborator

@rosecers rosecers left a comment


Generally looks good, but I have a design question: why use the name regressor as opposed to the sklearn-style estimator?

[Review threads on skcosmo/utils/_pcovr_utils.py (resolved)]
@bhelfrecht
Contributor Author

> Generally looks good, but I have a design question: why use the name regressor as opposed to the sklearn-style estimator?

estimator is rather general, and can also refer to classifiers -- in PCovR, we specifically require a regressor, hence the name choice. The use of regressor is also consistent with sklearn's TransformedTargetRegressor.

@rosecers
Collaborator

rosecers commented May 3, 2021

> Generally looks good, but I have a design question: why use the name regressor as opposed to the sklearn-style estimator?

> estimator is rather general, and can also refer to classifiers -- in PCovR, we specifically require a regressor, hence the name choice. The use of regressor is also consistent with sklearn's TransformedTargetRegressor.

Okay! I just wanted to make sure there was a concrete reason for doing so

@rosecers rosecers self-requested a review May 3, 2021 15:16
Collaborator

@rosecers rosecers left a comment


Letting it get one more pass from @Luthaf before we merge, but looks ready to me!

Collaborator

@Luthaf Luthaf left a comment


The code looks good overall! I have one small request =)

Comment on lines +135 to +136

```python
UC = UC.T[:, (vC ** 2) > rcond]
vC = vC[(vC ** 2) > rcond]
```
Collaborator


Could you add a test hitting this code path? Even something simple with toy X/Y matrices.
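
Such a test might look roughly like the following (a hypothetical sketch using plain NumPy to mirror the filtering above; a real test would go through the skcosmo API):

```python
import numpy as np

rcond = 1e-12

# Two nearly collinear features, so the covariance has one near-zero
# eigenvalue whose square falls below rcond.
X = np.array([[1.0, 1.0], [1.0, 1.0], [1.0, 1.0 + 1e-10]])
C = X.T @ X

vC, UC = np.linalg.eigh(C)
# Mirror the snippet under review: keep only well-conditioned components.
UC = UC[:, (vC ** 2) > rcond]
vC = vC[(vC ** 2) > rcond]
```

After filtering, only the single well-conditioned component should remain.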

Collaborator


@bhelfrecht, codecov is still not seeing these lines in the coverage report, which is really weird since you explicitly seeded the RNG ... Maybe different versions of numpy have different RNGs? CI is using numpy == 1.20.2.

Contributor Author


Just fixed it -- there was a typo in the test that made it bypass the targeted code.

@Luthaf
Collaborator

Luthaf commented May 4, 2021

Could you try to rebase & cleanup the git history? I can help to do this if you want!

@bhelfrecht
Contributor Author

I think I've got it; it depends how messy it gets. How "clean" should it be? For starters, I assume all the silly "Formatting" commits should be fixup'ed into the previous commit?

@rosecers
Collaborator

rosecers commented May 4, 2021

Why not just fixup the formatting commits and then do squash and merge?

@bhelfrecht
Contributor Author

Perfect, will do.

@Luthaf
Collaborator

Luthaf commented May 4, 2021

Fixup'ing the formatting commits and removing the merge commits is enough for me; then we can keep the rest of the history around and use a normal merge or rebase-and-merge.

If we go for a squash merge on the GitHub side, there is no need to spend time on fixups, since everything will become a single commit anyway.

@rosecers
Collaborator

rosecers commented May 4, 2021

> Fixup'ing the formatting commits and removing the merge commits is enough for me; then we can keep the rest of the history around and use a normal merge or rebase-and-merge.

> If we go for a squash merge on the GitHub side, there is no need to spend time on fixups, since everything will become a single commit anyway.

fixup removes the unnecessary commit messages from the history. I'm still for fixup + squash-and-merge.

@Luthaf
Collaborator

Luthaf commented May 4, 2021

When we squash and merge on GitHub we can edit the commit message as we want, removing unnecessary commit messages if needed.
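
For reference, the fixup workflow discussed above can be sketched as a hypothetical shell session (the repository, file, commit messages, and demo identity are all illustrative; assumes git is available):

```shell
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name demo
git config user.email demo@example.com

git commit -q --allow-empty -m "Initial commit"

# A real change...
echo "fit things" > pcovr.py
git add pcovr.py
git commit -q -m "Clean up PCovR fitting"

# ...followed by a formatting-only change, recorded as a fixup of it.
echo "fit things, formatted" > pcovr.py
git add pcovr.py
git commit -q --fixup HEAD

# Fold the fixup into its target non-interactively; GIT_SEQUENCE_EDITOR=:
# accepts the autosquash-reordered todo list as-is.
GIT_SEQUENCE_EDITOR=: git rebase -q -i --autosquash HEAD~2
```

After the rebase, the fixup commit and its message are gone, leaving only "Initial commit" and "Clean up PCovR fitting"; a squash-and-merge on GitHub would instead collapse everything into one commit with an editable message.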
