ENH add dtype preservation to FactorAnalysis #24321

svenstehle · 2022-09-02T10:01:51Z

Reference Issues/PRs

In scope of #11000
Continuing work of @thibsej already started in #13303

What does this implement/fix? Explain your changes.

Added dtype preservation to FactorAnalysis

Any other comments?

The test passes, but did we also check the numerical stability? PR #13303 added a test for this, ~~is this obsolete with our new check_transformer_preserve_dtypes?~~

No - we still need to check for numerical stability.

sklearn/tests/test_common.py::test_estimators[FactorAnalysis()-check_transformer_preserve_dtypes] PASSED

glemaitre

It was quite complex to make a review. Instead, I pushed the changes directly into the branch. So what changed is the following:

improve the test to check the dtype of the fitted attribute and public function
check the equivalence of the transform function between 32 and 64 bits

The latter point is difficult. Doing it I saw 2 other things to change:

a bug where the stopping criterion was not the absolute value
a SMALL value that was hard coded and actually too small for 32 bits. So I used eps instead depending of the input type.

glemaitre · 2022-12-29T15:19:25Z

We can now check if the tests are passing on all platforms.

jeremiedbb · 2022-12-29T17:15:30Z

1e-1 is really high tolerance. It's almost like having no test at all. I'm not sure we should even add such a test. To me there are 2 possibilities:

We care that the results should really be close between float32 and float64. In that case we can't say that this estimator preserves dtype and we force conversion to float64.
We consider that preserving dtype essentially means that all computations are done without conversion and it's not so bad that the results between float32 and float64 are not close (up to a reasonable tol).

This PR in its current state assumes the 2nd option. I could live with that, but I'd like the opinion of other devs before merging.

svenstehle and others added 4 commits September 2, 2022 11:58

[skip ci] open pr

ae51e54

Merge branch 'main' into enh_add_dtype_preservation_to_FactorAnalysis

e88872b

add dtype preservation to FactorAnalysis, ensure tests with _more_tags

e3bfae3

add failing test that checks for numerical stability

4554494

cmarmo added Waiting for Reviewer module:decomposition labels Oct 22, 2022

glemaitre changed the title ~~[ENH] add dtype preservation to FactorAnalysis~~ ENH add dtype preservation to FactorAnalysis Nov 4, 2022

glemaitre self-requested a review December 29, 2022 10:48

glemaitre added 4 commits December 29, 2022 11:51

style

de5499b

Merge remote-tracking branch 'origin/main' into pr/svenstehle/24321

5ee1d12

iter

e2616d5

DOC update changelog

80b6ade

glemaitre reviewed Dec 29, 2022

View reviewed changes

glemaitre approved these changes Dec 29, 2022

View reviewed changes

glemaitre added Waiting for Second Reviewer First reviewer is done, need a second one! and removed Waiting for Reviewer labels Dec 29, 2022

jeremiedbb added 2 commits December 29, 2022 17:32

check lower tol

5b38332

revert

b24bd01

cln

b5eda98

thomasjpfan mentioned this pull request Feb 16, 2023

Float32 support factor analysis #13303

Closed

jeremiedbb added Needs Decision Requires decision and removed Waiting for Second Reviewer First reviewer is done, need a second one! labels Feb 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH add dtype preservation to FactorAnalysis #24321

ENH add dtype preservation to FactorAnalysis #24321

svenstehle commented Sep 2, 2022 •

edited by glemaitre

glemaitre left a comment

glemaitre commented Dec 29, 2022

jeremiedbb commented Dec 29, 2022

ENH add dtype preservation to FactorAnalysis #24321

Are you sure you want to change the base?

ENH add dtype preservation to FactorAnalysis #24321

Conversation

svenstehle commented Sep 2, 2022 • edited by glemaitre

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

glemaitre left a comment

Choose a reason for hiding this comment

glemaitre commented Dec 29, 2022

jeremiedbb commented Dec 29, 2022

svenstehle commented Sep 2, 2022 •

edited by glemaitre