LogisticRegression convert to float64 #8769

Closed
GaelVaroquaux opened this issue Apr 20, 2017 · 14 comments · Fixed by #8835
Labels: Enhancement, help wanted, Moderate (anything that requires some knowledge of conventions and best practices)

Comments

GaelVaroquaux (Member) commented Apr 20, 2017
Looking at the code of LogisticRegression, I have just noticed that it automatically converts the data to float64. I would expect at least the SAG, SAGA, newton-cg and lbfgs solvers to be able to work with float32.

Also, in a similar line of thought, the code converts to C-ordered data. How useful / necessary is this for solvers other than liblinear?

I am asking these questions because it seems that we could reduce the memory footprint of LogisticRegression.
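
For a concrete illustration of the current behaviour, here is a minimal sketch (the dataset shape and solver choice below are arbitrary):

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression

    # float32 input is currently upcast to float64 before fitting
    X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
    X = X.astype(np.float32)

    clf = LogisticRegression(solver='sag', max_iter=1000).fit(X, y)
    # prints float64 today; staying in float32 would roughly halve the
    # memory needed for the internal copy of the data
    print(clf.coef_.dtype)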

Ping @arthurmensch for the SAG part.

GaelVaroquaux added the Enhancement, Moderate and Need Contributor labels on Apr 20, 2017
massich (Contributor) commented Apr 20, 2017

I can work on this

arthurmensch (Contributor) commented Apr 20, 2017 via email

mblondel (Member) commented:
Maybe not for L-BFGS:
scipy/scipy#4873
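
For context, a small sketch of the limitation tracked in that scipy issue (the toy objective below is only illustrative):

    import numpy as np
    from scipy.optimize import fmin_l_bfgs_b

    # The underlying L-BFGS-B routine works in double precision, so a float32
    # starting point is upcast and the solution comes back as float64.
    def f_and_grad(w):
        return float(np.sum(w ** 2)), 2.0 * w

    w0 = np.zeros(5, dtype=np.float32)
    w_opt, f_min, info = fmin_l_bfgs_b(f_and_grad, w0)
    print(w_opt.dtype)  # float64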

arthurmensch (Contributor) commented Jun 7, 2017 via email

massich (Contributor) commented Jun 7, 2017

Just to keep track of the effort:

massich (Contributor) commented Jun 7, 2017

@arthurmensch, if you take care of the fused types in sag.pyx (see PR #9020), make sure to talk to @Henley13.

ogrisel (Member) commented Jun 7, 2017

saga and sag should indeed be addressed in the same PR.

massich (Contributor) commented Jun 9, 2017

@TomDLT suggested adding float32 / float64 support for _preprocess_data here.

vene (Member) commented Jun 11, 2017

Can we add isotonic regression to that list? I'll be happy to take it.

massich (Contributor) commented Jun 11, 2017 via email

massich (Contributor) commented Jun 14, 2017

@ogrisel suggested in #9033 to use [np.float64, np.float32] as the default policy, as done here for logistic regression:

if self.solver == 'lbfgs':
    # scipy lbfgs does not support float32 yet:
    # https://github.com/scipy/scipy/issues/4873
    _dtype = np.float64
else:
    # all other solvers work at both float precision levels
    _dtype = [np.float64, np.float32]

He also suggested adding a test to ensure that the dtype of the predicted output remains consistent, i.e.:

    assert_equal(clf_32.predict(X_32).dtype, X_32.dtype)
    assert_equal(clf_64.predict(X_64).dtype, X_64.dtype)
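
For reference, a small sketch of how the [np.float64, np.float32] policy behaves when passed to check_array (the arrays below are arbitrary):

    import numpy as np
    from sklearn.utils import check_array

    X_32 = np.ones((3, 2), dtype=np.float32)

    # With a dtype list, the input dtype is kept if it is already in the list;
    # otherwise the data is converted to the first entry of the list.
    print(check_array(X_32, dtype=[np.float64, np.float32]).dtype)  # float32 kept
    print(check_array(X_32, dtype=np.float64).dtype)                # upcast to float64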

cc: @Henley13, @raghavrv, @ncordier, @vene

raghavrv (Member) commented:
@ogrisel advised me that

assert clf_32.predict(X_32).dtype == X_32.dtype
...

is better, as it leads to more informative error messages in pytest, to which we will be switching soon.

jnothman (Member) commented Jun 14, 2017 via email

GaelVaroquaux (Member, Author) commented:
Fixed by #13273
