Investigate the impact of LU-variants on randomized SVD with scipy's lu_no_fortran branch #16

Draft · wants to merge 1 commit into main
Conversation

@ogrisel (Owner) commented on May 23, 2023

Experiment with the lu_no_fortran branch of scipy (scipy/scipy#18358) to investigate the impact of LU-based normalizer variants on randomized SVD.
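
For context, below is a minimal sketch (not the actual benchmark code) of the kind of power-iteration loop being compared, using an LU-based normalizer built on the long-standing `scipy.linalg.lu(..., permute_l=True)` API; the function name and defaults are illustrative assumptions.

```python
import numpy as np
from scipy.linalg import lu, qr, svd


def randomized_svd_lu(A, n_components, n_iter=4, random_state=0):
    """Randomized SVD with LU-normalized power iterations (sketch)."""
    rng = np.random.default_rng(random_state)
    Q = rng.standard_normal((A.shape[1], n_components))
    for _ in range(n_iter):
        # LU-based normalization: keep the permuted unit-lower-triangular
        # factor P @ L as the new basis after each matrix multiplication.
        Q, _ = lu(A @ Q, permute_l=True)
        Q, _ = lu(A.T @ Q, permute_l=True)
    # Final orthonormalization and SVD of the small projected matrix.
    Q, _ = qr(A @ Q, mode="economic")
    B = Q.T @ A
    U_hat, s, Vt = svd(B, full_matrices=False)
    return Q @ U_hat, s, Vt
```

The benchmarks compare this LU-based normalizer (and its variants) against QR-based normalization and against skipping normalization entirely.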

Here are the resulting plots of the randomized SVD benchmarks with various normalizers:

[Benchmark plots: uncorrelated_matrix, a3a, mnist_784, 20newsgroups, olivetti_faces, lfw_people, low_rank_matrix]

So in conclusion:

  • Using LU-based normalization while completely discarding the permutation info is sometimes better and sometimes worse (!!!) than no normalization at all, but always worse than an LU-based normalizer that takes the permutation into account one way or another.
  • Using index-based permutation (using matrix_p=False followed by row-wise fancy indexing) seems to be approximately as fast as letting scipy precompute the permutation with permute_l=True (see the sketch after this list).
  • LU-based normalization with the new Cython-based scipy branch is still slightly faster than QR-based normalization (on average).
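
To make the three LU variants above concrete, here is a sketch of the corresponding normalizer steps. It only uses the released `scipy.linalg.lu` API; the branch under test can return permutation indices directly (the `matrix_p=False` option mentioned above), which the `argmax` call below merely emulates.

```python
import numpy as np
from scipy.linalg import lu


def lu_no_permute(M):
    # Discard the permutation entirely: sometimes better, sometimes worse
    # than no normalization at all, and always worse than the variants below.
    _, L, _ = lu(M)
    return L


def lu_permute_l(M):
    # Let scipy pre-apply the permutation and return P @ L directly.
    PL, _ = lu(M, permute_l=True)
    return PL


def lu_fancy_indexing(M):
    # Index-based variant: apply the permutation with row-wise fancy
    # indexing.  Released scipy only exposes the permutation as a matrix,
    # so the row indices are recovered with argmax here; the lu_no_fortran
    # branch can return them directly, avoiding the matrix product.
    P, L, _ = lu(M)
    p = P.argmax(axis=1)  # row i of P @ L is row p[i] of L
    return L[p]           # same result as P @ L, without forming the product
```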

@lezcano commented on May 23, 2023

Fair enough. Thank you for the benchmarks!

There's just one bit I don't understand. How come in the last graph "LU_no_permute" and "QR" seem to "come back in time" (iter 0 happens AFTER iter 1)?

@ogrisel (Owner, Author) commented on May 23, 2023

> There's just one bit I don't understand. How come in the last graph "LU_no_permute" and "QR" seem to "come back in time" (iter 0 happens AFTER iter 1)?

Those are random fluctuations in the smallest timings. Ideally, I should re-run the benchmarks many times and plot the average with horizontal error bars, but life is too short ;)
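
A minimal sketch of that idea, assuming a hypothetical `run_one_benchmark` helper for a single (normalizer, n_iter) configuration:

```python
import time
import numpy as np


def timed(fn, n_repeats=10):
    """Run fn several times and return the mean and std of the wall time."""
    durations = []
    for _ in range(n_repeats):
        tic = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - tic)
    return np.mean(durations), np.std(durations)


# Example (run_one_benchmark is hypothetical):
#   mean_t, std_t = timed(lambda: run_one_benchmark(normalizer="LU", n_iter=2))
#   plt.errorbar(mean_t, error, xerr=std_t, fmt="o")  # horizontal error bars
```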
