
Faster GLM preprocessing by fusing kernels #4549

Merged: 8 commits into rapidsai:branch-22.04 on Feb 10, 2022

Conversation

@achirkin (Contributor) commented on Feb 2, 2022

Fuse the fit_intercept and normalize kernels when both are enabled. This change reduces the preprocessing/postprocessing runtime by almost half when the data is normalized (normalization is disabled by default, though).
Furthermore, it changes the behavior of the "normalize" switch from dividing by the column-wise L2 norm to dividing by the column-wise standard deviation.
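
To make the semantic change concrete, here is a minimal NumPy sketch of the preprocessing described above. This is an illustration only, not the cuML implementation: the function name and return values are hypothetical, and it assumes the common GLM convention that normalization only applies when an intercept is fitted.

```python
import numpy as np

def preprocess(X, y, fit_intercept=True, normalize=False):
    # Hypothetical sketch of the preprocessing semantics only; the real cuML
    # code does this on the GPU (and, after this PR, in one fused pass when
    # both flags are set).
    n_cols = X.shape[1]
    x_mean = np.zeros(n_cols)
    x_scale = np.ones(n_cols)
    y_mean = 0.0
    if fit_intercept:
        x_mean = X.mean(axis=0)
        y_mean = y.mean()
        X = X - x_mean                   # center the columns of X
        y = y - y_mean                   # center the targets
        if normalize:
            # Old behavior: scale by the column-wise L2 norm of the centered data:
            #     x_scale = np.linalg.norm(X, axis=0)
            # New behavior (this PR): scale by the column-wise standard deviation:
            x_scale = X.std(axis=0)
            x_scale[x_scale == 0] = 1.0  # guard against constant columns
            X = X / x_scale
    return X, y, x_mean, y_mean, x_scale
```

The speedup itself comes from the GPU side: instead of launching separate kernels for mean-centering and for scaling, the PR performs both in a single fused pass over the data when fit_intercept and normalize are both enabled.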

@achirkin requested review from a team as code owners on Feb 2, 2022 09:21
@achirkin added the "improvement" (Improvement / enhancement to an existing function) and "non-breaking" (Non-breaking change) labels and removed the "CMake" and "CUDA/C++" labels on Feb 2, 2022
@achirkin marked this pull request as draft on Feb 2, 2022 12:21
@github-actions bot added the "CMake", "CUDA/C++", and "Cython / Python" (Cython or Python issue) labels on Feb 4, 2022
@github-actions bot removed the "CMake" label on Feb 5, 2022
@achirkin marked this pull request as ready for review on Feb 5, 2022 08:25
@achirkin requested a review from a team as a code owner on Feb 5, 2022 08:25
@achirkin added the "3 - Ready for Review" (Ready for review by team) label on Feb 5, 2022
@achirkin (Contributor, Author) commented on Feb 8, 2022:

rerun tests

@cjnolet (Member) commented on Feb 8, 2022:

rerun tests

@codecov-commenter commented

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.04@9921c61). The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.04    #4549   +/-   ##
===============================================
  Coverage                ?   85.73%           
===============================================
  Files                   ?      239           
  Lines                   ?    19585           
  Branches                ?        0           
===============================================
  Hits                    ?    16792           
  Misses                  ?     2793           
  Partials                ?        0           
Flag       Coverage Δ
dask       46.18% <0.00%> (?)
non-dask   78.73% <0.00%> (?)

Flags with carried forward coverage won't be shown.


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 9921c61...6f056df.

@cjnolet (Member) left a review:

A couple of very minor things. The PR looks great overall.

Review threads (resolved):
- cpp/test/sg/cd_test.cu
- cpp/test/sg/ridge.cu (outdated)
- python/cuml/linear_model/elastic_net.pyx
@cjnolet (Member) left a review:

I meant to request changes.

@achirkin (Contributor, Author) commented:

rerun tests

@cjnolet (Member) left a review:

LGTM!

@cjnolet (Member) commented on Feb 10, 2022:

@gpucibot merge

The @rapids-bot bot merged commit 7ab0f4c into rapidsai:branch-22.04 on Feb 10, 2022.
vimarsh6739 pushed a commit to vimarsh6739/cuml that referenced this pull request on Oct 9, 2023. The commit message repeats the PR description above.

Authors:
  - Artem M. Chirkin (https://github.com/achirkin)

Approvers:
  - Corey J. Nolet (https://github.com/cjnolet)

URL: rapidsai#4549
Labels: 3 - Ready for Review, CUDA/C++, Cython / Python, improvement, non-breaking

3 participants