
In FIL, clip blocks_per_sm to one wave instead of asserting #4271

Merged 1 commit into rapidsai:branch-21.12 on Oct 6, 2021

Conversation

levsnv (Contributor) commented Oct 6, 2021

After using the feature for a while, we found it easier to sometimes set a high number and expect FIL to scale it down when the GPU requires it, instead of checking the limit externally every time.
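
For context, the change amounts to clamping the user-supplied value to one wave of resident blocks. A minimal standalone sketch of that idea (not the exact cuML code; the helper name and the FIL_TPB value of 256 are assumptions for illustration):

```cpp
#include <algorithm>
#include <cuda_runtime.h>

constexpr int FIL_TPB = 256;  // FIL threads per block; 256 assumed for illustration

// Clamp a requested blocks-per-SM value to one "wave", i.e. the number of
// FIL_TPB-sized blocks that can be resident on one SM at a time.
int clip_blocks_per_sm(int requested, int device_id)
{
  int max_threads_per_sm = 0;
  cudaDeviceGetAttribute(
    &max_threads_per_sm, cudaDevAttrMaxThreadsPerMultiProcessor, device_id);
  // Scale the request down instead of asserting when it exceeds what the GPU allows.
  return std::min(requested, max_threads_per_sm / FIL_TPB);
}
```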

levsnv requested a review from a team as a code owner October 6, 2021 18:42
levsnv requested a review from canonizer October 6, 2021 18:42
levsnv self-assigned this Oct 6, 2021
levsnv added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels Oct 6, 2021
levsnv requested a review from jjacobelli October 6, 2021 18:44
-  ASSERT(blocks_per_sm <= max_blocks_per_sm,
-         "on this GPU, FIL blocks_per_sm cannot exceed %d",
-         max_blocks_per_sm);
+  blocks_per_sm = std::min(blocks_per_sm, max_threads_per_sm / FIL_TPB);
Review comment from a Contributor on the diff:
I think we should also add a warning message if blocks_per_sm > max_threads_per_sm / FIL_TPB.
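
For illustration, the suggested warning could sit right next to the clip. A rough fragment meant to slot into the lines above, assuming cuML's CUML_LOG_WARN logging macro and the variable names from the diff:

```cpp
// Sketch only: warn when the requested value gets clipped to one wave.
int one_wave = max_threads_per_sm / FIL_TPB;
if (blocks_per_sm > one_wave) {
  CUML_LOG_WARN("FIL blocks_per_sm=%d exceeds one wave (%d) on this GPU; clipping",
                blocks_per_sm, one_wave);
}
blocks_per_sm = std::min(blocks_per_sm, one_wave);
```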

dantegd (Member) commented Oct 6, 2021

@gpucibot merge

@codecov-commenter

Codecov Report

❗ No coverage uploaded for pull request base (branch-21.12@c3b5aec).
The diff coverage is n/a.


@@               Coverage Diff               @@
##             branch-21.12    #4271   +/-   ##
===============================================
  Coverage                ?   86.06%           
===============================================
  Files                   ?      231           
  Lines                   ?    18691           
  Branches                ?        0           
===============================================
  Hits                    ?    16087           
  Misses                  ?     2604           
  Partials                ?        0           
Flag        Coverage Δ
dask        47.01% <0.00%> (?)
non-dask    78.75% <0.00%> (?)

Flags with carried forward coverage won't be shown.


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update c3b5aec...1f6810a.

rapids-bot (bot) merged commit 51c41c4 into rapidsai:branch-21.12 Oct 6, 2021
vimarsh6739 pushed a commit to vimarsh6739/cuml that referenced this pull request Oct 9, 2023
…#4271)

After using the feature for a while, we found it easier to sometimes set a high number and expect FIL to scale it down when the GPU requires it, instead of checking the limit externally every time.

Authors:
  - Levs Dolgovs (https://github.com/levsnv)

Approvers:
  - Jordan Jacobelli (https://github.com/Ethyling)
  - Andy Adinets (https://github.com/canonizer)
  - Dante Gama Dessavre (https://github.com/dantegd)

URL: rapidsai#4271
Labels
CUDA/C++, improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change)

5 participants