-
Notifications
You must be signed in to change notification settings - Fork 513
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fixing intermittent HBDSCAN pytest failure in CI #4025
Fixing intermittent HBDSCAN pytest failure in CI #4025
Conversation
Codecov Report
@@ Coverage Diff @@
## branch-21.08 #4025 +/- ##
===============================================
Coverage ? 85.46%
===============================================
Files ? 230
Lines ? 18133
Branches ? 0
===============================================
Hits ? 15498
Misses ? 2635
Partials ? 0
Flags with carried forward coverage won't be shown. Click here to find out more. Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks for chasing this down, Divye. The assumptions and expectations in the fused l2 knn primitive are not clearly noted anywhere and it's been a bit of challenge to decompose them while customizing the k-select behavior for the reachability distances in hdbscan.
@gpucibot merge |
closes #4025 again. This is happening because the underlying assumptions in the fused-1nn have likely changed, and that trickles down to HDBSCAN and the functors we use for accessing that API - as discussed with @cjnolet Authors: - Divye Gala (https://github.com/divyegala) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: #4052
There was an OOB access occurring here https://github.com/rapidsai/cuml/blob/0707a46f717023ca0b5047c3aeee9ead1a093272/cpp/src/hdbscan/runner.h#L74 when `b.key == 1`. This error was getting swallowed up and we were seeing those `thrust::transform` errors in CI. Tagging @cjnolet to confirm if this is the intended way to use this functor and the fix is correct. Authors: - Divye Gala (https://github.com/divyegala) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#4025
closes rapidsai#4025 again. This is happening because the underlying assumptions in the fused-1nn have likely changed, and that trickles down to HDBSCAN and the functors we use for accessing that API - as discussed with @cjnolet Authors: - Divye Gala (https://github.com/divyegala) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#4052
There was an OOB access occurring here
cuml/cpp/src/hdbscan/runner.h
Line 74 in f7fb363
b.key == 1
. This error was getting swallowed up and we were seeing thosethrust::transform
errors in CI.Tagging @cjnolet to confirm if this is the intended way to use this functor and the fix is correct.