Allow public tuning of cub::DeviceMergeSort #8600
Walkthrough: Exports public cub::MergeSortPolicy and updates selectors, dispatch, kernels, agents, docs, and tests to use it. Sizing now uses policy.threads_per_block * policy.items_per_thread, and benchmarks/tests demonstrate env-based tuning with MergeSortPolicy.
Changes: MergeSortPolicy Public API and Dispatch Chain
📒 Files selected for processing (9):
- cub/benchmarks/bench/merge_sort/keys.cu
- cub/benchmarks/bench/merge_sort/pairs.cu
- cub/cub/agent/agent_merge_sort.cuh
- cub/cub/device/device_merge_sort.cuh
- cub/cub/device/dispatch/dispatch_merge_sort.cuh
- cub/cub/device/dispatch/kernels/kernel_merge_sort.cuh
- cub/cub/device/dispatch/tuning/tuning_merge_sort.cuh
- cub/test/catch2_test_device_merge_sort_env.cu
- cub/test/catch2_test_device_merge_sort_env_api.cu
🥳 CI Workflow Results
🟩 Finished in 2h 12m: Pass: 100%/284 | Total: 11d 05h | Max: 2h 10m | Hits: 20%/929605
Preceding work:
- cuda::execution::tune public #8652
- BlockThreads to ThreadsPerBlock #8835
- block_threads -> threads_per_block #8836

Fixes: #8574
New public entities for cub::DeviceMergeSort:
- cub::MergeSortPolicy
  - threads_per_block, items_per_thread, load_algorithm, load_modifier, store_algorithm
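
To make the sizing rule from the walkthrough concrete (tile size = threads_per_block * items_per_thread), here is a minimal host-side sketch. The struct `merge_sort_policy_sketch` and the helpers `tile_size` and `num_tiles` are hypothetical names invented for this illustration. They are not the real `cub::MergeSortPolicy` or any CUB function; only the two field names are taken from this PR. The ceiling division used for the tile count is an assumption about how such a size is typically consumed.

```cpp
#include <cstddef>

// Hypothetical stand-in for illustration only; this is NOT cub::MergeSortPolicy.
// Only the two field names below are taken from the PR description.
struct merge_sort_policy_sketch
{
  int threads_per_block;
  int items_per_thread;
};

// Number of items one thread block processes per tile, following the rule
// quoted in the walkthrough: threads_per_block * items_per_thread.
constexpr std::size_t tile_size(merge_sort_policy_sketch policy)
{
  return static_cast<std::size_t>(policy.threads_per_block)
       * static_cast<std::size_t>(policy.items_per_thread);
}

// Assumed usage: number of tiles needed to cover num_items (ceiling division).
constexpr std::size_t num_tiles(std::size_t num_items, merge_sort_policy_sketch policy)
{
  return (num_items + tile_size(policy) - 1) / tile_size(policy);
}
```

With such a rule, changing either field of a user-provided policy changes the tile size, and therefore how many tiles an input of a given length is split into.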