Standard abstraction for specifying thread grid hierarchy and dimensions #2037
Labels
CUDA Next
Feature intended for the Cuda Next experimental library
feature request
New feature or request.
CUDA C++ Runtime needs a thread hierarchy description object that specifies a mix of static and dynamic dimensions. Hierarchy will consist of the fundamental block and grid level and allow for optional levels like cluster or custom levels. It will also be expandable with future levels.
There will also be a collection of queries available in both host and device code to count, index etc the entities in the hierarchy and extract the compile-time information from it.
This is a fundamental building block for #2038, but is meant to be reusable outside of that API.
Tasks
The text was updated successfully, but these errors were encountered: