Support setting kernel block cluster dimensions #484

eyalroz · 2023-03-28T09:04:36Z

With the Hopper architecture, NVIDIA has introduced "clusters" of blocks which can use each other's shared memory. The clustering can be set either using a __cluster_dims__(1,2,3) qualifier in the kernel's signature, or at run-time. We need to support the run-time setting within our launch_configuration_t class and in the launch config builder mechanism.

The text was updated successfully, but these errors were encountered:

eyalroz · 2024-03-16T00:01:17Z

Fixed by addressing #564 .

eyalroz added the task label Mar 28, 2023

eyalroz added this to the Full CUDA 12 Support milestone Mar 28, 2023

eyalroz closed this as completed Mar 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support setting kernel block cluster dimensions #484

Support setting kernel block cluster dimensions #484

eyalroz commented Mar 28, 2023

eyalroz commented Mar 16, 2024

Support setting kernel block cluster dimensions #484

Support setting kernel block cluster dimensions #484

Comments

eyalroz commented Mar 28, 2023

eyalroz commented Mar 16, 2024