Launch configuration: use ZE_extension_kernel_max_group_size_properties #430

maleadt · 2024-04-19T10:29:39Z

With prime-sized inputs the suggested group size always consists of only a single thread:

julia> k = @oneapi launch=false identity(nothing)

julia> oneL0.suggest_groupsize(k.fun, 521)
oneAPI.oneL0.ZeDim3(1, 1, 1)

julia> oneL0.suggest_groupsize(k.fun, 7877)
oneAPI.oneL0.ZeDim3(1, 1, 1)

julia> oneL0.suggest_groupsize(k.fun, 7919)
oneAPI.oneL0.ZeDim3(1, 1, 1)

But also with non prime-sized inputs the configuration looks highly suboptimal:

julia> oneL0.suggest_groupsize(k.fun, 8000)
oneAPI.oneL0.ZeDim3(64, 1, 1)

(this kernel can launch groups of 512 threads on this system)

Maybe I'm misinterpreting the use of this API? I thought it was a counterpart of the CUDA occupancy API (cuOccupancyMaxPotentialBlockSize), suggesting a groupsize that accomplishes a reasonable occupancy.

The text was updated successfully, but these errors were encountered:

maleadt · 2024-04-19T12:08:38Z

Filed upstream: intel/compute-runtime#725

maleadt · 2024-04-22T16:46:52Z

As noted by upstream, this is expected; the suggested launch configuration exactly covers the input space. Since we don't care about this, using bounds checks at run time, we can use more relaxed launch configurations. A workaround is implemented in #431, but once there's a new driver release we should use the Level Zero extension to query the maximum launch configuration for a given kernel.

maleadt · 2024-05-16T10:30:09Z

Fixed by #431

maleadt mentioned this issue Apr 19, 2024

Roll our own launch configuration #431

Merged

maleadt added libraries Things about libraries and how we use them. upstream Out of our hands. labels Apr 19, 2024

maleadt changed the title ~~Confusing suggest_groupsize results~~ Launch configuration: use ZE_extension_kernel_max_group_size_properties Apr 22, 2024

maleadt added kernels Things about kernels and how they are compiled. and removed upstream Out of our hands. libraries Things about libraries and how we use them. labels Apr 22, 2024

maleadt closed this as completed May 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Launch configuration: use ZE_extension_kernel_max_group_size_properties #430

Launch configuration: use ZE_extension_kernel_max_group_size_properties #430

maleadt commented Apr 19, 2024

maleadt commented Apr 19, 2024

maleadt commented Apr 22, 2024

maleadt commented May 16, 2024

Launch configuration: use ZE_extension_kernel_max_group_size_properties #430

Launch configuration: use ZE_extension_kernel_max_group_size_properties #430

Comments

maleadt commented Apr 19, 2024

maleadt commented Apr 19, 2024

maleadt commented Apr 22, 2024

maleadt commented May 16, 2024