Currently, composable kernel supplies kernels with very sparse shape (runtime parameters) support for an individual kernel. To use this library, we need to heavily rely on tuning mechanism to find a supported (as well as faster) one. Here we want to have the default kernels (that is, the kerenls come out of InstanceFactory with index 0) to support runtime parameters densely. We want thoes ( padded generic) kernels to support all runtime parameter combination, and impl->IsSupportedArgument(arg), the padded generic ones must always return true.
Currently, composable kernel supplies kernels with very sparse shape (runtime parameters) support for an individual kernel. To use this library, we need to heavily rely on tuning mechanism to find a supported (as well as faster) one. Here we want to have the default kernels (that is, the kerenls come out of
InstanceFactorywith index0) to support runtime parameters densely. We want thoes ( padded generic) kernels to support all runtime parameter combination, andimpl->IsSupportedArgument(arg), the padded generic ones must always return true.