
Request for padded generic kernel #709

@cloudhan

Description

Currently, each individual kernel in Composable Kernel supports only a sparse set of shapes (runtime parameters). To use the library, we have to rely heavily on the tuning mechanism to find a kernel that is supported (and, ideally, faster). We would like the default kernels (that is, the kernels that come out of InstanceFactory at index 0) to support runtime parameters densely. These padded generic kernels should support every combination of runtime parameters, so for them impl->IsSupportedArgument(arg) must always return true. We would like this for:

  • FlashAttention
  • Gemm + elementwise
  • pure elementwise kernels
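
To make the request concrete, here is a minimal sketch of the difference between a sparse-shape instance and a padded generic one. Everything in it is hypothetical and for illustration only: GemmArg, the tile sizes, UnpaddedInstance, PaddedGenericInstance, RoundUp and PaddedShape are made-up names, not Composable Kernel types. Only the IsSupportedArgument method name mirrors the call mentioned above. It assumes the padded kernel handles odd shapes by rounding each dimension up to the next tile multiple.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical runtime argument: a GEMM problem size (not the CK argument type).
struct GemmArg {
    std::size_t M, N, K;
};

// Hypothetical tile sizes, chosen only for illustration.
constexpr std::size_t kTileM = 128, kTileN = 128, kTileK = 32;

// Round x up to the next multiple of tile.
constexpr std::size_t RoundUp(std::size_t x, std::size_t tile) {
    return (x + tile - 1) / tile * tile;
}

// Sparse-shape instance: accepts only sizes that are exact tile multiples.
struct UnpaddedInstance {
    bool IsSupportedArgument(const GemmArg& a) const {
        return a.M % kTileM == 0 && a.N % kTileN == 0 && a.K % kTileK == 0;
    }
};

// Padded generic instance: accepts every shape and reports the padded
// extents it would actually compute on (assumed padding strategy).
struct PaddedGenericInstance {
    bool IsSupportedArgument(const GemmArg&) const { return true; }

    GemmArg PaddedShape(const GemmArg& a) const {
        return {RoundUp(a.M, kTileM), RoundUp(a.N, kTileN), RoundUp(a.K, kTileK)};
    }
};

// Small self-check: an odd shape is rejected by the sparse instance
// but accepted by the padded generic one.
inline void CheckExample() {
    const GemmArg odd{1000, 77, 5};
    assert(!UnpaddedInstance{}.IsSupportedArgument(odd));
    assert(PaddedGenericInstance{}.IsSupportedArgument(odd));
}
```

With an instance like this at index 0, a caller could fall back to it for any shape without going through tuning first, and still tune afterwards to find a faster specialized kernel.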

Metadata

Labels

enhancement (New feature or request)
