b9717

@github-actions github-actions released this 19 Jun 06:32
8141e73

ggml-cpu: support K tails in power10 Q8/Q4 MMA matmul (#24753)

  • ggml-cpu: support K tails in Power10 MMA Q8/Q4 matmul

This patch removes the requirement that K be divisible by kc in the tinyBlas_Q0_PPC tiled matmul path. The final K panel is now processed using its actual depth, and the reduced panel size is passed through packing and kernel execution. This allows more workloads to use the MMA kernel and reduces fallback to mnpack.
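As an illustration of the K-tail idea only, here is a minimal sketch in plain C++. It is not the tinyBlas_Q0_PPC code: it uses float instead of the Q8/Q4 block formats, no MMA intrinsics, and the names `pack_panel` and `matmul_k_tiled` are hypothetical. It assumes A is stored M x K and B is stored N x K, both row-major, and shows the last K panel being packed and multiplied with its actual depth rather than a full kc.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: copy a `depth`-deep slice of a row-major matrix
// (rows x ld), starting at column k0, into a contiguous panel.
static void pack_panel(const float* src, std::size_t rows, std::size_t ld,
                       std::size_t k0, std::size_t depth,
                       std::vector<float>& dst) {
    dst.resize(rows * depth);
    for (std::size_t r = 0; r < rows; ++r) {
        for (std::size_t k = 0; k < depth; ++k) {
            dst[r * depth + k] = src[r * ld + k0 + k];
        }
    }
}

// Hypothetical sketch: C (M x N) = A (M x K) * B^T, with B stored N x K.
// K does not need to be a multiple of kc.
std::vector<float> matmul_k_tiled(const std::vector<float>& A,
                                  const std::vector<float>& B,
                                  std::size_t M, std::size_t N,
                                  std::size_t K, std::size_t kc) {
    assert(kc > 0);
    assert(A.size() == M * K && B.size() == N * K);

    std::vector<float> C(M * N, 0.0f);
    std::vector<float> a_panel, b_panel;

    for (std::size_t k0 = 0; k0 < K; k0 += kc) {
        // Full panels have depth kc; the final panel may be shallower.
        const std::size_t depth = std::min(kc, K - k0);

        // The reduced depth is passed through packing ...
        pack_panel(A.data(), M, K, k0, depth, a_panel);
        pack_panel(B.data(), N, K, k0, depth, b_panel);

        // ... and through the inner kernel, which accumulates into C.
        for (std::size_t i = 0; i < M; ++i) {
            for (std::size_t j = 0; j < N; ++j) {
                float acc = 0.0f;
                for (std::size_t k = 0; k < depth; ++k) {
                    acc += a_panel[i * depth + k] * b_panel[j * depth + k];
                }
                C[i * N + j] += acc;
            }
        }
    }
    return C;
}
```

With K = 5 and kc = 2, for example, the loop in this sketch runs two full panels of depth 2 and one tail panel of depth 1, so no fallback path is needed for the leftover column.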

Co-authored-by: Aaron Teo <taronaeo@gmail.com>



macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI: