
xelpg: jit: gemm: additional f16 accumulation strategies #3417


Open

petercad wants to merge 1 commit into main

Conversation

petercad (Contributor)

Adds f16-accumulation FMA strategies for MTL (Meteor Lake), opt-in via --attr-acc-mode=f16. Theoretical peak throughput is 2x that of f32 accumulation, and the measured speedup is close to the theoretical one.
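
For context, a minimal sketch of how a user opts into f16 accumulation through the oneDNN C++ API (illustrative, not code from this PR; the matmul shapes, GPU engine index, and plain `ab` layouts are assumptions):

```cpp
#include "oneapi/dnnl/dnnl.hpp"

int main() {
    using namespace dnnl;
    // Assumption: first GPU device on the system.
    engine eng(engine::kind::gpu, 0);
    stream strm(eng);

    // Illustrative problem size: 1024x1024 times 1024x1024, all f16.
    const memory::dim M = 1024, K = 1024, N = 1024;
    auto src_md = memory::desc({M, K}, memory::data_type::f16, memory::format_tag::ab);
    auto wei_md = memory::desc({K, N}, memory::data_type::f16, memory::format_tag::ab);
    auto dst_md = memory::desc({M, N}, memory::data_type::f16, memory::format_tag::ab);

    // Relax the default f32 accumulation to f16; this is the opt-in
    // that the new FMA strategies target.
    primitive_attr attr;
    attr.set_accumulation_mode(accumulation_mode::f16);

    auto pd = matmul::primitive_desc(eng, src_md, wei_md, dst_md, attr);
    auto mm = matmul(pd);

    memory src(src_md, eng), wei(wei_md, eng), dst(dst_md, eng);
    mm.execute(strm, {{DNNL_ARG_SRC, src},
                      {DNNL_ARG_WEIGHTS, wei},
                      {DNNL_ARG_DST, dst}});
    strm.wait();
    return 0;
}
```

From benchdnn, the same attribute is the `--attr-acc-mode=f16` knob mentioned above, exercised along the lines of `./benchdnn --matmul --engine=gpu --dt=f16:f16:f16 --attr-acc-mode=f16 1024x1024:1024x1024` (exact flag spellings per the benchdnn matmul driver documentation).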

petercad requested a review from a team as a code owner on Jun 11, 2025.
github-actions bot added the platform:gpu-intel (Codeowner: @oneapi-src/onednn-gpu-intel) label on Jun 11, 2025.
petercad (Author)

make test linters

petercad (Author)

make test
disable test_device_cpu
disable build_cpu_runtime_omp
disable build_cpu_runtime_sycl
disable build_cpu_runtime_tbb
disable benchdnn_all
enable benchdnn_matmul

Labels
platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
3 participants