[ROCDL] Added rocdl.cvt.scale.sr.pk8 ops #162244

ravil-mobile · 2025-10-07T08:58:10Z

This patch introduces some missing FP conversion instructions in the ROCDL dialect for the GFX1250 arch.

Specifically:

Downscaling 8x packed F16, Bf16, Fp32 values to Fp8, Bf8, Fp4 with stochastic rounding
Tests:

Added lit-tests to check MLIR -> LLVM lowering

krzysz00

lgtm, approved, thank you

ravil-mobile · 2025-10-07T15:11:45Z

Contributor

Thanks!

[ROCDL] Added rocdl.cvt.scale.sr.pk8 ops

4f5d0ff

amd-eochoalo approved these changes Oct 7, 2025

View reviewed changes

krzysz00 approved these changes Oct 7, 2025

View reviewed changes

amd-eochoalo merged commit 3f62407 into llvm:main Oct 7, 2025
10 checks passed

ravil-mobile deleted the ravil/rocdl-fp-conv branch October 8, 2025 14:36

Provide feedback