AVX-512 throughput improvement opportunities #83946
Labels
area-CodeGen-coreclr
CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI
avx512
Related to the AVX-512 architecture
The PR to enable EVEX support by default introduced some JIT throughput regressions. The comments in that PR analyzed the cause of these regressions and identified possible follow-up investigations and improvements.
This issue tracks recovering some of the TP regressions by investigating the proposed improvements or mitigations.
For example, LSRA has a number of places with the following loop structure:
and with AVX-512 available, there are an additional 16 SIMD registers and 8 opmask (k) registers, so these loops iterate more.
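To illustrate why the extra registers matter, here is a minimal sketch, not the actual LSRA code. It compares a loop that scans every register number against one that visits only the bits set in a register mask. All names (`visitByFullScan`, `visitBySetBits`, `VisitResult`) and the base register count of 32 are hypothetical, chosen only for illustration. Visiting only the set bits is one possible mitigation, not necessarily the one proposed in the PR comments.

```cpp
#include <cstdint>

// Hypothetical register counts, for illustration only (not taken from the JIT sources).
constexpr unsigned kBaseRegCount   = 32;                     // assumed: 16 integer + 16 SIMD
constexpr unsigned kAvx512RegCount = kBaseRegCount + 16 + 8; // plus 16 SIMD and 8 opmask (k)

struct VisitResult
{
    unsigned iterations; // number of loop trips taken
    unsigned visited;    // number of registers actually present in the mask
};

// Scans every register number and tests membership in the mask.
// Cost is proportional to regCount, regardless of how many bits are set.
constexpr VisitResult visitByFullScan(std::uint64_t mask, unsigned regCount)
{
    VisitResult r{0, 0};
    for (unsigned reg = 0; reg < regCount; ++reg)
    {
        ++r.iterations;
        if ((mask & (std::uint64_t{1} << reg)) != 0)
        {
            ++r.visited;
        }
    }
    return r;
}

// Visits only the set bits by clearing the lowest set bit on each trip.
// Cost is proportional to the number of registers in the mask.
constexpr VisitResult visitBySetBits(std::uint64_t mask, unsigned regCount)
{
    // Ignore any bits beyond the valid register range.
    const std::uint64_t valid =
        (regCount >= 64) ? ~std::uint64_t{0} : ((std::uint64_t{1} << regCount) - 1);
    mask &= valid;

    VisitResult r{0, 0};
    while (mask != 0)
    {
        ++r.iterations;
        ++r.visited;
        mask &= mask - 1; // clear the lowest set bit
    }
    return r;
}
```

Under these assumed counts, a full scan over a mask with three registers set takes 32 trips without AVX-512 and 56 with it, while the set-bit walk takes three trips in both cases.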