Avx512: Abs, ceil, floor, min, max #84937

jkrishnavs · 2023-04-17T18:12:31Z

Adding support for AVX-512:
Abs, Ceil, Floor, Min, Max, Sqrt, Negate, and Unary Addition.

Covered cases:

Abs:
Abs(Vector512), Abs(Vector512), Abs(Vector512), Abs(Vector512), Abs(Vector512), Abs(Vector512)
Ceil:
Ceil(Vector512), Ceil(Vector512)
Floor:
Floor(Vector512), Floor(Vector512)
Min:
Min(Vector512, Vector512), Min(Vector512, Vector512),
Min(Vector512, Vector512), Min(Vector512, Vector512),
Min(Vector512, Vector512), Min(Vector512, Vector512)
Max:
Max(Vector512, Vector512), Max(Vector512, Vector512),
Max(Vector512, Vector512), Max(Vector512, Vector512),
Max(Vector512, Vector512), Max(Vector512, Vector512)
Negate & Unary Addition:
Negate(Vector512), Negate(Vector512), Negate(Vector512), Negate(Vector512), Negate(Vector512), Negate(Vector512)

Operator for Negate and Unary Addition.

Introduced a few new instructions that were not supported by lower vector intrinsics:

pabsq: Abs for long
Min and Max for long
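
To make the per-lane semantics of the new 64-bit operations concrete, here is a minimal scalar sketch in portable C++. It models a 512-bit vector as eight `int64_t` lanes. The names `Lanes`, `abs_lanes`, `min_lanes`, and `max_lanes` are hypothetical and do not come from the JIT or the .NET API. The wrap-around result for the most negative value is an assumption about how the packed absolute-value instruction behaves, not something stated in this PR.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical model: eight signed 64-bit lanes stand in for one 512-bit vector.
using Lanes = std::array<std::int64_t, 8>;

// Per-lane absolute value. The negation is done in unsigned arithmetic so the
// most negative value wraps back to itself instead of hitting undefined
// behavior (assumed to mirror the hardware result).
Lanes abs_lanes(const Lanes& v) {
    Lanes r{};
    for (std::size_t i = 0; i < v.size(); ++i) {
        const std::uint64_t u = static_cast<std::uint64_t>(v[i]);
        r[i] = v[i] < 0 ? static_cast<std::int64_t>(0 - u) : v[i];
    }
    return r;
}

// Per-lane signed minimum.
Lanes min_lanes(const Lanes& a, const Lanes& b) {
    Lanes r{};
    for (std::size_t i = 0; i < a.size(); ++i) {
        r[i] = std::min(a[i], b[i]);
    }
    return r;
}

// Per-lane signed maximum.
Lanes max_lanes(const Lanes& a, const Lanes& b) {
    Lanes r{};
    for (std::size_t i = 0; i < a.size(); ++i) {
        r[i] = std::max(a[i], b[i]);
    }
    return r;
}
```

A scalar reference like this can be handy for cross-checking vectorized results lane by lane in a test, independent of whether the machine running the test supports AVX-512.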

# Conflicts:
#   src/coreclr/jit/hwintrinsiclistxarch.h
#   src/coreclr/jit/instrsxarch.h

src/coreclr/jit/hwintrinsiclistxarch.h

src/coreclr/jit/gentree.cpp

Removing unnecessary HW_Flag_NoEvexSemantics flag Co-authored-by: Tanner Gooding <tagoo@outlook.com>

Co-authored-by: Tanner Gooding <tagoo@outlook.com>

# Conflicts: # src/coreclr/jit/hwintrinsiclistxarch.h

tannergooding · 2023-04-21T13:02:54Z

Resolved the merge conflict caused by other merged PRs

src/coreclr/jit/gentree.cpp

src/coreclr/jit/hwintrinsic.h

tannergooding

Changes LGTM. Just a couple minor code cleanup requests.

tannergooding

Actually, on second thought, I remembered that we still need to fix up the roundps and roundpd instructions in instrsxarch.h, as they are still marked as EVEX incompatible.

They do actually have an EVEX form, in that they become the vrndscaleps/vrndscalepd instructions and the upper 4 bits of the immediate can now be non-zero.

That also means we probably need to update instr.cpp to ensure that roundps/roundpd are emitted as vrndscale in the disassembly.

Without this fix, Ceiling and Floor are going to fail in their tests.

-- The scalar versions, roundss and roundsd, probably need the same treatment.
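
To illustrate the point about the immediate, here is a minimal scalar sketch of how one lane of vrndscaleps treats its 8-bit immediate, written in portable C++ rather than taken from the JIT. The names `RoundingMode`, `kFloorImm`, `kCeilingImm`, and `rndscale_model` are hypothetical. The sketch assumes the low two bits select the rounding direction, bit 3 suppresses the precision exception, and the upper four bits give a scale M so that the result is rounded to a multiple of 2^-M. It ignores bit 2 (use the current MXCSR rounding mode) as well as NaN, overflow, and exception behavior.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical names for illustration; not taken from the JIT sources.
// The low two bits of the immediate select the rounding direction.
enum RoundingMode : std::uint8_t {
    ToNearestEven = 0x0,
    TowardNegInf  = 0x1, // used for Floor
    TowardPosInf  = 0x2, // used for Ceiling
    TowardZero    = 0x3,
};

// Bit 3 suppresses the precision (inexact) exception.
constexpr std::uint8_t kSuppressPrecision = 0x8;

// Immediates assumed here for Floor and Ceiling; the upper four bits are zero,
// which is the only form the legacy roundps/roundpd encoding allows.
constexpr std::uint8_t kFloorImm   = kSuppressPrecision | TowardNegInf; // 0x09
constexpr std::uint8_t kCeilingImm = kSuppressPrecision | TowardPosInf; // 0x0A

// Scalar model of one lane: round x to a multiple of 2^-M, where M is the
// upper four bits of imm8. With M == 0 this reduces to plain rounding.
float rndscale_model(float x, std::uint8_t imm8) {
    const int m = imm8 >> 4;                 // scale: number of fraction bits kept
    const float scaled = std::ldexp(x, m);   // x * 2^M
    float rounded;
    switch (imm8 & 0x3) {
        case TowardNegInf: rounded = std::floor(scaled); break;
        case TowardPosInf: rounded = std::ceil(scaled);  break;
        case TowardZero:   rounded = std::trunc(scaled); break;
        // Assumes the default floating-point environment (round to nearest even).
        default:           rounded = std::nearbyint(scaled); break;
    }
    return std::ldexp(rounded, -m);          // scale back by 2^-M
}
```

With the upper four bits zero, `rndscale_model(2.5f, kFloorImm)` gives 2.0 and `rndscale_model(2.5f, kCeilingImm)` gives 3.0, matching the legacy round behavior. Setting the upper bits, for example `0x19`, rounds down to a multiple of 0.5 instead, which is the extra capability the EVEX form adds.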

Co-authored-by: Tanner Gooding <tagoo@outlook.com>

…into avx512minmax

jkrishnavs · 2023-04-21T21:53:23Z

> to update instr.cpp to ensure that roundps/roundpd are emitted as vrndscale in the disassembly

We have tried to add the vrndscaleps/vrndscalepd instructions. Hopefully this fixes the round instructions.

src/coreclr/jit/instrsxarch.h

… the code gen part with EVEX encoding.

src/coreclr/jit/instrsxarch.h

Co-authored-by: Tanner Gooding <tagoo@outlook.com>

tannergooding

LGTM, thanks!

jkrishnavs · 2023-04-22T01:03:06Z

For some weird reason, a stray git "+" was added to the code change. Making an additional push to remove it.

src/coreclr/jit/instrsxarch.h

tannergooding · 2023-04-22T15:59:06Z

CC. @dotnet/jit-contrib, @dotnet/avx512-contrib

This needs a secondary sign-off from the JIT side.

jkrishnavs added 6 commits April 13, 2023 22:29

Minmax implementation

c44a922

Fixing new instruction issues

20d19f1

Merge branch 'main' into avx512minmax

4692e6f

Abs, Ceil and Floor implementation.

179dca9

Merge branch 'main' into avx512minmax

964e7c4

Merge branch 'main' into avx512minmax

024123a

dotnet-issue-labeler bot added the area-CodeGen-coreclr (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) label Apr 17, 2023

jkrishnavs added 9 commits April 17, 2023 12:11

Merge branch 'main' into avx512minmax

f9d9664

Unary operations: Sqrt, Negate, Addition

ce78b0a

Removing debug function for condition

5448d87

Merge branch 'main' into avx512minmax

a73f4ca

Fixing formatting issues

fd002fb

Merge branch 'main' into avx512minmax

28fc029

# Conflicts:
#   src/coreclr/jit/hwintrinsiclistxarch.h
#   src/coreclr/jit/instrsxarch.h

fixing merge issue

3fa7343

Fixing rebase formatting

a4006e1

Reverting changes to fix rebase issues

def94d5

build-analysis bot mentioned this pull request Apr 19, 2023

Tracking issue for CI build timeouts #76454

Closed

tannergooding added the avx512 (Related to the AVX-512 architecture) label Apr 20, 2023