You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I' m wondering whether it is the best way to do that in ispc.
And the asm use ymm1 register. How can i use only xmm1 to do vcmpeqps and vmovmskps operation.
I think In this case only xmm is enough.
(In C/C++ I can use _mm_cmple_ps/ _mm_movemask_ps for 128 bit wide.)
I don‘t want to change target to avx2-i32x4
I want't to keep the TARGET_WIDTH = 8 to get other functions in my program effecient
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm going to get the index of max element in float<4> in avx2-i32x8.
And here is my code.
I get the asm by this command
ispc -O2 -g --target=avx2-i32x8 --emit-asm
And the asm code like below
I' m wondering whether it is the best way to do that in ispc.
And the asm use ymm1 register. How can i use only xmm1 to do vcmpeqps and vmovmskps operation.
I think In this case only xmm is enough.
(In C/C++ I can use _mm_cmple_ps/ _mm_movemask_ps for 128 bit wide.)
I don‘t want to change target to avx2-i32x4
I want't to keep the TARGET_WIDTH = 8 to get other functions in my program effecient
Beta Was this translation helpful? Give feedback.
All reactions