Correct allowed IMM values for cvtps_ph#2147
Conversation
|
There is a ~2m increase in CI time, because we are testing all possible values of |
|
Hmm, we should start sharding these really... |
Wdym exactly? I am pretty free for the next couple of days |
|
I mean somehow split the workload so that the total wall time is smaller. Maybe just using Alternatively we split e.g. the |
|
The increase is from intrinsic-test, due to SDE being hopelessly slow. We can probably make intrinsic-test emit the tests in two batches, one with SSE, AVX and whatever else qemu can handle, and the other one with AVX512, Then we can probably use qemu for the first case, although I'm not sure how beneficial that would be. also, interestingly the dev profile tests are slower, with clang being the fastest (~18m) and gcc and icx being similar (~21m). The release profile tests all take ~15m. |
Related: #t-libs/stdarch > Documentation of _mm256_cvtps_ph seems incorrect
This corrects all the
cvtps_phfunctions, almost all of them had wrong documentation, and all of them had wrong checks for the rounding imm. Also updates intrinsic-test with the correct valuesr? @folkertdev