Skip to content

Commit

Permalink
x86/cpu: set avxslow cpuflag on btver2 CPUs
Browse files Browse the repository at this point in the history
They are also slow when using 256 bit wide registers

Reviewed-by: Hendrik Leppkes <h.leppkes@gmail.com>
Signed-off-by: James Almer <jamrial@gmail.com>
  • Loading branch information
jamrial committed Feb 7, 2016
1 parent ba618bd commit be22bd3
Showing 1 changed file with 4 additions and 6 deletions.
10 changes: 4 additions & 6 deletions libavutil/x86/cpu.c
Original file line number Diff line number Diff line change
Expand Up @@ -182,13 +182,11 @@ int ff_get_cpu_flags_x86(void)

/* Similar to the above but for AVX functions on AMD processors.
This is necessary only for functions using YMM registers on Bulldozer
based CPUs as they lack 256-bits execution units. SSE/AVX functions
using XMM registers are always faster on them.
and Jaguar based CPUs as they lack 256-bits execution units. SSE/AVX
functions using XMM registers are always faster on them.
AV_CPU_FLAG_AVX and AV_CPU_FLAG_AVXSLOW are both set so that AVX is
used unless explicitly disabled by checking AV_CPU_FLAG_AVXSLOW.
TODO: Confirm if Excavator is affected or not by this once it's
released, and update the check if necessary. Same for btver2. */
if (family == 0x15 && (rval & AV_CPU_FLAG_AVX))
used unless explicitly disabled by checking AV_CPU_FLAG_AVXSLOW. */
if ((family == 0x15 || family == 0x16) && (rval & AV_CPU_FLAG_AVX))
rval |= AV_CPU_FLAG_AVXSLOW;
}

Expand Down

0 comments on commit be22bd3

Please sign in to comment.