Add AVX2 HBD SATD asm for 8x8-transformed blocks #3048

redzic · 2022-10-26T06:43:16Z

A little more than 2x faster for 8x8 block size than the current intrinsics code. Relative speedup should be higher for larger block sizes because of lower looping overhead and using a vertical sum for each 8x8 satd, and using 1 horizontal sum at the end instead of doing a horizontal sum for each block, which is much slower.

Might be possible to go faster by recursively splitting each transform into 2 smaller transforms (like a radix-2 FFT), but I figured this is good enough for now.

codecov-commenter · 2022-10-26T06:56:42Z

Codecov Report

Base: 86.41% // Head: 86.22% // Decreases project coverage by -0.18% ⚠️

Coverage data is based on head (4625bd8) compared to base (f869e16).
Patch has no changes to coverable lines.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3048      +/-   ##
==========================================
- Coverage   86.41%   86.22%   -0.19%     
==========================================
  Files          84       84              
  Lines       33158    33158              
==========================================
- Hits        28652    28591      -61     
- Misses       4506     4567      +61

Impacted Files	Coverage Δ
src/asm/x86/dist/hbd.rs	`58.44% <ø> (-41.56%)`	⬇️
src/context/frame_header.rs	`66.06% <0.00%> (-2.27%)`	⬇️
src/encoder.rs	`87.08% <0.00%> (-0.07%)`	⬇️
src/tiling/tile_state.rs	`89.13% <0.00%> (ø)`
src/me.rs	`95.60% <0.00%> (+0.09%)`	⬆️
src/rdo.rs	`85.65% <0.00%> (+0.41%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

barrbrain approved these changes Oct 26, 2022

View reviewed changes

x86: Add AVX2 HBD SATD asm for 8x8-transformed blocks

045137b

redzic force-pushed the avx2-hbd-satd branch from 4625bd8 to 045137b Compare October 26, 2022 08:06

redzic merged commit 76cfbea into xiph:master Oct 26, 2022

redzic deleted the avx2-hbd-satd branch October 26, 2022 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add AVX2 HBD SATD asm for 8x8-transformed blocks #3048

Add AVX2 HBD SATD asm for 8x8-transformed blocks #3048

redzic commented Oct 26, 2022

codecov-commenter commented Oct 26, 2022 •

edited

Add AVX2 HBD SATD asm for 8x8-transformed blocks #3048

Add AVX2 HBD SATD asm for 8x8-transformed blocks #3048

Conversation

redzic commented Oct 26, 2022

codecov-commenter commented Oct 26, 2022 • edited

Codecov Report

codecov-commenter commented Oct 26, 2022 •

edited