New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AVX2 intrinsic optimization #71
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code reviewed ( removed unused transform SSE code and added c kernel for BiPredClippingOnTheFly).
Code tested and ready to be merged
HEVC_BiPredClipping width 4 2.23x 259.01 576.49 HEVC_BiPredClipping width 8 8.55x 247.48 2116.88 HEVC_BiPredClipping width 16 13.71x 608.02 8337.47 HEVC_BiPredClipping width 32 31.22x 2011.41 62787.17 HEVC_BiPredClipping width 64 65.04x 8295.15 539476.88
EstimateInvTransform32x32_AVX2 7.83x 44473.62 348382.25
+ add c kernel for BiPredClippingOnTheFly
Hi @ttrigui, I am not familiar with ASM to do code review. Could you let me know how I can help to get this patch merged? Thanks. |
hevc_fwd_txfm8 4.22x 1220.40 5150.50
Added AVX2 intrinsic codes for
|
Fixed Linux build error. |
This reverts commit fa0702c.
This reverts commit fa0702c.
BiPredClipping_AVX2
BiPredClippingOnTheFly_AVX2
invTransform32