New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[LayerNorm Optimize x86] AVX512/AVX/SSE intrinsic #4060
Conversation
LRY89757
commented
Jul 21, 2022
- Add the AVX512/AVX/SSE intrinsic for layernorm
- Add some test samples for elempack == 16
Codecov Report
@@ Coverage Diff @@
## master #4060 +/- ##
==========================================
- Coverage 94.41% 92.98% -1.44%
==========================================
Files 745 743 -2
Lines 178496 178516 +20
==========================================
- Hits 168533 165990 -2543
- Misses 9963 12526 +2563
Continue to review full report at Codecov.
|
I will try to finish the merge of instancenorm and the batchnorm then. At the same time, I will also finish the layernorm in my own way |
close for 03f2ad3 |