Skip to content

DGEMM Optimizations for Cortex-A57#802

Merged
xianyi merged 3 commits intoOpenMathLib:developfrom
ashwinyes:develop_20160314_dgemm_optimization
Mar 15, 2016
Merged

DGEMM Optimizations for Cortex-A57#802
xianyi merged 3 commits intoOpenMathLib:developfrom
ashwinyes:develop_20160314_dgemm_optimization

Conversation

@ashwinyes
Copy link
Copy Markdown
Contributor

Further optimizations to DGEMM for Cortex-A57
* Single core performance improved by ~10%
Additional functional (non-optimized) GEMM assembly kernels for Cortex-A57


This change is Review on Reviewable

Ashwin Sekhar T K added 3 commits March 14, 2016 19:33
Adding functional (non-optimized) kernels for Cortex-A57
with the following layouts.
SGEMM - 16x4, 8x8
CGEMM - 8x4
DGEMM - 8x4, 4x8
xianyi added a commit that referenced this pull request Mar 15, 2016
…ation

DGEMM Optimizations for Cortex-A57
@xianyi xianyi merged commit e173039 into OpenMathLib:develop Mar 15, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants