Improve BLAS Level 3 multi-threads performance on ICT Loongson 3A #47

Closed
xianyi opened this Issue · 0 comments

1 participant

@xianyi
Owner

The average speedup of multi-threaded dgemm is only about 2.83 on 4 cores.
This needs optimization.
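
To put that number in context, here is a minimal sketch that turns a measured speedup into parallel efficiency (speedup divided by core count, where 1.0 means ideal linear scaling). The function name `parallel_efficiency` is hypothetical and is not part of the library; only the 2.83 and 4-core figures come from this issue.

```c
#include <assert.h>

/* Hypothetical helper, not part of the BLAS library:
 * parallel efficiency = speedup / number of cores.
 * A value of 1.0 would mean ideal linear scaling. */
double parallel_efficiency(double speedup, int cores)
{
    assert(cores > 0);  /* guard against division by zero */
    return speedup / (double)cores;
}
```

With the figures above, `parallel_efficiency(2.83, 4)` gives roughly 0.71, so about 71% of ideal scaling on 4 cores.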

@xianyi xianyi was assigned
@xianyi xianyi referenced this issue from a commit
@xianyi Refs #47. On Loongson 3A, set DGEMM_R parameter depending on different number of threads. It would improve double precision BLAS3 on multi-threads.
4727fe8
@xianyi xianyi closed this