Allow to do gemv and ger buffer allocation on the stack #482

jeromerobert · 2014-12-27T13:53:52Z

ger and gemv call blas_memory_alloc/free which in their turn
call blas_lock. blas_lock create thread contention when matrices
are small and the number of thread is high enough. We avoid
call blas_memory_alloc by replacing it with stack allocation.
This can be enabled with:
make -DMAX_STACK_ALLOC=2048
The given size (in byte) must be high enough to avoid thread contention
and small enough to avoid stack overflow.

Fix #478

ger and gemv call blas_memory_alloc/free which in their turn call blas_lock. blas_lock create thread contention when matrices are small and the number of thread is high enough. We avoid call blas_memory_alloc by replacing it with stack allocation. This can be enabled with: make -DMAX_STACK_ALLOC=2048 The given size (in byte) must be high enough to avoid thread contention and small enough to avoid stack overflow. Fix OpenMathLib#478

Allow to do gemv and ger buffer allocation on the stack

For gemv_t, directly use malloc to create the buffer.

Refs OpenMathLib#478, OpenMathLib#482, 9798481, fd9fd42

xianyi added a commit that referenced this pull request Jan 1, 2015

Merge pull request #482 from jeromerobert/develop

41aad04

Allow to do gemv and ger buffer allocation on the stack

xianyi merged commit 41aad04 into OpenMathLib:develop Jan 1, 2015

jeromerobert mentioned this pull request Apr 8, 2015

OpenBLAS 6 times slower than MKL on DGEMV() #532

Closed

xianyi added a commit that referenced this pull request Apr 13, 2015

Refs #478, #482. Fix segfault bug for gemv_t with MAX_ALLOC_STACK flag.

9798481

For gemv_t, directly use malloc to create the buffer.

xianyi added a commit that referenced this pull request Apr 13, 2015

Refs #478, #482. Fixed bug on previous commit.

fd9fd42

jeromerobert added a commit to jeromerobert/OpenBLAS that referenced this pull request Apr 15, 2015

Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t

f31e27a

Refs OpenMathLib#478, OpenMathLib#482, 9798481, fd9fd42

jeromerobert added a commit to jeromerobert/OpenBLAS that referenced this pull request Apr 15, 2015

Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t

a4c96ec

Refs OpenMathLib#478, OpenMathLib#482, 9798481, fd9fd42

jeromerobert mentioned this pull request Apr 15, 2015

Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t #543

Merged

hiccup7 mentioned this pull request Apr 20, 2015

Build OpenBLAS with MAX_STACK_ALLOC=2048 JuliaLang/julia#10780

Closed

xianyi added a commit that referenced this pull request Apr 20, 2015

Refs #478,#482, Enable stack alloc for s/dgemv_t.(revert 9798491)

847e19c

OtacilioNeto mentioned this pull request Feb 3, 2017

DGESVD slow coparing with intel implementation #1077

Open

martin-frbg mentioned this pull request Jun 3, 2020

cmake: DYNAMIC_ARCH build broken on OS X #2634

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow to do gemv and ger buffer allocation on the stack #482

Allow to do gemv and ger buffer allocation on the stack #482

jeromerobert commented Dec 27, 2014

Allow to do gemv and ger buffer allocation on the stack #482

Allow to do gemv and ger buffer allocation on the stack #482

Conversation

jeromerobert commented Dec 27, 2014