Commit

Fixed #27. Temporarily work around axpy's low performance issue with small input size & multiple threads.
xianyi committed Jun 19, 2011
1 parent b3d1887 commit aeed8d6
Showing 2 changed files with 6 additions and 1 deletion.
1 change: 1 addition & 0 deletions Changelog.txt
@@ -25,6 +25,7 @@ x86/x86_64:
* Fixed #28 a wrong result of dsdot on x86_64.
* Fixed #32 a SEGFAULT bug of zdotc with gcc-4.6.
* Fixed #33 ztrmm bug on Nehalem.
* Worked around #27, the low performance of axpy with small input size & multiple threads.

MIPS64:
* Fixed #28 a wrong result of dsdot on Loongson3A/MIPS64.
6 changes: 5 additions & 1 deletion interface/axpy.c
@@ -85,7 +85,11 @@ void CNAME(blasint n, FLOAT alpha, FLOAT *x, blasint incx, FLOAT *y, blasint inc
  //In that case, the threads would be dependent.
  if (incx == 0 || incy == 0)
    nthreads = 1;

  //Temporarily work around the low performance issue with small input size & multiple threads.
  if (n <= 10000)
    nthreads = 1;

  if (nthreads == 1) {
#endif
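
The thread-count decision in this hunk can be sketched in isolation. The following is a minimal illustration, not the library's actual code: `choose_nthreads`, `axpy_serial`, and the `AXPY_SMALL_N` macro are hypothetical names introduced here, and the cutoff simply mirrors the literal `10000` from the patch. It assumes `double` data and non-negative strides.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical name for the cutoff; the value mirrors the literal in the patch. */
#define AXPY_SMALL_N 10000

/* Hypothetical helper: returns the number of threads to use for an axpy call.
 * Falls back to one thread when a stride is zero (the threads would be
 * dependent) or when n is small enough that threading overhead is likely
 * to outweigh the work. */
static int choose_nthreads(long n, long incx, long incy, int available)
{
    assert(available >= 1);
    if (incx == 0 || incy == 0)
        return 1;
    if (n <= AXPY_SMALL_N)
        return 1;
    return available;
}

/* Single-threaded reference axpy: y[i*incy] += alpha * x[i*incx].
 * Assumes incx >= 0 and incy >= 0 for simplicity. */
static void axpy_serial(long n, double alpha,
                        const double *x, long incx,
                        double *y, long incy)
{
    for (long i = 0; i < n; i++)
        y[i * incy] += alpha * x[i * incx];
}
```

With this sketch, a call such as `choose_nthreads(10000, 1, 1, 8)` takes the single-threaded path, while `choose_nthreads(10001, 1, 1, 8)` keeps all 8 threads. A fixed cutoff is a blunt instrument, since the break-even point depends on the machine, which is consistent with the commit describing the change as temporary.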
