
numpy.dot out of memory when multiplying big matrix #4062

Closed
grfo opened this issue Nov 19, 2013 · 15 comments

grfo commented Nov 19, 2013

I have a 2000 by 1,000,000 matrix A and want to calculate the 2000 by 2000 matrix

    B = numpy.dot(A, A.T)

but numpy just eats up all my memory, slows down my whole computer and crashes after a couple of hours.

I then rewrote the matrix multiplication to

    B = numpy.zeros((2000, 2000))
    A.shape = (2000, 10000, 100)
    for M in numpy.rollaxis(A, 2):
        B += numpy.dot(M, M.T)

and it just runs fine in a couple of minutes.

I don't see why numpy needs so much memory for a matrix multiplication, but at the very least it should not crash on this problem.
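
A more general form of that blocked workaround, sketched here as an illustration (the helper name gram_blocked and the block size are arbitrary choices, not from this report), slices the long axis directly so its length does not need to factor evenly:

    import numpy as np

    def gram_blocked(A, block=100_000):
        """Accumulate A @ A.T over column blocks of A.

        Any temporaries created along the way are at most a couple of
        (rows x block) slices rather than a full copy of A.
        """
        rows, cols = A.shape
        B = np.zeros((rows, rows), dtype=A.dtype)
        for start in range(0, cols, block):
            M = A[:, start:start + block]   # a view, no copy of A itself
            B += np.dot(M, M.T)
        return B

With the default block of 100,000 columns, each temporary for the 2000 x 1,000,000 case is about 1.6 GB instead of a full ~16 GB duplicate of A; a smaller block shrinks that further.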

seberg (Member) commented Nov 19, 2013

I have to ask: are you sure you didn't transpose the wrong argument, making the result 1,000,000 x 1,000,000 instead of 2000 x 2000?

seberg (Member) commented Nov 19, 2013

Ah, sorry, you have a point. The thing is that np.dot(A, A.T) is, I think, not a valid BLAS operation as given, because the memory orders of A and A.T do not match, so one of them needs to be copied. That single copy (effectively A.T.copy() instead of A.T) might already be eating your RAM.

If this is not the problem, check which BLAS you are linking against; some OpenBLAS versions are apparently known to misbehave for large arrays.
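
As a small illustration of the layout mismatch described above (the tiny array is just a stand-in; the flags behave the same for a 2000 x 1,000,000 array):

    import numpy as np

    A = np.ones((4, 6))   # C-ordered, like an array freshly created by numpy
    print(A.flags['C_CONTIGUOUS'], A.flags['F_CONTIGUOUS'])      # True False
    print(A.T.flags['C_CONTIGUOUS'], A.T.flags['F_CONTIGUOUS'])  # False True
    # np.dot(A, A.T) mixes a C-ordered and an F-ordered operand, which is
    # what can force numpy (before 1.8) to make a full copy of one of them.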

grfo (Author) commented Nov 19, 2013

That would explain the situation; the matrix A does indeed use over half of my memory.

I would still put this on the wishlist: np.dot should divide a problem into subproblems if it cannot handle it all at once. This should be easy to fix.

Thanks!

nouiz (Contributor) commented Nov 19, 2013

Hi,

NumPy does not do the minimum amount of copying. For example, it was copying C-contiguous inputs right up until the new 1.8 release. Which version of numpy are you using? If you update to 1.8, I think it will fix your problem, provided A is C contiguous.

If A is not C contiguous, the current version of numpy will copy it (and its transpose). If one of the strides equals the itemsize, numpy could be modified to avoid that copy.
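
The stride condition mentioned here can be seen on a small example (hypothetical shape, same idea as the big array):

    import numpy as np

    A = np.ones((4, 6))          # float64, so itemsize is 8 bytes
    print(A.itemsize)            # 8
    print(A.strides)             # (48, 8)  -> last stride equals itemsize
    print(A.T.strides)           # (8, 48)  -> first stride equals itemsize
    # Either layout can be described to BLAS via its leading-dimension
    # ("ld") argument, so in principle neither A nor A.T needs a copy.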

grfo (Author) commented Nov 19, 2013

I'm using 1.7.1, but unfortunately I'm not allowed to upgrade to 1.8 on this machine. Still, it's good to see that this should be better in 1.8.

I would like to see numpy better prepared for big data. I can work out a solution of my own, but I doubt I'm the only one running into this kind of trouble.

seberg (Member) commented Nov 19, 2013

Yeah, 1.8 has fixed this exact thing. The ld(a|b) parameter (in numpy's naming, at least) can handle non-contiguous arrays as long as one dimension has unit stride, right? I think that would definitely make sense to implement in numpy, and it would not make the code much more complicated.

juliantaylor (Contributor) commented:

#3916 might solve this problem

juliantaylor (Contributor) commented:

No, that PR does not prevent the large copy, but the chunking could be moved to a higher level to make smaller copies.

larsmans (Contributor) commented:

#3916 only affects vector dot products, but 14153dc should make it a bit easier to improve the way GEMM/GEMV is called.

charris (Member) commented Feb 24, 2014

@seberg Sounds like this is fixed in 1.8. True?

seberg (Member) commented Feb 24, 2014

We still make copies in cases where BLAS could, in principle, do without them, I think. But yes, this specific operation has not needed a copy since 1.8.

charris (Member) commented Feb 24, 2014

I'll close this then. If dot can still be improved in that respect, we might open a task issue for it.

charris closed this as completed on Feb 24, 2014

Mullahz commented Apr 9, 2021

I am getting an out-of-memory crash while executing the following code on a Linux server.

d is a matrix of size 1024 x 50625, and I need to compute

    dTd = np.matmul(d.T, d)

charris (Member) commented Apr 9, 2021

You need roughly 20 GB (about 19 GiB) just for the result. How much memory do you have?
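
For reference, the arithmetic behind that estimate, followed by a hedged sketch of one way to keep peak RAM down by writing the result to a disk-backed array block by block (the file name, block size, and random d are placeholders, not from this thread):

    import numpy as np

    n = 50625
    print(n * n * 8 / 2**30)   # ~19.1 GiB (~20.5 GB) for the float64 result alone

    # Sketch: accumulate d.T @ d into a memory-mapped .npy file, one column
    # block at a time, assuming d has shape (1024, 50625) and dtype float64.
    d = np.random.rand(1024, n)
    dTd = np.lib.format.open_memmap('dTd.npy', mode='w+',
                                    dtype=np.float64, shape=(n, n))
    block = 4096
    for start in range(0, n, block):
        stop = min(start + block, n)
        dTd[:, start:stop] = d.T @ d[:, start:stop]
    dTd.flush()

Each block still needs a roughly 1.7 GB temporary in RAM, but the full ~19 GiB result only ever lives on disk.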


Mullahz commented Apr 9, 2021 via email
