Reimplement matrix_integer_dense using FLINT #16803

mmasdeu · 2014-08-12T16:39:14Z

In this ticket we reimplement all of matrix_integer_dense using FLINT fmpz_mat_t.

The speed-up is substantial. With the new code:

sage: A = Matrix(ZZ,1000,1000,range(10^6))
sage: %time B = A*A
CPU times: user 883 ms, sys: 3.33 ms, total: 887 ms
Wall time: 888 ms

This takes more than 1 minute in the same computer using Sage 6.3.

CC: @pjbruin

Component: linear algebra

Keywords: flint, matrix

Author: Marc Masdeu

Branch: 812a509

Reviewer: William Stein, Jeroen Demeyer

Issue created by migration from https://trac.sagemath.org/ticket/16803

The text was updated successfully, but these errors were encountered:

mmasdeu · 2014-08-12T17:02:16Z

Branch: u/mmasdeu/16803-matrix_integer_dense_flint

mmasdeu · 2014-08-12T17:02:16Z

Commit: a51a2fb

mmasdeu · 2014-08-12T18:21:58Z

comment:2

By the way, after rebasing it does not compile. I will fix it tomorrow hopefully.

fredrik-johansson · 2014-08-12T22:06:35Z

comment:3

A word of caution about algorithm defaults: FLINT should generally be faster for small input but IML/Linbox etc. probably have much better code for things like nullspace, charpoly... as soon as the matrices start to get large.

Replacing _rational_kernel_iml with _rational_kernel_flint, for example, is strange. Surely both algorithms should be available.

sagetrac-git · 2014-08-15T18:11:38Z

Branch pushed to git repo; I updated commit sha1. New commits:

`f1862b3`	`Trac 16803: Fixed compilation problems.`
`d180ee2`	`Trac 16803: Finished re-adding legacy functionality.`

sagetrac-git · 2014-08-15T18:11:38Z

Changed commit from a51a2fb to d180ee2

mmasdeu · 2014-08-15T18:23:45Z

comment:5

I rewrote the code to have all the previous functionality available. The code compiles and passes all the doctests in the matrix/ folder. I will have it run all the doctests during the weekend.

sagetrac-git · 2014-08-19T17:11:44Z

Changed commit from d180ee2 to e26c347

sagetrac-git · 2014-08-19T17:11:44Z

Branch pushed to git repo; I updated commit sha1. New commits:

`e26c347`	`Trac 16803: Modified to pass doctests from outside matrix folder.`

sagetrac-git · 2014-08-21T14:07:55Z

Changed commit from e26c347 to f2ac03c

sagetrac-git · 2014-08-21T14:07:55Z

Branch pushed to git repo; I updated commit sha1. New commits:

`a6e9a3d`	`Moved ntl dependency inside the pyx file.`
`f2ac03c`	`Trac 16803: Added sig_on/off, changed some defaults after benchmarking.`

sagetrac-git · 2014-08-22T12:54:44Z

Branch pushed to git repo; I updated commit sha1. New commits:

`888451b`	`Trac 16803: Fixed a missing sig_off() and doctests.`
`166bae5`	`All doctests pass, documentation builds well also.`

sagetrac-git · 2014-08-22T12:54:44Z

Changed commit from f2ac03c to 166bae5

williamstein · 2014-08-29T07:41:03Z

comment:9

Hi,

A first remark playing around with timings is that in sage-6.3 somebody or something has definitely completely massively screwed up Linbox, or how Linbox is built, or how we use linbox, which was the default representation and library for arithmetic. Consider this:

sage: n=200; set_random_seed(0); a = random_matrix(ZZ,n); b = random_matrix(ZZ,n)
sage: %time c=a._multiply_linbox(b)
CPU times: user 326 ms, sys: 3.63 ms, total: 330 ms
Wall time: 347 ms
sage: %time c=a._multiply_multi_modular(b)
CPU times: user 51.5 ms, sys: 1.6 ms, total: 53.1 ms
Wall time: 56.1 ms
sage: n=400; set_random_seed(0); a = random_matrix(ZZ,n); b = random_matrix(ZZ,n)
sage: %time c=a._multiply_linbox(b)
CPU times: user 5.28 s, sys: 22.8 ms, total: 5.3 s
Wall time: 5.37 s
sage: %time c=a._multiply_multi_modular(b)
CPU times: user 230 ms, sys: 6.08 ms, total: 236 ms
Wall time: 248 ms

Linbox should easily beat the multiply_multi_modular approach -- that's just basically a naive version of the same thing linbox would be doing if it were properly built and linked. Instead, Linbox is massively slower. In fact, almost exactly as much slower as I would expect if the wrong BLAS were being linked. Note that I tested against the new flint backend and now a._multiply_multi_modular(b) is asymptotically similar to flint, though asymptotically (for n=2000 already), they are pretty close to the same in speed (8.55s versus 11.4s). In any case, somebody should really look into why linbox is so screwed up -- we switched to it initially since it was much better than doing a._multiply_multi_modular(b) and also our own strassen implementation.

williamstein · 2014-08-29T16:15:20Z

comment:10

OK, I've been skeptically looking at and testing this patch much of today. I made a few tiny changes, but frankly I think this is overall AWESOME. FLINT's linear algebra seems quite good.

The following three methods are now still gone. Add them back so I can run some randomized consistency tests more easily...

     a._multiply_classical      a._multiply_linbox         a._multiply_multi_modular

When these are added back, and I run tests of correctness by comparing with them, then I'll give this an enthusiastic positive review.

sagetrac-git · 2014-09-01T08:34:33Z

Branch pushed to git repo; I updated commit sha1. New commits:

`7f4f903`	`Merge branch 'develop' into matflint`
`5f21e3d`	`Minor changes during review. Put back old multiplication functionality.`

sagetrac-git · 2014-09-01T08:34:33Z

Changed commit from 166bae5 to 5f21e3d

sagetrac-git · 2014-09-01T10:55:01Z

Changed commit from 5f21e3d to a13b79c

sagetrac-git · 2014-09-01T10:55:01Z

Branch pushed to git repo; I updated commit sha1. New commits:

`a13b79c`	`Fixed errors, all doctests pass now.`

mmasdeu · 2014-09-01T10:59:37Z

comment:15

I have added the three functions back, and run the long tests in sage/matrix/.

williamstein · 2014-09-01T11:13:39Z

comment:16

MAJOR PROBLEM! Now that multiply_multi_modular is back, I wrote code to do some random testing of correctness. I did not find situations where things are wrong... however, I quickly ran into major trouble because this ticket introduces a huge memory leak. I just defined two matrices and multiplied them a few hundred times, and had major ram problems. In particular, in sage-6.3 (without this branch):

sage: get_memory_usage()
1047.1875
sage: for i in range(50):
....:         a=random_matrix(ZZ,300); del a
....: 
sage: get_memory_usage()
1047.1875

With this ticket:

sage: get_memory_usage()
2132.09765625
sage: for i in range(50):
    a=random_matrix(ZZ,300); del a
....: 
sage: get_memory_usage()
2231.3203125

Conclusion: making nontrivial use of the code on this ticket will very quickly use up all RAM on your computer.

So... needs work until that is fixed.

mmasdeu · 2014-09-19T08:50:30Z

comment:77

Done.

vbraun · 2014-09-19T10:41:54Z

comment:79

Linear algebra should really give the right answer to a few ulp, and not just 1e-10. But the test is about testing that the division operator works, so ok.

vbraun · 2014-09-19T15:08:52Z

Changed branch from u/mmasdeu/16803-matflint-4 to 812a509

jdemeyer · 2014-10-02T14:00:51Z

Changed commit from 812a509 to none

jdemeyer · 2014-10-02T14:00:51Z

comment:81

Follow-ups: #17080 and #17090.

jdemeyer · 2014-10-02T18:09:36Z

comment:82

Did any of the reviewers (Volker?) actually read the code on this ticket? I've started doing this and there are many things to be improved... perhaps this ticket was merged too fast.

vbraun · 2014-10-02T18:17:31Z

comment:83

Is it perfect? No. But better than before? IMHO yes. Faster? For sure.

williamstein · 2014-10-02T18:18:27Z

comment:84

Replying to @jdemeyer:

Did any of the reviewers (Volker?) actually read the code on this ticket? I've started doing this and there are many things to be improved...

I certainly read the version of the code 5 weeks ago, and had many, many, many suggestions for improvement, found numerous major bugs, etc.

perhaps this ticket was merged too fast.

One has to balance that with the fact that for whatever reason, without this ticket a vast range of linear algebra in Sage is so utterly orders of magnitude too slow, as to make it painful and useless for research work. This new code is dramatically faster than before.

jdemeyer · 2014-10-02T18:20:36Z

comment:85

Replying to @williamstein:

This new code is dramatically faster than before.

I only hope we're not trading correctness for speed.

williamstein · 2014-10-02T18:25:37Z

comment:86

Replying to @jdemeyer:

Replying to @williamstein:

This new code is dramatically faster than before.

I only hope we're not trading correctness for speed.

Well when I looked over the code 5 weeks ago there are numerous memory leaks and other causes for concern (e.g., unitialized memory assumed to be 0). But those were I think all addressed. Did you find anything that is actually incorrect? Or does the overall code just make you unhappy and nervous? Or are you not happy building on FLINT?

williamstein · 2014-10-02T18:36:29Z

comment:87

I just have to add jdemeyer that I'm really glad you're looking at this too, since you have an amazing eye for quality (similar to your recent remarks about the pexpect interfaces).

jdemeyer · 2014-10-02T20:29:51Z

comment:88

Replying to @williamstein:

Did you find anything that is actually incorrect?

The segmentation fault in _lmul_, which is the reason for creating #17090.

Or does the overall code just make you unhappy and nervous?

When looking at this code, there are some sloppy things (like why is the _det_pari() function duplicated?). Overall, I have less faith in sloppy code than clear code.

Or are you not happy building on FLINT?

That's not the problem. In general, I think it's good to rely on external packages if they do a good job (which seems to be the case here).

jdemeyer · 2014-10-02T21:30:51Z

comment:90

What's with the changes in padics in this ticket? Surely, yet another mistake...

jdemeyer · 2014-10-03T09:03:59Z

comment:91

Yet another follow-up: #17094

mmasdeu · 2014-10-03T09:46:38Z

comment:92

I must say that IMO some of the comments (82 and 90) made above are not very constructive. jdemeyer questions the thoroughness of the reviewing process, while clearly he himself didn't read the comments at the beginning of the ticket (at least those that refer to the convenience of leaving in the legacy functions).

After the experience with this ticket I feel less inclined to take on the analogous job for matrix_rational_dense. Sorry!

jdemeyer · 2014-10-03T10:36:09Z

comment:93

Replying to @mmasdeu:

I must say that IMO some of the comments (82 and 90) made above are not very constructive.

[comment:82] was made out of frustration by seeing so many things here which are dubious. So okay, that's not constructive.

However in [comment:90], I noticed a mistake, mentioned it in the comment and fixed it in #17090. How is that not constructive?

while clearly he himself didn't read the comments at the beginning of the ticket (at least those that refer to the convenience of leaving in the legacy functions).

That's true, but after William Stein's comment, I have fixed that back in #17090.

I feel less inclined to take on the analogous job for matrix_rational_dense.

Please don't. I think it's absolutely good to improve matrices like you did here. The fact that I am making so many comments and creating follow-up tickets means that I care. I make mistakes, you make mistakes, that's normal and that's why have this review process.

My only complaint (and I still stand behind this) is that this ticket was given positive_review too soon. Ideally, those 3 follow-up tickets should have been reviewer patches on this ticket.

mmasdeu · 2014-10-03T11:10:42Z

comment:94

Replying to @jdemeyer:

[comment:82] was made out of frustration by seeing so many things here which are dubious. So okay, that's not constructive.

I can understand your frustration. But the bar is high enough (python + cython + sage oddities + legacy code + git + trac, and I am probably missing something here) that makes it very hard for people to contribute, and these comments do not help. But I take your last comment in good faith :-).

Also, if there are other things that are dubious please share them because I (and possibly others!) would like to learn from it.

However in [comment:90], I noticed a mistake, mentioned it in the comment and fixed it in #17090. How is that not constructive?

Surely I did not go and touch p-adics for no reason. These changes weren't there in the first version of the my patch, and I had to add them because during the time it took to from the first version until this ticket got closed other changes were being merged, and these made the patch to stop compiling. Changing those lines in p-adics is how I managed to fix it, but the hackish idea was not mine, I saw it used elsewhere. I appreciate you having fixed it back.

jdemeyer · 2014-10-03T11:29:58Z

comment:95

Replying to @mmasdeu:

Surely I did not go and touch p-adics for no reason. These changes weren't there in the first version of the my patch, and I had to add them because during the time it took to from the first version until this ticket got closed other changes were being merged, and these made the patch to stop compiling.

I think you simply did something compiling Sage, because those changes to p-adics are totally unrelated to matrices. I cannot imagine how the change to matrices could break building p-adics or vice versa. And indeed, I simply undid the changes in #17090 without any bad consequences.

williamstein · 2014-10-03T15:26:00Z

comment:96

Replying to @mmasdeu:

Replying to @jdemeyer:

[comment:82] was made out of frustration by seeing so many things here which are dubious. So okay, that's not constructive.

I can understand your frustration. But the bar is high enough (python + cython + sage oddities + legacy code + git + trac, and I am probably missing something here) that makes it very hard for people to contribute, and these comments do not help. But I take your last comment in good faith :-).

Marc -- don't you want to be a bad ass computer programmer, in addition to mathematician? Just redo matrix_rational_dense, and you'll be way closer :-)

jdemeyer · 2014-10-03T19:23:34Z

comment:97

Replying to @mmasdeu:

Also, if there are other things that are dubious please share them because I (and possibly others!) would like to learn from it.

Well, you can look at the follow-up tickets #17090 and #17094 and see what I changed (and preferably review those tickets).

Some general things which haven't been mentioned yet:

clean-up your code when you change things: when you remove code, try to remove everything related to the old code. When you change code, the old structure might not be applicable anymore, so refactor if needed (e.g. the sepatate branch for n <= 4 in determinant() no longer serves any purpose)
also fix documentation: when you change something, also change the corresponding documentation. When you copy/paste something, be sure to update documentation and doctests (see the various set_unsafe methods).
don't add unneeded import statements or includes, those add needless extra dependencies. Like, why did you add include "sage/libs/ntl/decl.pxi" in several places when you don't use any NTL code?
try to use correct formatting: spacing, indentation, docstring formatting (this was mostly good, but a bit messed up in src/module_list.py). If anything, this makes your code look more professional and less sloppy.
don't use assert for checking user input: use assert for when you know that something is true (but want to check just in case), not when you need to check that something is true.

mmasdeu · 2014-10-03T20:00:29Z

comment:98

Replying to @jdemeyer:

Thanks! These comments are helpful indeed.

William -- I rather try to be a better mathematician...it has the advantage that although you get constantly evaluated, the evaluations are mostly anonymous :-). But that was in my to-do list. And it still is, despite my previous comments in this ticket!

mmasdeu added this to the sage-6.4 milestone Aug 12, 2014

mmasdeu added c: linear algebra labels Aug 12, 2014

williamstein added s: needs review and removed s: needs review labels Aug 29, 2014

mmasdeu added s: needs review and removed s: needs work labels Sep 1, 2014

mmasdeu added s: needs review and removed s: needs work labels Sep 19, 2014

vbraun added s: positive review and removed s: needs review labels Sep 19, 2014

vbraun removed the s: positive review label Sep 19, 2014

vbraun closed this as completed in 1f70c18 Sep 19, 2014

This was referenced Oct 2, 2014

Bad compile flags for sage/matrix/matrix_modn_dense_double.pyx #17080

Closed

Reviewer patch for #16803 #17090

Closed

Fix memleaks and brokenness of non-FLINT methods after #16803 #17094

Closed

Reimplement matrix_integer_dense using FLINT #16803

Reimplement matrix_integer_dense using FLINT #16803

Comments

mmasdeu commented Aug 12, 2014

mmasdeu commented Aug 12, 2014

mmasdeu commented Aug 12, 2014

mmasdeu commented Aug 12, 2014

fredrik-johansson commented Aug 12, 2014

sagetrac-git mannequin commented Aug 15, 2014

sagetrac-git mannequin commented Aug 15, 2014

mmasdeu commented Aug 15, 2014

sagetrac-git mannequin commented Aug 19, 2014

sagetrac-git mannequin commented Aug 19, 2014

sagetrac-git mannequin commented Aug 21, 2014

sagetrac-git mannequin commented Aug 21, 2014

sagetrac-git mannequin commented Aug 22, 2014

sagetrac-git mannequin commented Aug 22, 2014

williamstein commented Aug 29, 2014

williamstein commented Aug 29, 2014

sagetrac-git mannequin commented Sep 1, 2014

sagetrac-git mannequin commented Sep 1, 2014

sagetrac-git mannequin commented Sep 1, 2014

sagetrac-git mannequin commented Sep 1, 2014

mmasdeu commented Sep 1, 2014

williamstein commented Sep 1, 2014

mmasdeu commented Sep 19, 2014

vbraun commented Sep 19, 2014

vbraun commented Sep 19, 2014

jdemeyer commented Oct 2, 2014

jdemeyer commented Oct 2, 2014

jdemeyer commented Oct 2, 2014

vbraun commented Oct 2, 2014

williamstein commented Oct 2, 2014

jdemeyer commented Oct 2, 2014

williamstein commented Oct 2, 2014

williamstein commented Oct 2, 2014

jdemeyer commented Oct 2, 2014

jdemeyer commented Oct 2, 2014

jdemeyer commented Oct 3, 2014

mmasdeu commented Oct 3, 2014

jdemeyer commented Oct 3, 2014

mmasdeu commented Oct 3, 2014

jdemeyer commented Oct 3, 2014

williamstein commented Oct 3, 2014

jdemeyer commented Oct 3, 2014

mmasdeu commented Oct 3, 2014