Add some bugfixes to the PARI package #10430

jdemeyer · 2010-12-04T23:21:09Z

We should add bugfixes for

http://pari.math.u-bordeaux.fr/cgi-bin/bugreport.cgi?bug=1132 (see Bug in factor of polynomials over number fields #10279)
http://pari.math.u-bordeaux.fr/cgi-bin/bugreport.cgi?bug=1144 (see Add interface to PARI's rnfisnorm() #2329)
http://pari.math.u-bordeaux.fr/cgi-bin/bugreport.cgi?bug=1143 (see Add interface to PARI's rnfisnorm() #2329)
http://pari.math.u-bordeaux.fr/cgi-bin/bugreport.cgi?bug=1084 (see conflicting branch cut conventions #9620)
http://pari.math.u-bordeaux.fr/cgi-bin/bugreport.cgi?bug=1141 (see Yet another bug in factorization over number fields #10369)
path to perl hardcoded in gphelp (GP/PARI) #10559: path to perl hardcoded in gphelp (GP/PARI)

New spkg: http://sage.math.washington.edu/home/jdemeyer/spkg/pari-2.4.3.alpha.p5.spkg

CC: @sagetrac-drkirkby

Component: packages: standard

Keywords: pari spkg bugs patches

Author: Jeroen Demeyer

Reviewer: Leif Leonhardy, Volker Braun

Merged: sage-4.6.2.alpha2

Issue created by migration from https://trac.sagemath.org/ticket/10430

nexttime · 2010-12-05T04:54:56Z

comment:2

Perhaps we should really also address #10120, as more systems than originally reported seem to be affected, i.e. reduce (perhaps partially) optimization to -O1 to work around obvious bugs in GCC 4.4.1 on these platforms.

Did someone report this to the PARI guys? Perhaps they could provide a patch such that we don't have to maintain it (that selectively changes the compiler flags for only some files).

Unfortunately(?), not all people building on e.g. openSUSE 11.2 run into these problems, apparently.

jdemeyer · 2010-12-05T09:09:01Z

comment:3

Replying to @nexttime:

Perhaps we should really also address #10120, as more systems than originally reported seem to be affected, i.e. reduce (perhaps partially) optimization to -O1 to work around obvious bugs in GCC 4.4.1 on these platforms.

Here's an idea: we first try to build with -O3 and when that doesn't work, fall back to -O2, then -O1, then -O0.

This way we don't have to find out exactly which versions of gcc are broken.

I think reporting this to PARI is pointless, because they can't help (and probably won't care about) a broken gcc.

nexttime · 2010-12-05T14:28:14Z

comment:4

Replying to @jdemeyer:

Replying to @nexttime:

Perhaps we should really also address #10120, as more systems than originally reported seem to be affected, i.e. reduce (perhaps partially) optimization to -O1 to work around obvious bugs in GCC 4.4.1 on these platforms.

Here's an idea: we first try to build with -O3 and when that doesn't work, fall back to -O2, then -O1, then -O0.

Koen reported:
"For reference: OpenSuse? 11.2 (gcc (SUSE Linux) 4.4.1 [gcc-4_4-branch revision 150839]) has the same problem when building PARI: on a machine with 64GB of RAM, it eventually fails after all memory is exhausted (takes hours). [...]"

So I don't think that's the way to go. (Other machines might start swapping, which effectively "freezes" some systems.)

Or should we do something like

    (ulimit -St 900; $MAKE) # Which value is appropriate?

?

I think reporting this to PARI is pointless, because they can't help (and probably won't care about) a broken gcc.

They at least perhaps have better experience which files are most likely to trigger failures due to GCC bugs.

jdemeyer · 2010-12-05T14:32:05Z

comment:5

Replying to @nexttime:
Or should we do something like

    (ulimit -St 900; $MAKE) # Which value is appropriate?

?

How about ulimiting the memory?

nexttime · 2010-12-05T14:49:19Z

comment:6

Replying to @jdemeyer:

Replying to @nexttime:
Or should we do something like

    (ulimit -St 900; $MAKE) # Which value is appropriate?

?

How about ulimiting the memory?

Much harder to estimate, isn't it? (Feel free to test out adequate values, with -O3 etc.; perhaps something Dave likes...)

Ok, if a process starts thrashing, it won't consume much (user) CPU time as well.

nexttime · 2010-12-05T14:49:19Z

Changed keywords from pari spkg to pari spkg bugs patches

sagetrac-drkirkby · 2010-12-05T15:01:47Z

comment:7

I don't think we should be changing ulimit. Sage used to unset it at one point, and that was changed in a trac ticket.

Changing it could cause all sorts of problems for someone. If Sage fails with the limit they set, then tough - they set the limit.

Once we start changing limits, we could cause other proceses to fail, which might be more important to someone.

Dave

jdemeyer · 2010-12-05T15:07:15Z

comment:8

David: we could check the current value of ulimit -v to make sure we are only decreasing the value, not increasing.

I quickly tested ulimit -v on a few systems, this is what I found for the minimal power of 2 for ulimit -v to have a successful build of the pari spkg:

Gentoo Linux, kernel 2.6.32, x86_64, gcc 4.6.0: 128 MB
Ubuntu Linux 8.04.4 LTS, kernel 2.6.24, x86_64, gcc 4.5.1: 128 MB
Mac OS X 10.4 PPC, gcc 4.0.1: ulimit -v doesn't seem to work

nexttime · 2010-12-05T16:04:09Z

comment:9

Replying to @sagetrac-drkirkby:

I don't think we should be changing ulimit. Sage used to unset it at one point, and that was changed in a trac ticket.

Changing it could cause all sorts of problems for someone. If Sage fails with the limit they set, then tough - they set the limit.

Once we start changing limits, we could cause other proceses to fail, which might be more important to someone.

We would only set limits in (PARI's) spkg-install.

Note that ulimit only affects the current process and its subprocesses (i.e. gets inherited), therefore I also used the parentheses in the example above.

ulimit is (also) a bash built-in btw. We could also limit its use to Linux.

And ordinary users (i.e., their processes) cannot increase limits once they are set.

nexttime · 2010-12-05T16:22:02Z

comment:10

P.S.:

If we do "trial building" with some limit(s), we should also make sure that the build actually failed due to a resource limit before retrying with less optimization, e.g. check that the exit code was 152 (SIGXCPU + 128) if we use a CPU time limit.

nexttime · 2010-12-05T16:31:08Z

comment:11

Replying to @nexttime:

P.S.:

If we do "trial building" with some limit(s), we should also make sure that the build actually failed due to a resource limit before retrying with less optimization, e.g. check that the exit code was 152 (SIGXCPU + 128) if we use a CPU time limit.

With ulimit -v I receive SIGKILL on exhausted memory, which isn't very specific...

jdemeyer · 2010-12-05T16:41:00Z

comment:12

Replying to @nexttime:

P.S.:

If we do "trial building" with some limit(s), we should also make sure that the build actually failed due to a resource limit before retrying with less optimization

The build could fail for many various reasons, including but not limited to allocating too much memory. There are various other tickets where a PARI build fails because of a broken gcc. All these should be caught, not only the cases where we run out of memory.

nexttime · 2010-12-05T17:18:52Z

comment:13

Replying to @jdemeyer:

Replying to @nexttime:

P.S.:

If we do "trial building" with some limit(s), we should also make sure that the build actually failed due to a resource limit before retrying with less optimization

The build could fail for many various reasons, including but not limited to allocating too much memory. There are various other tickets where a PARI build fails because of a broken gcc. All these should be caught, not only the cases where we run out of memory.

Of course.

I wonder if we then would get PARI build errors due to GCC bugs reported any longer... ;-)

nexttime · 2010-12-06T06:38:11Z

comment:14

Got one more PARI flaw:

It installs three real copies of the shared library rather than one with two symbolic links to it.

Currently not sure if (but I believe) that's an upstream matter, or if we do that.

jdemeyer · 2010-12-09T21:49:23Z

Author: Jeroen Demeyer

jdemeyer · 2010-12-10T08:57:46Z

comment:16

Very preliminary spkg: http://sage.math.washington.edu/home/jdemeyer/spkg/pari-2.4.3.alpha.p1.spkg (not yet tested properly)

jdemeyer · 2010-12-10T09:58:54Z

Doctest fixes

jdemeyer · 2010-12-10T12:45:33Z

Attachment: 10430_branch_cut.patch.gz

spkg patch for reference

jdemeyer · 2010-12-10T13:54:10Z

comment:17

Attachment: pari-2.4.3.alpha.p1.diff.gz

jdemeyer · 2010-12-27T11:53:15Z

comment:49

Replying to @sagetrac-drkirkby:

One would need to ascertain if this is a gcc bug or a Pari bug. Badly written code will cause more aggressive optimisations to fail. It does not necessary mean it is a compiler bug.

Badly written code should never cause the compiler to crash or to use infinite memory. These things are certainly compiler bugs.

jdemeyer · 2011-01-12T00:54:57Z

comment:52

I removed all the trial building code from the spkg in light of #10572. Also added patches for #10559. New spkg needs review.

vbraun · 2011-01-12T01:51:05Z

comment:53

I'm happy with the current version, so I'll give this ticket a positive review. If any compiler bugs are still preventing pari from being built on some hardware then this should be reported to the gcc wrapper.

vbraun · 2011-01-12T01:51:05Z

Changed reviewer from Leif Leonhardy to Leif Leonhardy, Volker Braun

jdemeyer · 2011-01-19T22:23:17Z

Merged: sage-4.6.2.alpha1

jdemeyer · 2011-01-20T09:08:49Z

comment:55

Testing on the buildbot seems to indicate there might still be some race conditions in parallel make install. So maybe we should avoid doing that.

jdemeyer · 2011-01-20T09:08:49Z

Changed merged from sage-4.6.2.alpha1 to none

jdemeyer · 2011-01-23T14:25:06Z

comment:56

Fixed race conditions in make install by using -j1.

vbraun · 2011-01-23T19:12:45Z

comment:57

That'll get rid of potential races in installation. Perhaps we should disable parallel make for all spkgs that don't use a proven build system like autotools or SCons. Chances are that any hand-rolled makefile has concurrency issues...

I'll take it that you are going to commit the changes to the included repository before adding the spkg to the next Sage release, because right now they are not.

jdemeyer · 2011-01-23T19:17:21Z

comment:58

Replying to @vbraun:

I'll take it that you are going to commit the changes to the included repository before adding the spkg to the next Sage release, because right now they are not.

Yes, done.

jdemeyer · 2011-01-25T08:15:49Z

Merged: sage-4.6.2.alpha2

jdemeyer added this to the sage-4.6.2 milestone Dec 4, 2010

jdemeyer added c: packages: standard labels Dec 4, 2010