New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Random failure in gf2x testsuite #18882
Comments
comment:1
What does "retrying" refer to? Does it pass if you just rerun the test suite (as opposed to rebuilding and retuning in advance)? |
comment:2
rebuild; this particular example was on the buildbot. |
comment:3
This still occurs. It doesn't happen every time, but it happens frequently with OS X, especially on a loaded system, and also sometimes on a less stressed system. |
comment:4
Seems to be happening every time for me when using #12426. Works all right on linux+clang, just OS X seems to be affected here. I wonder if it is hardware configuration related. |
comment:5
OK so with clang I can now reproduce the issue at will on linux (contrary to my last post). After exploring building and testing manually I finally discovered the tuning section of
When
|
comment:6
@ Paul does the testsuite failure above , which is linked to the use of tune-toom, ring any bells? I see some "recent" commit about toom on the git server of inria. |
comment:7
git master for gf2x doesn't seem to exhibit the problem. tune-toom is much faster but I haven't completed testing with tune-fft yet. That tuning is long. |
comment:8
OK git master passes its testsuite in all configurations - including the ones where it is failing in 1.1. |
comment:9
Replying to @kiwifb:
maybe we should make a new release, since we have fixed several bugs in 1.1. |
comment:10
Replying to @zimmermann6:
I can reproduce all that tomorrow and send it to you privately I guess. On my system I can generate the failure with clang (but the failure is not specific to clang since this was spotted before anyone was toying with it) but things work with gcc so I could even give you a good set of toom tuning and a bad one. |
comment:11
my guess is that with clang the basic routines chosen by |
comment:12
Paul found where the error is in the current code and provided the fix. I'll provide a patch against 1.1 shortly. |
Upstream: Fixed upstream, but not in a stable release. |
Commit: |
Author: François Bissey |
Branch: u/fbissey/gf2x_test |
comment:13
Ready for review and I am making this a blocker since it means by default some install are getting a bug - even if we haven't seen a consequence so far apart from that test failure (note that on system where this failure is apparent, the ntl testsuite will also fail). New commits:
|
Reviewer: Volker Braun |
comment:15
On http://build.sagemath.org/#/builders/23/builds/1/steps/5/logs/gf2x
|
comment:20
Replying to @vbraun:
Sure. It would be nice to have them a little bit longer though.
If you still get some of these without the ticket, it is indeed probably a separate issue. But it would have been nice to at least have the result of the low-level tuning |
comment:21
Actually it is impossible to know if this is related to the original issue without knowing the results of the low level tuning. |
comment:22
Incremental build after removing this ticket fails: http://build.sagemath.org/#/builders/66/builds/2 Full rebuild (make distclean) on the same commit ef05f46ddbc6c94cd1e911b0090efdfe8d7ec02f succeeds: http://build.sagemath.org/#/builders/30/builds/1 I saw this on at least two buildslaves; Could be a timing / cpu load issue but suspicious nevertheless. |
comment:23
Nothing much in those linked logs. They look like cmake upgrade logs not gf2x bug fix logs. |
comment:24
There are a number of tickets merged in the branch but it has nothing to do with cmake specifically (which is optional and unused by default). Click on "view all" to see the start of the git log... |
comment:25
OK, not sure what happened the first time I looked at the link
The failure has to be unrelated. The fix I committed was for |
Changed branch from u/fbissey/gf2x_test to |
comment:28
I confirm the issue of comment [comment:15] is unrelated. We are working on the new release. In the meantime I cannot understand how the failure can happen. Apparently |
Changed commit from |
comment:29
From what I can see a lot of people get the new failure. It seems like people who would have gotten However my own tuning with gcc also uses |
comment:30
I believe the new failure is independent from the low-level tuning. However as said in comment [comment:28] I don't understand how it can happen. On a machine where it happens, please can you print the value of |
comment:31
@ Travis: Can you provide the info requested by Paul Zimmermann in the previous comment? Like I said I don't experience it, so I cannot provide that info. |
comment:32
I see this error on Fedora Linux machine, I'll try to see what I can supply. |
comment:33
I've added
and so it did
I don't know what goes wrong---more things to print? |
comment:34
I get similar errors on a |
comment:35
the fix is to replace in
by
which better corresponds to the accompagnying comment. |
comment:36
Hum...doing |
gf2x-1.1.p2.log |
comment:37
Attachment: gf2x-1.1.p2.log I am trying to create the patch:
And as you see from the log, it is getting applied (and I inspect the source after the package building ends in error, I see it applied).
Please advise - it looks as if some weirdness is going on I don't get. |
comment:38
did you run |
comment:39
Replying to @zimmermann6:
|
comment:40
I even tried removing the build directory manually and doing |
comment:41
I am lost here. Could it be that you have a |
comment:42
it was This is something I would call a bug in Anyhow, all good now. |
comment:43
the branch with the fix will be posted on #23225, as this ticket in closed. |
Randomly (rarely), gf2x fails its tests:
Retrying ends successfully:
This is on Linux 64-bit
Upstream: Fixed upstream, but not in a stable release.
CC: @zimmermann6 @tscrim
Component: packages: standard
Author: François Bissey
Branch:
a64f2a5
Reviewer: Volker Braun
Issue created by migration from https://trac.sagemath.org/ticket/18882
The text was updated successfully, but these errors were encountered: