New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Errno 16 / NFS problems with parallel/decorate.py #9616
Comments
comment:1
For what they're worth, tests on sage.math with variations on #!/bin/bash
# This does not keep overall statistics:
# env SAGE_TEST_GLOBAL_ITER=100 ./sage -tp 1 -long /path/to/file.py
SAGE_TEST="./sage -t -long"
#SAGE_TEST="env DOT_SAGE=/dev/shm/$USER/.sage $SAGE_TEST"
#SAGE_TEST="env DOT_SAGE=/scratch/$USER/.sage $SAGE_TEST"
RUNS=100
for I in `seq 1 $RUNS`;
do
$SAGE_TEST devel/sage/sage/parallel/decorate.py
CODE[$I]=$?
echo "Results after $I of $RUNS runs:"
echo "${CODE[*]}" | tr ' ' '\n' | sort -n | uniq -c
done end with
for the default DOT_SAGE, |
comment:2
For now, should we tag the relevant tests with |
comment:3
By the way, here are the latest doctesting exist codes (cf. #9243), from the top of
|
comment:4
According to William on sage-release, the segfault is an intentional part of a doctest, so I've changed the ticket's title. |
Changed keywords from fork nfs segfault to fork nfs device resource busy |
comment:5
Replying to @qed777:
If we backout the whole patch, I have more confidence that the doctests will get fixed quickly. |
Attachment: trac_9616-backout_9501_fork_deco.patch.gz Backout #9501 |
Author: Mitesh Patel |
comment:6
Replying to @jhpalmieri:
Adapting the procedure in this comment at #9583, I've attached a patch that undoes (or should undo) all of #9501. If the patch gets a positive review, we can open a new ticket for re-merging #9501. |
comment:7
Hmmm, of course a simple procedure, but we'd back out too much in my opinion... But I can live with that. (And I'm currently too (laz|bus)y to sort out the desirable parts of the original patch.) |
Attachment: trac_9616-backout_only_some_of_9501.patch.gz Backouts only ticket-relevant parts of #9501 (subset of Mitesh's patch) |
comment:8
Replying to @nexttime:
Couldn't resist though (simpler as expected). Not very tested, only successfully ran So now two concurrent patches to review... ;-) |
comment:9
I've tested mpatel's patch on 5 machines: 4 on which the problem originally occurred (sage.math and skynet machines eno, iras, and taurus) and one machine (running OS X) which didn't have the original problem. After applying the patch, all tests pass for the directory "parallel" on all 5 machines. Long doctests for the whole Sage library pass on sage.math and taurus except for previously known, unrelated, failures. I don't know if I'll get to leif's patch. Since this is a rollback to a previous situation, I think this is good enough for a positive review for mpatel's patch, though. Opinions? |
comment:10
Let the release managers decide... ;-) |
Reviewer: John Palmieri |
comment:11
Replying to @jhpalmieri:
You've done some good testing, and since the original patch was an enhancement, and didn't fix any bugs or failing doctests (right?), I think a positive review is warranted here. |
comment:12
Replying to @dandrake:
Well, I pushed back in mostly fixes (and improvements) to the documentation (one might consider bugfixes, too). |
comment:13
Besides the above mentioned, this would completely miss, too:
|
comment:14
Ooops, forget my last comment: The reset is performed just unconditionally in Mitesh's version. |
comment:15
A partial backout, since it retains only some of the changes from #9501, needs a new review, which currently, at least, we don't have. Given the need to press forward with the 4.5.2 release cycle, I'm merging attachment: trac_9616-backout_9501_fork_deco.patch into 4.5.2.rc0. This may not be an ideal resolution, but it seems reasonable given the circumstances. Absolutely no offense is intended. I've opened #9631 for re-merging #9501 after we fix the NFS/doctest problem. |
Merged: sage-4.5.2.rc0 |
In 4.5.2.alpha1, we have for many people:
and so on. See https://groups.google.com/group/sage-release/msg/88b030aa31926459 and that thread.
This seems related to #9501.
CC: @nexttime @jhpalmieri @kcrisman @malb @sagetrac-mvngu @simon-king-jena @williamstein
Component: doctest coverage
Keywords: fork nfs device resource busy
Author: Mitesh Patel
Reviewer: John Palmieri
Merged: sage-4.5.2.rc0
Issue created by migration from https://trac.sagemath.org/ticket/9616
The text was updated successfully, but these errors were encountered: