HANG in parallel test of examples/chemotaxis/input2D.py on some configurations #264
Comments
|
guyer attached bt1 on 09-14-2011 at 09:23 |
|
guyer attached bt0 on 09-14-2011 at 09:23 |
|
See issue #275 as it happened on bunter, but is no longer occurring on bunter. Trac comment by wd15 on 11-16-2011 at 17:04 |
|
Trac comment by guyer on 12-05-2011 at 14:57 |
|
dogbert and bunter both experience freezes (issue #275), but buildbot shows that [http://build.cmi.kent.edu:8010/builders/Ubuntu-trunk-full/builds/0/steps/trial_2/logs/stdio zizou] and [http://build.cmi.kent.edu:8010/builders/OS%20X-trunk-full/builds/0/steps/trial_2/logs/stdio paco] don't, even though full test suite now being run instead of split into Trac comment by guyer on 12-05-2011 at 15:03 |
|
stripped examples/tests.py@5004 down to def _suite():
return _LateImportTestSuite(testModuleNames = (
# 'diffusion.test',
'chemotaxis.test',
# 'phase.test',
# 'convection.test',
# 'elphf.test',
# 'levelSet.test',
# 'cahnHilliard.test',
# 'flow.test',
# 'meshing.test',
# 'reactiveWetting.test',
# 'riemann.test'
), base = __name__)and fipy/tests.py@5004 down to def _suite():
return _LateImportTestSuite(testModuleNames = (
# 'solvers.test',
# 'models.test',
# 'terms.test',
# 'tools.test',
# 'matrices.test',
# 'meshes.test',
'variables.test',
# 'viewers.test',
# 'boundaryConditions.test',
), base = __name__)and still get the freeze in chemotaxis, whereas [Wheeler's results on bunter](issue #275) seemed more ambigous. Trac comment by guyer on 12-05-2011 at 15:07 |
|
Reduced trunk/fipy/variables/test.py@5004 to def _suite():
return _LateImportDocTestSuite(
docTestModuleNames = (
# 'fipy.variables.variable',
# 'fipy.variables.meshVariable',
'fipy.variables.cellVariable',
# 'fipy.variables.faceVariable',
# 'fipy.variables.operatorVariable',
# 'fipy.variables.betaNoiseVariable',
# 'fipy.variables.exponentialNoiseVariable',
'fipy.variables.gammaNoiseVariable',
# 'fipy.variables.gaussianNoiseVariable',
# 'fipy.variables.uniformNoiseVariable',
# 'fipy.variables.cellVolumeAverageVariable',
# 'fipy.variables.modularVariable',
# 'fipy.variables.binaryOperatorVariable',
# 'fipy.variables.coupledCellVariable',
# 'fipy.variables.cellToFaceVariable',
# 'fipy.variables.faceGradVariable',
# 'fipy.variables.gaussCellGradVariable',
# 'fipy.variables.faceGradContributionsVariable'
))and still get the freeze. Through process of elimination, found that commenting out trunk/fipy/variables/cellVariable.py@5004#L198 was enough to remove the freeze. Restoring all other tests in trunk/fipy/variables/test.py@5004 did not resurrect the freeze, but restoring all tests in trunk/fipy/test.py@5004 did bring the freeze back. Trac comment by guyer on 12-05-2011 at 19:38 |
|
Reducing trunk/fipy/test.py#5004 to def _suite():
return _LateImportTestSuite(testModuleNames = (
# 'solvers.test',
# 'models.test',
# 'terms.test',
# 'tools.test',
'matrices.test',
'meshes.test',
# 'variables.test',
# 'viewers.test',
# 'boundaryConditions.test',
), base = __name__)produces the freeze, but commenting out either Trac comment by guyer on 12-06-2011 at 09:53 |
|
Commenting out just about any one doctest line in trunk/fipy/matrices/trilinosMatrix.py@5004, e.g., trunk/fipy/matrices/trilinosMatrix.py@5004#L193 or trunk/fipy/matrices/trilinosMatrix.py@5004#L194, but not both, trunk/fipy/matrices/trilinosMatrix.py@5004#L229, trunk/fipy/matrices/trilinosMatrix.py@5004#L231, etc. Even trunk/fipy/matrices/trilinosMatrix.py@5004#L918. Sometimes, commenting out an Trac comment by guyer on 12-06-2011 at 10:54 |
|
This also hangs on bunter (issue #275), but apparently only with Trac comment by guyer on 12-08-2011 at 09:43 |
|
deadlock occurs on both dogbert (Mac OS X Snow Leopard) and bunter (Debian squeeze), but not on the buildbot slaves paco (Mac OS X Snow Leopard) and zizou (Ubuntu something). Is this because buildbot slaves are different (stderr and stdout are captured, so different buffering?) or because there's something different about those installations? Trac comment by guyer on 12-08-2011 at 09:59 |
|
Removing from milestone 3.0 This hang is too idiosyncratic and Wheeler and I have both spent too much time trying to isolate and debug it. Trac comment by guyer on 12-09-2011 at 12:45 |
|
Redundant with issue #305 Trac comment by guyer on 01-31-2012 at 13:47 |
mpirun -np 2 python setup.py testhangs indefinitely atDoctest: examples.chemotaxis.input2D ....mpirun -np 2 python examples/chemotaxis/input2D.pyruns successfully.mpirun -np 2 python examples/chemotaxis/test.pyruns successfully.Doctest: examples.chemotaxis.inputsucceeds, but ifexamples/chemotaxis/input2Dis removed from the test suite, thenDoctest: examples.chemotaxis.inputhangs.examples/chemotaxisis removed from the test suite, all other tests run to completion.print >>sys.stderr, iafter trunk/examples/chemotaxis/input.py@4131#L44, after the solve loop, causesDoctest: examples.chemotaxis.inputto hang after 60 to 70 steps. Ditto forinput2D.py.print >>sys.stderr, iat trunk/examples/chemotaxis/input.py@4131#L44, within the solve loop, causesDoctest: examples.chemotaxis.inputto run to completion. Ditto forinput2D.py.Both processes are hung in
Epetra.Mapbut the backtraces https://raw.githubusercontent.com/wd15/fipy-attachments/master/raw-attachment/ticket/360/bt0 and https://raw.githubusercontent.com/wd15/fipy-attachments/master/raw-attachment/ticket/360/bt1 show that they diverge in
Epetra_BlockMap::Epetra_BlockMap(line 264 vs 252) and seem to hang in a race betweenEpetra_MpiComm::GatherAll/MPI_AllgatherandEpetra_MpiComm::MaxAll/MPI_Allreduce.Why does this only happen for chemotaxis?
h3. Configuration
Mac OS X Snow Leopard 10.6.8
I have seen this with a variety of builds, but current installation was done with wiki:InstallFiPy/MacOSX/HomeBrew and wiki:InstallFiPy/PipInstallsPython
Imported from trac ticket #360, created by guyer on 09-14-2011 at 09:22, last modified: 01-31-2012 at 13:47
The text was updated successfully, but these errors were encountered: