Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pytrlinos from conda-forge crashes tests with MKL ERROR #622

Open
1 of 2 tasks
guyer opened this issue Mar 5, 2019 · 3 comments
Open
1 of 2 tasks

pytrlinos from conda-forge crashes tests with MKL ERROR #622

guyer opened this issue Mar 5, 2019 · 3 comments
Labels
Milestone

Comments

@guyer
Copy link
Member

guyer commented Mar 5, 2019

Hard failure on CircleCI

Doctest: examples.diffusion.steadyState.otherMeshes.grid3Dinput ... 
Intel MKL ERROR: Parameter 8 was incorrect on entry to DORGQR.
Error in dorgqr on 0 row (dims are 150, 1)

Error in CoarsenMIS: dorgqr returned a non-zero
napplication called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
[unset]: write_line error; fd=-1 buf=:cmd=abort exitcode=1
:
system msg for write_line failure : Bad file descriptor
Exited with code 1

Issue appears to be mesh size:

  • 10x5x3 crashes
  • 8x5x3 runs
  • 9x5x3 crashes
  • 8x6x3 crashes

Much larger 2D meshes have no problem.

Partial list of installed conda packages:

mkl                pkgs/main/linux-64::mkl-2019.1-144
mkl_fft            conda-forge/linux-64::mkl_fft-1.0.10-py27h14c3975_1
mkl_random         conda-forge/linux-64::mkl_random-1.0.2-py27h637b7d7_2
pytrilinos         conda-forge/linux-64::pytrilinos-12.10.1-py27h7127e17_2
  • Temporary solution is to downsize the mesh to allow the tests to run.
  • Longer term solution is to file a MCVE with conda-forge/pytrilinos-feedstock
@guyer guyer added the solvers label Mar 5, 2019
@guyer
Copy link
Member Author

guyer commented Mar 6, 2019

No crash on MacOS

mkl                       2019.0                      118  
pytrilinos                12.10.1          py27h7127e17_2    conda-forge

@guyer
Copy link
Member Author

guyer commented Mar 6, 2019

examples/cahnHilliard/mesh3D.py is tested on a much bigger mesh (10x10x10) and does not crash

guyer added a commit to guyer/fipy that referenced this issue Mar 6, 2019
@guyer
Copy link
Member Author

guyer commented Mar 8, 2019

Similar crash in parallel, but for a different test:

Doctest: examples.diffusion.electrostatics ... electrostatics (examples.diffusion)
Doctest: examples.diffusion.electrostatics ... 

Intel MKL ERROR: Parameter 8 was incorrect on entry to DORGQR.
Intel MKL ERROR: Parameter 8 was incorrect on entry to DORGQR.
Error in dorgqr on 0 row (dims are 100, 1)
Error in dorgqr on 0 row (dims are 100, 1)

Error in CoarsenMIS: dorgqr returned a non-zero
napplication called MPI_Abort(MPI_COMM_WORLD, 1) - process 0

Error in CoarsenMIS: dorgqr returned a non-zero
napplication called MPI_Abort(MPI_COMM_WORLD, 1) - process 1
Exited with code 1

@guyer guyer added this to the 4.0 milestone Jul 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant