Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Running SPECFEM2D with Slurm #924

Closed
tianzeliu opened this issue Apr 23, 2018 · 7 comments
Closed

Running SPECFEM2D with Slurm #924

tianzeliu opened this issue Apr 23, 2018 · 7 comments

Comments

@tianzeliu
Copy link

I was trying to run SPECFEM2D on a computer cluster using Slurm as the system management tool. In this case I need to submit jobs as oppose to running the code directly. Although I could get SPECFEM2D running, it was never able to generate any output. Instead it would keep running until the time assigned by Slurm was up and then crash (the time should be enough for the simulation to finish as I have tested it on my own machine). Is there any idea on why this happened? Thanks a lot.

@komatits
Copy link
Contributor

komatits commented Apr 23, 2018 via email

@tianzeliu
Copy link
Author

Hi Dimitri,
Thank you for the quick response. Is it possible for you to provide a sample submission script so that I could modify on top of that? Thanks a lot.

Best,
Tianze

@komatits
Copy link
Contributor

komatits commented Apr 23, 2018 via email

@tianzeliu
Copy link
Author

Hi Dimitri,
The submission script you provided seemed to work. However, the code now breaks at a fixed point of the simulation, which only happens on the cluster not on my local machine. The only difference between the cluster and the local machine is that the version on the cluster may be newer (I installed it just now). Here is the error message it gives:
Backtrace for this error: #0 0x7F3BEF41E6F7 #1 0x7F3BEF41ED3E #2 0x7F3BEE70726F #3 0x45078D in compute_forces_viscoelastic_ at compute_forces_viscoelastic.F90:465 #4 0x45572E in compute_forces_viscoelastic_main_ at compute_forces_viscoelastic_calling_routine.F90:67 #5 0x49A0D6 in iterate_time_ at iterate_time.F90:165
It does not seem to be a problem with an unstable time scheme, because the CFL number and the suggested minimum time step both look OK. Thanks a lot!

Tianze

@komatits
Copy link
Contributor

komatits commented Apr 27, 2018 via email

@tianzeliu
Copy link
Author

tianzeliu commented Apr 27, 2018

Hi Dimitri,
I have tried two examples and they ran successfully, so I guess it is something wrong with my input. I am attaching the parameter files I use. The code seemed to break when the wave front hit the first interface. Thanks a lot!
interfaces.txt
Par_file.txt
SOURCE.txt

Best,
Tianze

@tianzeliu
Copy link
Author

Hi Dimitri,
I increased the size of spectral element in both X and Z dimension by a factor of 2 while fixing the time step size and the code ran successfully. So I guess the previous problem was indeed caused by an unstable numeric scheme, though the CFL number appeared to be stable.
Thank you for your help anyway!

Best,
Tianze

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants