Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.Sign up
Fix `test_midway_ipp.ssh.slurm.config1.py` test #196
I see the following error when running
with the singleNode config:
Spits out the following log:
While the queue at the time of execution is:
This only happens when the config has multiple defined sites, otherwise it's fine. It's only been tested on slurm, so it may or may not occur on other clusters. Notice the job ID that produces the key error does actually exist, but it's always the oldest job ID in the queue. Some of the engines do successfully execute jobs, but the script as a whole will never terminate.
The stategy module polls the status of all jobs periodically. Often this poll happens prior to the actual submission of jobs to the site. At this point when there are no jobs in the resources list, slurm returns counter-intuitive results when called with
I'm able to reproduce this issue. A fix is in testing.