Join GitHub today
GitHub is home to over 40 million developers working together to host and review code, manage projects, and build software together.Sign up
LSF Adapter: Add optional delay before returning id after successfully submitting a job #81
then after bsub-ing the job we call bjobs to verify the submitted job has appeared in the queue. If it has not yet appeared, we then sleep for several seconds, then call bjobs again. We repeat until the job appears in the queue or the time since bsub returned the id exceeds the number of seconds specified in "verify_submit_timeout". If "verify_submit_timeout" is 0 or -1 or a non number then we don't check the queue for the job, we just return.
This will allow addressing the edge case were after a job is submitted in multi-cluster node there is a delay before bjobs displays the job in the queue.
I propose we should hold off on this until we can confirm that the
can fix this issue by setting
The issue turned out that we were calling the job status on a host group and not a cluster. So the delay was because the job was Pending and had yet to be dispatched to a host in the requested host group. Once the job was dispatched to a valid host under the requested host group and entered the Running state it would appear in the job status request.