
combining multiple job starts #12

Open
bernstei opened this issue Mar 28, 2022 · 5 comments
@bernstei (Contributor)

On some remote machines just the ssh connection is somewhat slow. It would be nice if multiple job start commands could be combined, perhaps by gathering all the remote commands into an array of strings, and then running all of them in a single ssh connection.
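A minimal sketch of that idea (hypothetical helper names, not ExPyRe's actual API): join the per-job commands into one script so the whole batch travels over a single ssh connection.

```python
import subprocess

def batch_commands(commands):
    """Join per-job shell commands into a single script, so one
    ssh connection carries the whole batch."""
    return " && ".join(commands)

def run_remote_batch(host, commands):
    # One ssh round-trip for the entire batch, instead of one per job.
    return subprocess.run(["ssh", host, batch_commands(commands)],
                          capture_output=True, text=True)
```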

@bernstei bernstei changed the title combining job start combining multiple job starts Mar 28, 2022
@bernstei (Contributor, Author)

Note - it's unclear, in retrospect, what makes these remote job starts slow. Need to investigate further before determining how to increase rate.

@bernstei (Contributor, Author) commented Apr 1, 2022

Looks like the staging in of files and ssh qsub each take a non-negligible time (around 1s). Both would need to be batched to fully help.

@gabor1 (Contributor) commented Apr 4, 2022

Is this really an issue? I guess you are already batching individual configs, so it won't be the case that you'd want to qsub 10,000 individual jobs (many queueing systems would choke on that as well).

@bernstei (Contributor, Author) commented Apr 4, 2022

It is when you have 1000 jobs (one per config, to re-evaluate an entire fitting database with tighter DFT parameters) and each one takes 3 seconds, because the rsync to stage in files takes 1.5 s and the ssh to qsub takes another 1.5 s. I guess I could set chunksize=1 and job_chunksize > 1 to do job_chunksize DFT evaluations per job, reducing the number of rsync/ssh+qsub calls by a factor of job_chunksize. Maybe that's the right approach.

@bernstei (Contributor, Author)

I have a solution for this: ExPyRe, system, and scheduler can all be told to store information in a buffer, and then start all the jobs in the buffer at once (one ssh to set up the directories, one rsync to stage in the run dirs, and one ssh to submit all the jobs). A PR will be available eventually - it would be useful if people tested the SGE implementation, which I do not have access to.
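The buffering scheme described above might look roughly like this (a sketch with hypothetical names, not the actual ExPyRe implementation): accumulate pending jobs, then flush the whole buffer with one rsync and one ssh.

```python
import subprocess

def build_submit_script(jobs):
    """Combine (remote_dir, submit_cmd) pairs into one shell script
    that submits every buffered job in a single ssh session.
    Subshells keep each cd from leaking into the next job."""
    return " && ".join(f"(cd {rdir} && {cmd})" for rdir, cmd in jobs)

class BufferedSubmitter:
    def __init__(self, host):
        self.host = host
        self.pending = []  # list of (local_dir, remote_dir, submit_cmd)

    def add(self, local_dir, remote_dir, submit_cmd):
        # Buffer the job instead of submitting it immediately.
        self.pending.append((local_dir, remote_dir, submit_cmd))

    def flush(self):
        if not self.pending:
            return
        # One rsync stages in all run dirs at once ...
        local_dirs = [p[0] for p in self.pending]
        subprocess.run(["rsync", "-a", *local_dirs, f"{self.host}:"],
                       check=True)
        # ... and one ssh submits every buffered job.
        script = build_submit_script([(r, c) for _, r, c in self.pending])
        subprocess.run(["ssh", self.host, script], check=True)
        self.pending = []
```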
