How to edit the settings of the job manager to do sequential submission of the job queue ? #3

bradraj · 2018-07-14T04:50:07Z

The abipy submits the job simultaneously and it crashes the system. When the next job input depends on the previous job then it is running sequencially. But when it is not dependent, all the jobs are submitted at once and when the RAM of the system overloads, it crashes the system.

Is there any way to submit the job using job scheduler sequencially one by one ???

gmatteo · 2018-07-16T16:33:21Z

Add the following options to scheduler.yml

# Limit on the number of jobs that can be present in the queue. (DEFAULT: 200)
max_njobs_inqueue: 2

# Maximum number of cores that can be used by the scheduler.
max_ncores_used: 4

To get the list of options supported in scheduler.yml and manager.yml, use:

abidoc.py scheduler
abidcoc.py manager

bradraj · 2018-07-17T02:18:26Z

I have given the following options in scheduler.yml

max_njobs_inqueue:1
max_nlaunches:1
#no of seconds to wait
seconds: 5

One job is submitted at a given time. If the job takes more than 5 seconds, the previous job is running and at the same time the new job is getting submitted. No matter the status of the old job whether it is running or not, for every 5 second a new job is getting submitted. This when the jobs are not inter-related

I don't know how much time each job will take and I can't give a fiex wait time. Is there a way to fix this ?

Is there a way to make all the jobs wait till previous one gets completed ?

gmatteo · 2018-07-17T21:41:55Z

Try to set max_ncores_used to the total number of physical CPUS available on your machine.
This adds an additional constraint to the scheduler.

If the problems persists, send me your manager.yml and the script used to run the calculations.

bradraj · 2018-07-18T11:02:18Z

Adding the max_ncores_used did the trick. It limited the submission of programs. It would be useful if you can in the future add max_memory_used for the scheduler. Thanks a ton.

gmatteo mentioned this issue Jul 17, 2018

How to run a flow with specific hosts? abinit/abipy#157

Closed

bradraj closed this as completed Jul 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to edit the settings of the job manager to do sequential submission of the job queue ? #3

How to edit the settings of the job manager to do sequential submission of the job queue ? #3

bradraj commented Jul 14, 2018

gmatteo commented Jul 16, 2018

bradraj commented Jul 17, 2018

gmatteo commented Jul 17, 2018

bradraj commented Jul 18, 2018

How to edit the settings of the job manager to do sequential submission of the job queue ? #3

How to edit the settings of the job manager to do sequential submission of the job queue ? #3

Comments

bradraj commented Jul 14, 2018

gmatteo commented Jul 16, 2018

bradraj commented Jul 17, 2018

gmatteo commented Jul 17, 2018

bradraj commented Jul 18, 2018