optimal run args for running on HPC? #387

mgxd · 2017-02-15T22:36:08Z

This is my current command:

mriqc --use-plugin plugin.yml --n_procs 8 --mem_gb 30 [mandatories]

plugin.yml
{plugin: 'SLURM', plugin_args: {sbatch_args: '--time=1-00:00:00 --mem=30GB -c 4'}}

I have to wait around 3 hours before the workflows actually start to run, and even then it seems to take longer than expected. Any suggestions for improving this?

*I'm running on Prisma data with multi-slice acquisition; functionals are about 300 vols, 2-4 runs each.

chrisgorgo · 2017-02-15T23:07:56Z

Are you saying that your job waits in your cluster queue for 3h, or that the job starts but mriqc is idle for 3h before it starts doing any computation?

mgxd · 2017-02-16T00:49:50Z

The job starts but it seems it is building the functional workflows for almost 3 hours.

chrisgorgo · 2017-02-16T00:54:47Z

This is very unusual. Could you share a dataset that shows these performance issues?

oesteban · 2017-03-24T17:29:02Z

Hi @mgxd, I was coming back to this and realized that you are using the SLURM plugin. I don't really know the details of how the plugin is implemented in nipype, but that long set-up time seems to me closely related to the plugin choice.

When we run mriqc on our HPC, these are our settings:

One subject per compute node,
with one process per subject using the MultiProc plugin,
with n_procs depending on the maximum memory available, so for your 30GB I would probably go with n_procs<=12 or so.

To parallelize this many processes, you could use a job array. If you are not allowed to use job arrays, then a solution like launcher may do.
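
The settings above could be wired into a SLURM job array roughly as follows. This is only a minimal sketch under stated assumptions, not a tested recipe: the 2.5 GB-per-process figure is back-derived from the "30GB, n_procs<=12" rule of thumb, the paths and subject labels are hypothetical, `--participant_label` is assumed to be available in your mriqc version (check `mriqc --help`), and leaving out `--use-plugin` is assumed to make mriqc fall back to MultiProc when `--n_procs` is greater than 1.

```python
import os
import shlex


def max_procs(mem_gb, gb_per_proc=2.5):
    """Cap n_procs so that n_procs * gb_per_proc stays within mem_gb.

    gb_per_proc=2.5 is an assumption back-derived from the
    "30GB -> n_procs<=12" rule of thumb, not a measured value.
    """
    return max(1, int(mem_gb // gb_per_proc))


def mriqc_command(bids_dir, out_dir, subject, mem_gb=30):
    """Build the mriqc call for ONE subject, to run on one compute node."""
    return [
        "mriqc", bids_dir, out_dir, "participant",
        "--participant_label", subject,  # assumed flag; check `mriqc --help`
        "--n_procs", str(max_procs(mem_gb)),
        "--mem_gb", str(mem_gb),
    ]


def subject_for_task(subjects, env=os.environ):
    """Pick this array task's subject from SLURM_ARRAY_TASK_ID (0-based)."""
    return subjects[int(env.get("SLURM_ARRAY_TASK_ID", "0"))]


if __name__ == "__main__":
    subjects = ["01", "02", "03"]  # hypothetical subject labels
    cmd = mriqc_command("/data/bids", "/data/out", subject_for_task(subjects))
    # Print the command instead of running it, so it can be inspected first.
    print(shlex.join(cmd))
```

Each array task (for example, one submitted with `sbatch --array=0-2` around a wrapper script of your own) would then process exactly one subject on its node, with the SLURM resource request matching the `--mem_gb` value passed to mriqc.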

I am about to update the documentation with some profiling we've been doing on mriqc.

Does this answer your question?

mgxd · 2017-03-24T17:48:39Z

@oesteban thanks for the info, I'm looking forward to the documentation!

oesteban added the documentation label Mar 10, 2017

oesteban added this to the MRIQC 1.0.0 milestone Mar 10, 2017

oesteban modified the milestones: MRIQC 1.0.0, MRIQC 1.0.1 Mar 22, 2017

oesteban added a commit to oesteban/mriqc that referenced this issue Mar 23, 2017

[ENH] Add --profile flag

21a3045

A step towards some recommendations for nipreps#387 and nipreps#388

oesteban added a commit that referenced this issue Mar 24, 2017

add memory profiling (#387), and warning about memory in Docker for M…

c053dd8

…ac (close #388)

mgxd closed this as completed Mar 24, 2017
