Tag nodes with virtual memory footprints #857

effigies · 2017-11-29T21:41:43Z

Right now we're tagging high-memory usage nodes with their maximum demands for resident memory, which is appropriate for systems that permit the kernel to overcommit memory. However, in strict overcommit mode, the kernel counts all virtual memory against its total limits.

For non-strict mode, using resident memory requirements will allow us to better utilize a system's resources, while for strict mode, using virtual memory requirements will prevent us from crashing. So it seems sensible to allow both options, which will adjust the tagging.

I suspect we'll want a function or an object to handle the memory options. If we make some rough assumptions of VM/RSS ratios, we can simply keep the numbers we have in the case of non-strict, and apply a scaling factor for strict. e.g.

MEMORY_MODE = 'strict'
VM_RSS_RATIO = 2

def scale_mem(rss, vm=None):
    if MEMORY_MODE == 'nonstrict':
        return rss
    if vm is not None:
        return vm
    return rss * VM_RSS_RATIO

# Profiled RSS only
node = pe.Node(Interface(), mem_gb=scale_mem(3))

# Profiled RSS AND VM
node = pe.Node(Interface(), mem_gb=scale_mem(3, 5))

oesteban · 2017-11-29T22:05:16Z

We can check for /proc/sys/vm/overcommit_memory < 2 to see if overcommit is allowed at all (default would be yes, in case the file can't be read). And a default RSS_VM_RATIO (I switched the order, so ratios will be <1.0 in that case) can be read from /proc/sys/vm/overcommit_ratio. A default of 0.5 makes sense.

oesteban · 2017-11-29T22:06:39Z

Actually, this is probably something we may want to integrate directly into nipype, provide nipype config options etc.

oesteban · 2018-05-04T01:00:52Z

Please have a look at: https://neurostars.org/t/how-much-ram-cpus-is-reasonable-to-run-pipelines-like-fmriprep/1086/5

effigies added the optimization label Dec 11, 2017

oesteban added potential hackathon project help wanted labels May 4, 2018

effigies mentioned this issue Aug 27, 2018

Broken process pool #1260

Closed

oesteban added this to To do in Documentation Mar 11, 2019

mgxd mentioned this issue Jun 23, 2020

fmriprep docker error: concurrent.futures.process.BrokenProcessPool #2199

Closed

dPys mentioned this issue Jul 29, 2020

Node check_orient_and_dims_clust_mask_node failed to run dPys/PyNets#404

Closed

oesteban removed this from To do in Documentation Sep 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tag nodes with virtual memory footprints #857

Tag nodes with virtual memory footprints #857

effigies commented Nov 29, 2017

oesteban commented Nov 29, 2017 •

edited

Loading

oesteban commented Nov 29, 2017 •

edited

Loading

oesteban commented May 4, 2018

Tag nodes with virtual memory footprints #857

Tag nodes with virtual memory footprints #857

Comments

effigies commented Nov 29, 2017

oesteban commented Nov 29, 2017 • edited Loading

oesteban commented Nov 29, 2017 • edited Loading

oesteban commented May 4, 2018

oesteban commented Nov 29, 2017 •

edited

Loading

oesteban commented Nov 29, 2017 •

edited

Loading