
Support docker container option --shm-size, per process #2282

Closed
maxbates opened this issue Aug 27, 2021 · 4 comments


@maxbates

maxbates commented Aug 27, 2021

New feature

Some programs that use large file-based databases, like jackhmmer, see a significant performance increase when the database can be loaded into memory via shared memory (e.g. /dev/shm) or a tmpfs volume.

I would like to specify the size of a ramdisk, dependent on the memory allocated to the container, e.g. if I allocate 64 GB to the container, I would pass --shm-size=64g.

The size needs to be dynamic, so it matches the memory available to the container. Mounting /dev/shm is problematic in this case, because I do not want processes to compete for the host machine's shared memory (i.e. when multiple containers are scheduled onto the same instance).
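In Nextflow terms, the ask could look like the sketch below. Note that `shmSize` is the hypothetical directive proposed by this issue, not an existing feature, and the process name, sizes, and file names are illustrative:

```groovy
// Hypothetical: shmSize is the directive proposed in this issue,
// not an existing Nextflow feature. Sizes and paths are illustrative.
process jackhmmerSearch {
    memory '64 GB'
    shmSize '64 GB'   // size /dev/shm to match the container's memory

    script:
    """
    cp uniref90.fasta /dev/shm/
    jackhmmer query.fasta /dev/shm/uniref90.fasta
    """
}
```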

On AWS, because Nextflow creates the job definitions itself and does not support the containerOptions process directive, I do not think it is possible to provision a dynamically sized shared-memory volume without manually creating a job definition (which I would like to avoid).

By default, Docker allocates 64 MB to /dev/shm, but this can be configured with --shm-size ([ref](https://docs.docker.com/engine/reference/run/)). The size cannot be changed from within the container, as that requires remounting the volume.

AWS supports specifying sharedMemorySize in the job definition, which simply passes through to Docker's --shm-size.
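For reference, a minimal Batch job definition carrying that setting might look like the fragment below (the job-definition name and image are illustrative). Note that sharedMemorySize sits under linuxParameters and is expressed in MiB, so 64 GB is 65536:

```json
{
  "jobDefinitionName": "jackhmmer-shm",
  "type": "container",
  "containerProperties": {
    "image": "myorg/jackhmmer:latest",
    "linuxParameters": {
      "sharedMemorySize": 65536
    }
  }
}
```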

Usage scenario

The use of a ramdisk in alphafold's colab notebook for running jackhmmer can be seen here (creating /tmp/ramdisk). There is a similar recommendation on GitHub for speeding up hhblits.
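The approach in those references amounts to something like the following (requires root; the mount point, size, and database file are illustrative):

```shell
# Create a tmpfs-backed ramdisk and stage the database on it
# (illustrative; requires root privileges on the host or container):
sudo mkdir -p /tmp/ramdisk
sudo mount -t tmpfs -o size=9G tmpfs /tmp/ramdisk
cp uniref90.fasta /tmp/ramdisk/
```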

Suggested implementation

Add a process directive shmSize, update the AWS Batch plugin's newSubmitRequest(TaskRun task) (ref), and update the local container executor.

@pditommaso
Member

I think what could be done here is parsing the containerOptions and mapping selected options to the corresponding Batch API fields, such as --shm-size, --ulimit, etc.
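As a rough sketch of that mapping (a hypothetical helper, not actual Nextflow code): Docker-style size strings such as `64g` would need to be normalized to the MiB integer that Batch's sharedMemorySize field expects, e.g.:

```shell
# Hypothetical helper: convert a docker-style --shm-size value
# (e.g. "64g", "512m") into the MiB integer that AWS Batch's
# sharedMemorySize field expects. Only lowercase suffixes handled here.
shm_size_to_mib() {
  case "$1" in
    *g) echo $(( ${1%g} * 1024 )) ;;     # GiB -> MiB
    *m) echo "${1%m}" ;;                 # already MiB
    *k) echo $(( ${1%k} / 1024 )) ;;     # KiB -> MiB
    *)  echo $(( $1 / 1048576 )) ;;      # assume bytes when no suffix
  esac
}

shm_size_to_mib 64g   # prints 65536
```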

@maxbates
Author

maxbates commented Aug 30, 2021

That would absolutely work for me!

Nice to have: make the directive dynamic (i.e. able to use $task.memory), so it can scale up with each attempt?
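If containerOptions were honoured on Batch and accepted a dynamic closure, the scale-up scenario could look like this sketch (values are illustrative, and whether containerOptions evaluates a closure here is an assumption):

```groovy
// Sketch: grow memory with each retry and derive --shm-size from it.
// Assumption: containerOptions accepts a dynamic closure and is
// applied by the executor in use.
process jackhmmerSearch {
    errorStrategy 'retry'
    maxRetries 2
    memory { 32.GB * task.attempt }
    containerOptions { "--shm-size=${task.memory.toGiga()}g" }

    script:
    """
    jackhmmer query.fasta /dev/shm/uniref90.fasta
    """
}
```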

@pditommaso
Member

Would this also require the use of the --tmpfs Docker option?
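For comparison: --tmpfs mounts a fresh tmpfs at an arbitrary path, whereas --shm-size only resizes /dev/shm. An illustrative invocation (requires a running Docker daemon; size and path are arbitrary):

```shell
docker run --rm --tmpfs /tmp/ramdisk:rw,size=64g ubuntu df -h /tmp/ramdisk
```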

@pditommaso
Member

Solved by #2471

@pditommaso pditommaso added this to the 22.04.0 milestone Dec 22, 2021