
Feature request: volume mounting #2190

Open
vdauwera opened this issue Apr 21, 2017 · 14 comments
Labels
Needs Triage Ticket needs further investigation and refinement prior to moving to milestones

Comments

@vdauwera
Contributor

To address the type of use case described in http://gatkforums.broadinstitute.org/gatk/discussion/comment/38188#Comment_38188

@katevoss katevoss removed their assignment May 13, 2017
@katevoss

@vdauwera can you summarize the use case in the forum?

@vdauwera
Contributor Author

I don't have a good handle on the details but it looked like @ChrisL understood it well.

@CarlosBorroto

CarlosBorroto commented Oct 6, 2017

Found this issue while looking for a solution for mounting a Docker volume.

In my case, I would like to run Ensembl VEP with Cromwell/WDL. Using VEP in cache/offline mode has many advantages, among them much better performance. Running VEP in cache mode requires a large set of files to be installed locally, and downloading them with the provided INSTALL.pl every time would be very inefficient. For now I plan to tar everything together, then download and untar it from a Google bucket every time the task runs. However, it would be much better if I could mount a Docker volume into the container running the task.

The way I see it, I would define a snapshot in the runtime section of the task definition, along with the mount point (docker run -v *:{mount point}) where that snapshot should be made available as a Docker volume. In the background, Cromwell would provision a disk from the snapshot, mount it on the VM, and run the container with the appropriate docker run -v /path/to/disk:/requested/mount/point arguments.
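A minimal sketch of what those background steps could look like. Everything here is illustrative, not a real Cromwell feature: the snapshot name, paths, and image are made up, and the script only prints the commands it describes rather than executing them.

```shell
#!/bin/sh
# Hypothetical sketch of the provisioning steps proposed above.
# All names (snapshot, paths, image) are illustrative; the script
# only echoes the commands instead of running them.

SNAPSHOT="my-vep-snapshot"            # snapshot declared in the task's runtime section
CONTAINER_MOUNT="/opt/vep/.vep"       # mount point requested for the container
HOST_DIR="/mnt/disks/${SNAPSHOT}"     # where the provisioned disk would land on the VM

# 1. Cromwell provisions a persistent disk from the snapshot:
echo "gcloud compute disks create ${SNAPSHOT}-disk --source-snapshot=${SNAPSHOT}"

# 2. ...attaches and mounts it on the worker VM (details elided), then
# 3. ...bind-mounts it read-only into the task container:
echo "docker run -v ${HOST_DIR}:${CONTAINER_MOUNT}:ro my-vep-image vep --offline"
```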

Hope this helps define the issue.

Thanks for considering raising the priority of this.

@lbergelson
Member

We have a very similar use case. We'd like to run a different annotator that depends on a massive pile of data sources (~20 GB). We want an easy way to package different sets of test files and make them available to people using our Docker image, without having to build a 20 GB image.

@Selonka

Selonka commented Feb 28, 2018

Hi, same problem here as @CarlosBorroto! Just wanted to bump the issue.

@vinash85

Hi, same problem here. This would be a great addition to Cromwell. Thanks!

@jason-weirather

jason-weirather commented Mar 14, 2018

Hello @vdauwera, I have a similar use case in Cromwell that I think this could cover. Specifically, we hope to be able to mount a type=tmpfs volume. This creates a RAM disk, which we use to unpack data sets containing tens of thousands of files very quickly.

Google describes how to do this in their docs
https://cloud.google.com/compute/docs/containers/configuring-options-to-run-containers#mounting_tmpfs_file_system_as_a_data_volume

We have had success with this on our Slurm Cromwell backend by launching the Docker container through the submit script ourselves and passing docker run the mount parameter:

${'--mount type=tmpfs,destination='+mount_tmpfs}

It would be great if declaring a tmpfs mount point were also supported by Cromwell for Google Cloud submissions. Thanks!
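For reference, a sketch of the docker invocation the WDL fragment above expands to. The mount_tmpfs value and image name are made up, and the script only prints the command rather than running it.

```shell
#!/bin/sh
# Sketch of the docker command the WDL interpolation above expands to.
# The mount_tmpfs value and image name are illustrative; the command is
# echoed, not executed.

mount_tmpfs="/ramdisk"
TMPFS_FLAG="--mount type=tmpfs,destination=${mount_tmpfs}"

# A tmpfs mount lives in RAM, so unpacking tens of thousands of small
# files there avoids disk I/O entirely:
echo "docker run ${TMPFS_FLAG} my-image tar -xf archive.tar -C ${mount_tmpfs}"
```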

@dinvlad
Contributor

dinvlad commented Aug 17, 2018

+1 on tmpfs. Currently, we have to create a directory under /dev/ and rely on the assumption that it gets mounted by default as a tmpfs with half of the available RAM (at least on GCP). This is obviously not ideal, and delocalization of such files is problematic as well.

Our use case is exactly the same: unpacking/processing tens or hundreds of thousands of small files (in a BCL). Doing so on any "normal" disk is much slower than in RAM.
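The /dev workaround described above might look roughly like this inside a task's command block. The scratch path, archive name, and downstream tool are all illustrative, and the script only prints the steps rather than executing them.

```shell
#!/bin/sh
# Sketch of the /dev workaround: on many Linux setups (including GCP VMs),
# /dev is tmpfs-backed, so a directory created there lives in RAM.
# Path, archive, and tool names are illustrative; commands are echoed only.

RAM_SCRATCH="/dev/bcl_scratch"

echo "mkdir -p ${RAM_SCRATCH}"                # relies on /dev being a tmpfs
echo "tar -xf run.tar -C ${RAM_SCRATCH}"      # unpack thousands of small files in RAM
echo "process_bcl ${RAM_SCRATCH}"             # process, then delocalize only the results
```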

@armedgorillas

Hi -- I know this is an old issue, but has there been any further discussion on how to mount persistent disks? We're using PAPIv2 as the backend, and we'd like to expose reference databases (stored as filesystems) to our docker containers via a mounted volume.

@gemmalam gemmalam added the Needs Triage Ticket needs further investigation and refinement prior to moving to milestones label Mar 28, 2019
@Selonka

Selonka commented Mar 28, 2019

Hi @armedgorillas,

You can bypass this by calling the Docker container from the task itself. I wrote about it on the GATK forum a while ago; take a look at:
https://gatkforums.broadinstitute.org/gatk/discussion/comment/50056#Comment_50056

Greetings Selonka
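As a sketch, the workaround amounts to a task whose command section invokes docker itself, bind-mounting the reference data from the host. The image, database path, and tool names are all illustrative, the command is printed rather than run, and (as noted further down in this thread) this only works where the task can actually reach the Docker daemon.

```shell
#!/bin/sh
# Sketch of the workaround: the task's command section calls docker directly,
# bind-mounting a host directory, instead of letting Cromwell manage the
# container. Image, paths, and tool are illustrative; the command is echoed only.

DB_DIR="/refdata/annotation_db"   # large reference database on the host

echo "docker run --rm -v ${DB_DIR}:/db:ro my-annotator annotate --db /db input.vcf"
```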

@armedgorillas

Thanks @Selonka! That looks like a nifty workaround.

Does this work with the Pipelines API backend, or just with a local backend?

@dinvlad
Contributor

dinvlad commented Mar 29, 2019

I don’t think it works for the PAPI backend, because one needs to mount docker.sock into the container to be able to invoke Docker commands. I’m honestly a little surprised it even works locally.

@antonkulaga
Contributor

So, no progress on this issue in years?

@valentynbez

+1, similar need here: our workflow runs in a Docker container and depends on massive databases that must be available for the app to finish.
