Allow 'planemo run' to stage in existing datasets without reuploading #10334

simonbray · 2020-09-30T12:34:53Z

Fixes galaxyproject/planemo#1080. This was much easier than I expected.

The idea is that the job or test yaml file can contain galaxy_ids for any File or Collection instead of a local path. Assuming these are valid for a particular galaxy server, the user can then use planemo run and execute the workflow using the existing datasets/collections and therefore can skip the upload step.

mgf_input_list:
  class: Collection
  galaxy_id: 1e94904b7ba7b9cc
metapeptides:
  class: File
  galaxy_id: 11ac94870d0bb33a71934a4bcff13734
gene_ontology_terms:
  class: File
  galaxy_id: 11ac94870d0bb33aa74376be3811a4fe

I tested this with both files and collections and it seems to work fine. Also, specifying galaxy_id for some files and location for others (i.e. so some files are uploaded and others not) works.

lib/galaxy/tool_util/cwl/util.py

jmchilton · 2020-09-30T14:02:31Z

I'm glad that was so simple! This seems awesome to me.

simonbray added 3 commits September 29, 2020 19:21

allow copying datasets rather than uploading in StagingInterface

cbbe316

remove copy_func, appears unnecessary

2652f58

tidy

f653eb6

simonbray commented Sep 30, 2020

View reviewed changes

lib/galaxy/tool_util/cwl/util.py Show resolved Hide resolved

galaxybot added the triage label Sep 30, 2020

galaxybot added this to the 21.01 milestone Sep 30, 2020

bgruening requested a review from jmchilton September 30, 2020 13:22

jmchilton merged commit ee2eb2b into galaxyproject:dev Sep 30, 2020

simonbray mentioned this pull request Sep 30, 2020

[20.09] Allow 'planemo run' to stage in existing datasets without reuploading #10335

Merged

nsoranzo added area/tool-framework kind/enhancement and removed triage labels Sep 30, 2020

galaxyproject deleted a comment from galaxybot Sep 30, 2020

simonbray mentioned this pull request Jan 4, 2021

Bump galaxy-tool-util version in requirements galaxyproject/planemo#1128

Merged

simonbray deleted the staging-with-id branch January 4, 2021 11:07

simonbray mentioned this pull request May 12, 2021

[20.09] Force galaxy_id to string in galactic_job_json #11965

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow 'planemo run' to stage in existing datasets without reuploading #10334

Allow 'planemo run' to stage in existing datasets without reuploading #10334

simonbray commented Sep 30, 2020 •

edited

Loading

jmchilton commented Sep 30, 2020

Allow 'planemo run' to stage in existing datasets without reuploading #10334

Allow 'planemo run' to stage in existing datasets without reuploading #10334

Conversation

simonbray commented Sep 30, 2020 • edited Loading

jmchilton commented Sep 30, 2020

simonbray commented Sep 30, 2020 •

edited

Loading