Allow 'planemo run' to stage in existing datasets without reuploading #10334

Merged 3 commits on Sep 30, 2020

@simonbray (Member) commented Sep 30, 2020

Fixes galaxyproject/planemo#1080. This was much easier than I expected.

The idea is that the job or test YAML file can contain a galaxy_id for any File or Collection instead of a local path. Assuming these IDs are valid on the target Galaxy server, the user can then use planemo run to execute the workflow on the existing datasets/collections and skip the upload step.

mgf_input_list:
  class: Collection
  galaxy_id: 1e94904b7ba7b9cc
metapeptides:
  class: File
  galaxy_id: 11ac94870d0bb33a71934a4bcff13734
gene_ontology_terms:
  class: File
  galaxy_id: 11ac94870d0bb33aa74376be3811a4fe

I tested this with both files and collections and it seems to work fine. Mixing also works: specifying galaxy_id for some files and location for others means that some files are uploaded and others are not.
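
To make the mixed case concrete, here is a rough sketch of how a job description could be split into inputs that already exist on the server and inputs that still need uploading. This is not the code from this PR: `plan_staging`, the returned structure, and the `go_terms.tsv` location are hypothetical, and the mapping of File to `hda` and Collection to `hdca` is an assumption about how such references might be passed on to Galaxy.

```python
# Illustrative sketch only: not planemo's or Galaxy's actual implementation.
# `plan_staging` and the shape of its return value are hypothetical.

def plan_staging(job):
    """Split job inputs into existing Galaxy references and files to upload."""
    existing = {}
    to_upload = {}
    for name, value in job.items():
        if not isinstance(value, dict):
            # Plain parameters (strings, numbers) need no staging.
            continue
        if "galaxy_id" in value:
            # Assumed mapping: File -> "hda", Collection -> "hdca".
            src = "hdca" if value.get("class") == "Collection" else "hda"
            existing[name] = {"src": src, "id": value["galaxy_id"]}
        elif "location" in value or "path" in value:
            # No galaxy_id given, so this input would still be uploaded.
            to_upload[name] = value.get("location") or value.get("path")
    return existing, to_upload


# Same inputs as the YAML above, except that one file uses a
# (hypothetical) local location instead of a galaxy_id.
job = {
    "mgf_input_list": {"class": "Collection", "galaxy_id": "1e94904b7ba7b9cc"},
    "metapeptides": {"class": "File", "galaxy_id": "11ac94870d0bb33a71934a4bcff13734"},
    "gene_ontology_terms": {"class": "File", "location": "go_terms.tsv"},
}
existing, to_upload = plan_staging(job)
```

In this sketch only `gene_ontology_terms` ends up in `to_upload`; the other two inputs are referenced by ID and never leave the server.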

@galaxybot added this to the 21.01 milestone Sep 30, 2020
@jmchilton (Member) commented

I'm glad that was so simple! This seems awesome to me.

Successfully merging this pull request may close these issues.

Allow existing dataset / dataset collection ids in jobs for planemo run