Download files in a list of datasets with the name from the list #2023

Closed
jennaj opened this issue Mar 28, 2016 · 6 comments

Comments

@jennaj
Member

jennaj commented Mar 28, 2016

Idea:

Download files in a list of datasets with the name from the list (Samplexyz.bam) as output instead of the name given by galaxy (Galaxy5-[Filter_SAM_or_BAM,_output_SAM_or_BAM_on_data_181__bam]).

Submitted by:

markus.ball via https://biostar.usegalaxy.org/p/16676/

@lparsons
Contributor

This is basically a duplicate of #2140 and related to #2346.

My general suggestion would be name the files according to the element identifiers in the collection. However, when downloading, prefix the names with the name of the collection to provide context. This would be made even better if one could easily rename the collection in a workflow.

@jmchilton
Member

Okay - so currently the naming for downloaded datasets from the history panel when coming in through a collection is:

<GalaxyDatasetHID>-[<name>].<extension>

But it would be more usable if it was:

<GalaxyDatasetCollectionName>-[<element_identifier>].<extension>?

What about the HID of the collection - should that be included? (Currently the HID of the dataset is included, which is useless, and it is really problematic that the dataset even has an HID - see #1810.)
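To make the two naming schemes concrete, here is a minimal sketch in Python. It is not Galaxy's actual download code: the function names and the sample collection name are hypothetical, and the sketch only mirrors the two templates written above, without any of the character sanitizing Galaxy may apply.

```python
def current_download_name(hid: int, name: str, extension: str) -> str:
    """Current scheme: <GalaxyDatasetHID>-[<name>].<extension> (hypothetical helper)."""
    return f"{hid}-[{name}].{extension}"


def proposed_download_name(collection_name: str, element_identifier: str, extension: str) -> str:
    """Proposed scheme: <GalaxyDatasetCollectionName>-[<element_identifier>].<extension>."""
    return f"{collection_name}-[{element_identifier}].{extension}"


# The dataset name is tool-generated, so the current name says little about the sample.
print(current_download_name(5, "Filter SAM or BAM on data 181", "bam"))

# The element identifier carries the sample name the user actually cares about.
print(proposed_download_name("Filtered BAMs", "Samplexyz", "bam"))
```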

@lparsons
Contributor

At the moment, I'm voting for <GalaxyDatasetCollectionName>-[<element_identifier>].<extension>. Thanks @jmchilton.

@stevecassidy
Contributor

Just wanted to comment on this. An important use case for us would be for the user to upload a collection of files, run some processing pipeline on them and download the result. We can preserve the element_identifier between processing steps, but then they can't download the resulting dataset. If they could, the filenames would have a useless (for us) prefix named for the dataset collection. Better would be to download this as a zip file with a directory structure: the directory could be the dataset name, but the individual data files would be named <element_identifier>.

Perhaps the addition of the prefix could be optional in a download dialogue?
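The zip layout suggested here could look roughly like the following sketch, written with Python's standard `zipfile` module. This is an illustration under stated assumptions, not how Galaxy builds its archives: `archive_member_name`, `build_collection_zip`, the `use_directory` flag (standing in for the optional prefix), and the appended `.<extension>` are all hypothetical.

```python
import io
import zipfile


def archive_member_name(collection_name, element_identifier, extension, use_directory=True):
    """Build the path of one file inside the zip (hypothetical helper).

    Assumption: files are named <element_identifier>.<extension>; the collection
    name appears only as an optional top-level directory.
    """
    filename = f"{element_identifier}.{extension}"
    return f"{collection_name}/{filename}" if use_directory else filename


def build_collection_zip(collection_name, elements, use_directory=True):
    """Return zip bytes for an iterable of (element_identifier, extension, data) tuples."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for element_identifier, extension, data in elements:
            member = archive_member_name(
                collection_name, element_identifier, extension, use_directory
            )
            archive.writestr(member, data)
    return buffer.getvalue()


# Two placeholder elements; real payloads would be the dataset contents.
elements = [("Samplexyz", "bam", b"..."), ("Sampleabc", "bam", b"...")]
zip_bytes = build_collection_zip("Filtered BAMs", elements)
print(zipfile.ZipFile(io.BytesIO(zip_bytes)).namelist())
```

Passing `use_directory=False` drops the collection directory, which corresponds to making the prefix optional in a download dialogue.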

jmchilton assigned jmchilton and unassigned nturaga Feb 7, 2017
jmchilton removed the triage label Feb 7, 2017
@jmchilton
Member

@stevecassidy Indeed - this would be great. Thanks for the input!

jmchilton added a commit to jmchilton/galaxy that referenced this issue Apr 27, 2017
…ollections.

xref galaxyproject#1675

This is of limited utility since we don't really expose the name - and intentionally so. Related open bugs/enhancements that still need to be addressed are:

 - Applying rename to the collection (in addition to the elements) - galaxyproject#1680.
 - Download of collection elements with element identifier instead of the name: galaxyproject#2023 / galaxyproject#2140.
@jmchilton
Member

Closing as a duplicate of #2140. Opened an issue based on @stevecassidy's comment above with #3984.
