
Allow multiple simultaneous uploads via single POST. #4563

Merged
merged 1 commit into from Oct 4, 2017

Conversation

jmchilton
Member

The upload.py tool itself already allowed this.

@guerler
Contributor

guerler commented Sep 6, 2017

Is this ready for review, and will it allow specifying separate dbkey/datatype pairs for each file?

@jmchilton
Member Author

jmchilton commented Sep 7, 2017

It lets you add different names, space_to_tab and to_posix_lines values, and such - but I guess not different dbkeys and filetypes. I'll update the PR to do this and add a test case for that this morning.

if override_file_type:
    return override_file_type
else:
    return context.get(self.file_type_name, self.default_file_type)
Contributor

@guerler guerler Sep 7, 2017

Very minor, but could we do something like the following, unless you think it's less performant:

default_type = context.get(self.file_type_name, self.default_file_type)
return context.get("override_file_type", default_type)

I also wonder if we could just name it file_type instead of override_file_type, since the two can be distinguished by being specified at the request level vs. the file-bunch level. Thanks a lot for this addition. This is awesome.

Member Author

Yeah - so I had that at first, but the fact that a value - even though it was empty - was present in the per-file (child) context meant the parent context was ignored. I should probably take another swing at this, though - because you are right, the way it is now kinda sucks.
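To illustrate the pitfall described above, here is a minimal sketch with hypothetical plain dicts standing in for Galaxy's contexts: when the per-file context contains the key with an empty value, a plain dict.get() fallback never consults the parent default.

```python
# Hypothetical dicts standing in for the parent and per-file contexts;
# the per-file field is a hidden form field submitted as an empty string.
parent_context = {"file_type": "fastq"}
file_context = {"file_type": ""}

# Naive lookup: the key is present (though empty), so the parent default is ignored.
naive = file_context.get("file_type", parent_context.get("file_type"))
# naive == ""

# Explicit emptiness check: fall back to the parent when the per-file value is empty.
file_type = file_context.get("file_type") or parent_context.get("file_type")
# file_type == "fastq"
```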

@@ -279,10 +285,18 @@ def value_from_basic(self, value, app, ignore_errors=False):
rval.append(rval_dict)
return rval

def get_file_count(self, trans, context):
    file_count = context.get("file_count", "auto")
    if file_count == "auto":
Member

Just a minor one. Can we make it a one-liner?

file_count = len(self.get_datatype(trans, context).writable_files) if file_count == "auto" else int(file_count)

thank you!
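As a sanity check, the suggested one-liner in a runnable sketch; PairedDatatype here is a hypothetical stand-in for a Galaxy datatype exposing writable_files:

```python
class PairedDatatype:
    # Hypothetical stand-in for a Galaxy datatype; real datatypes expose
    # writable_files describing the composite files they accept.
    writable_files = ["forward", "reverse"]

def get_file_count(context, datatype):
    # "auto" resolves to the number of writable files; anything else is an int.
    file_count = context.get("file_count", "auto")
    return len(datatype.writable_files) if file_count == "auto" else int(file_count)

dt = PairedDatatype()
# get_file_count({}, dt) == 2
# get_file_count({"file_count": "3"}, dt) == 3
```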

Member Author

I'll rebase with this change, thanks!

The upload.py tool itself already allowed this; update the upload dataset grouping to handle it.
@jmchilton
Member Author

I've changed the interface here so that "file_type" or "dbkey" in the parent context are taken as defaults, and in the per-file context these can be overridden with "file_type" or "dbkey", instead of requiring different variables "override_file_type" / "override_dbkey" as I had before. I added a few more tests for different combinations of override versus default for dbkey and file_type.
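A hypothetical payload sketch of the defaulting scheme described above (field names follow this comment; the exact API shape may differ):

```python
# Hypothetical upload payload: request-level values act as defaults,
# per-file entries may override them with the same field names.
payload = {
    "file_type": "fastqsanger",  # default for all files
    "dbkey": "hg19",             # default for all files
    "files": [
        # inherits both defaults
        {"name": "sample1.fq"},
        # overrides both at the per-file level
        {"name": "sample2.bed", "file_type": "bed", "dbkey": "mm10"},
    ],
}
```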

@guerler
Contributor

guerler commented Oct 3, 2017

Thanks a lot for working on this. I aligned the submission format in #4513 and it seems to work well.

if dbkey == "":
    if parent_context:
        dbkey = parent_context.get("dbkey", dbkey)
return dbkey
Contributor

I might be missing something but why are we not just returning parent_context.get("dbkey", context.get("dbkey"))?

Member Author

So the inner dbkey - the per-file one - is just a hidden field in the tool form with a default of "". The outer check consults the top-level dbkey param in the tool, which I think defaults to ?. So if the API request sends a dbkey for all files and then one for a specific file, I think we should use the base dbkey for all files that don't specify an explicit one, and use the specific keys where set. I tried the same strategy with the file extensions. This implementation is crap because we are mapping API requests onto a tool that was never really designed to do this well (these frustrations led me to create #4734).

So I implemented API tests to verify all this behavior with respect to a specific file's dbkey versus the dbkey for all files - if you can rework the implementation to be cleaner in such a way that the API calls don't break, I'm totally on board. I'm quite frustrated with a lot of grouping.py. There are more test cases in #4746 that should also help verify that cleanups to the implementation don't break FTP uploads and such.
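The resolution order described above can be sketched as a simplified stand-alone function (not Galaxy's actual grouping.py code):

```python
def resolve_dbkey(file_context, parent_context=None):
    # Per-file dbkey is a hidden field defaulting to ""; fall back to the
    # request-level dbkey, and finally to "?" (the "unspecified" marker).
    dbkey = file_context.get("dbkey", "")
    if dbkey == "" and parent_context:
        dbkey = parent_context.get("dbkey", dbkey)
    return dbkey or "?"

# per-file key wins over the request-level default:
#   resolve_dbkey({"dbkey": "hg38"}, {"dbkey": "mm10"}) -> "hg38"
# empty per-file key falls back to the request-level default:
#   resolve_dbkey({"dbkey": ""}, {"dbkey": "mm10"}) -> "mm10"
# nothing specified anywhere:
#   resolve_dbkey({}) -> "?"
```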

@guerler guerler merged commit 79a16da into galaxyproject:dev Oct 4, 2017
@jmchilton
Member Author

Thanks for the merge @guerler - let me know if there is anything else I can do to help with optimizing uploads.
