Chunked upload support #1468

Merged: 1 commit into opendatateam:master from the chunked-upload branch on Mar 2, 2018

Conversation

noirbizarre
Contributor

This PR adds chunked upload support, i.e. uploads are no longer limited by max_body_size and udata is able to handle big file uploads.

This is a first pass, only adding chunked and concurrent upload support (a rough client-side sketch follows the list below).

What can be improved in other PRs:

  • pause/resume support (admin widget controls)
  • per-chunk controls (size, supported formats, ...)
  • more test refactoring, to cover both the classic blueprint endpoint and the API endpoint
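
For context, here is a minimal sketch of what a client-side chunked upload could look like against the parser fields visible in this PR (totalparts, partbyteoffset, totalfilesize, chunksize, file). The endpoint URL, the chunk size, and any extra fields the real API may require (e.g. a part index) are assumptions, not the actual udata API:

# Hypothetical client-side sketch only: field names mirror the upload_parser
# args shown in this PR, but UPLOAD_URL and CHUNK_SIZE are placeholders and
# the real API may require extra fields (e.g. a part index) not shown here.
import os
import requests

UPLOAD_URL = 'https://data.example.org/api/1/upload/'  # placeholder
CHUNK_SIZE = 2 * 1024 * 1024  # arbitrary 2 MiB chunks


def upload_in_chunks(path):
    total_size = os.path.getsize(path)
    total_parts = max(1, -(-total_size // CHUNK_SIZE))  # ceiling division
    with open(path, 'rb') as f:
        for index in range(total_parts):
            chunk = f.read(CHUNK_SIZE)
            response = requests.post(
                UPLOAD_URL,
                data={
                    'partbyteoffset': index * CHUNK_SIZE,
                    'totalfilesize': total_size,
                    'totalparts': total_parts,
                    'chunksize': len(chunk),
                },
                files={'file': (os.path.basename(path), chunk)},
            )
            response.raise_for_status()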

@abulte (Contributor) left a comment


IMHO file and print, and maybe the end of line, need to be changed then 👍

@@ -187,4 +187,4 @@
description='The web page URL for this dataset', readonly=True),
'score': fields.Float(
description='The internal match score', required=True),
})
})
Contributor

🤔


def on_upload_status(status):
    '''Not an error, just raised when chunk is processed'''
    print('in handler', status.ok, status.error)
Contributor

Remove?

def handle_upload(storage, prefix=None):
    args = upload_parser.parse_args()
    is_chunk = args['totalparts'] > 1
    file = args['file']
Contributor

Reserved name in Python 2
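
(In Python 2, file is a built-in type, so using it as a variable name shadows that built-in; a rename along these lines, with uploaded_file picked purely as an example, would avoid it:)

# Illustration only: 'file' is a built-in type on Python 2, so rebinding it
# shadows the built-in; any other name (here, hypothetically, uploaded_file)
# avoids that.
args = {'file': b'chunk-bytes', 'totalparts': 3}

uploaded_file = args['file']        # no shadowing of the built-in
is_chunk = args['totalparts'] > 1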



def extension(filename):
    '''Properly extract the extension from filename'''
Contributor

Cool, we had problems with that 👍

Contributor

I think we'll still have problems with that kind of filename, though: Capture d'écran 2018-02-13 16.03.07.png. Maybe check against a list of known extensions and, if one is found, do not try to parse a compound one? But it could get complicated.
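
A rough sketch of that idea (the known-extensions list and the exact fallback here are hypothetical):

# Hypothetical sketch of the suggestion above: only accept extensions from a
# known list, longest match first, and fall back to the last dot otherwise.
KNOWN_EXTENSIONS = {'png', 'jpg', 'csv', 'json', 'zip', 'tar.gz'}  # illustrative

def extension(filename):
    '''Extract the extension from filename, ignoring dots inside the name.'''
    name = filename.lower()
    for ext in sorted(KNOWN_EXTENSIONS, key=len, reverse=True):
        if name.endswith('.' + ext):
            # "Capture d'écran 2018-02-13 16.03.07.png" yields 'png',
            # not a compound '03.07.png'.
            return ext
    return name.rsplit('.', 1)[-1] if '.' in name else ''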

Contributor Author

Yes, I didn't change anything, I just grouped an existing function.
I think I can improve the behavior, but in another PR.

'partbyteoffset': 0,
'totalfilesize': parts,
'totalparts': parts,
'chunksize': 1
Contributor

Add a comma here if you add one on the last item of the dict below ;-)

'partbyteoffset': 0,
'totalfilesize': parts,
'totalparts': parts,
'chunksize': 1
Contributor

,

@noirbizarre
Contributor Author

Changes done

@noirbizarre noirbizarre merged commit ad89011 into opendatateam:master Mar 2, 2018
@noirbizarre noirbizarre deleted the chunked-upload branch March 2, 2018 10:07