Async file operations #24509
Comments
I see more and more reports of people having discrepancies due to PHP timeouts. Unfortunately we can't roll back an FS change from a killed PHP process. However, if we had some kind of journal or operations queue, it should be possible to either redo or roll back the last operation. This all fits well with the "async file operation" concept. CC @butonic
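As an illustration of the journal idea, here is a minimal sketch assuming a hypothetical `oc_fs_journal` table (none of these names exist in the codebase): every operation is recorded as pending before it runs and marked done afterwards, so a background job can redo or roll back whatever a killed process left behind.

```php
<?php
// Hypothetical sketch: journal a filesystem operation so it can be
// replayed or rolled back if the PHP process is killed mid-way.
// Illustrative table: oc_fs_journal(id, operation, source, target, status)

function journaledRename(PDO $db, string $source, string $target): void {
    $db->prepare(
        "INSERT INTO oc_fs_journal (operation, source, target, status)
         VALUES ('rename', ?, ?, 'pending')"
    )->execute([$source, $target]);
    $journalId = (int)$db->lastInsertId();

    rename($source, $target);

    $db->prepare("UPDATE oc_fs_journal SET status = 'done' WHERE id = ?")
       ->execute([$journalId]);
}

// A background job scans for stale 'pending' entries left by killed
// processes and either completes them or marks them done.
function recoverPending(PDO $db): void {
    foreach ($db->query("SELECT * FROM oc_fs_journal WHERE status = 'pending'") as $row) {
        if (file_exists($row['target']) && !file_exists($row['source'])) {
            // The operation actually finished; only the bookkeeping is missing.
            $db->prepare("UPDATE oc_fs_journal SET status = 'done' WHERE id = ?")
               ->execute([$row['id']]);
        } elseif (file_exists($row['source'])) {
            // The operation never ran; redo it.
            rename($row['source'], $row['target']);
            $db->prepare("UPDATE oc_fs_journal SET status = 'done' WHERE id = ?")
               ->execute([$row['id']]);
        }
    }
}
```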
If we ever go the "Webdav sync" route one day, that will need a table containing all changes.
Some further ideas:

Or well, get rid of the filecache...
I tried renaming "test" to "test2" with a lot of children inside. In theory it's only about renaming "test" to "test2" without touching any children; even the file ids stay the same.
Maybe we do need closure tables to get rid of the "path" column: #4209. While closure tables might not increase regular read speed, if they can help solve the timeout issues on long-running MOVE or DELETE then they might be worth it. Data loss 🔔
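To illustrate why dropping the stored path makes such a rename cheap, here is a sketch assuming a simplified, hypothetical filecache layout where entries store only their own name and parent id: renaming "test" to "test2" is then a single-row update, no matter how many children exist.

```php
<?php
// Hypothetical sketch: filecache without a materialized `path` column.
// Illustrative table: oc_filecache(fileid, parent, name)

// Renaming a folder touches exactly one row; descendants keep their
// fileids and rows untouched because they only reference the parent id.
function renameEntry(PDO $db, int $fileId, string $newName): void {
    $db->prepare("UPDATE oc_filecache SET name = ? WHERE fileid = ?")
       ->execute([$newName, $fileId]);
}

// The price: resolving a full path now means walking up the tree
// (or consulting a closure table mapping ancestor => descendant).
function getPath(PDO $db, int $fileId): string {
    $parts = [];
    while ($fileId !== -1) { // -1 used here as the root sentinel
        $stmt = $db->prepare("SELECT parent, name FROM oc_filecache WHERE fileid = ?");
        $stmt->execute([$fileId]);
        $row = $stmt->fetch(PDO::FETCH_ASSOC);
        array_unshift($parts, $row['name']);
        $fileId = (int)$row['parent'];
    }
    return implode('/', $parts);
}
```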
A good read but probably not useful as it will likely not work on shared hosters: http://symcbean.blogspot.de/2010/02/php-and-long-running-processes.html
If we do make a request async (like DELETE or MOVE), we could use this approach: http://restcookbook.com/Resources/asynchroneous-operations/ But not sure how standard Webdav clients would react... Or we'd need to optimistically tell them that we succeeded even though we just queued the request.
Oh oh, looks like 202 might be acceptable, see https://msdn.microsoft.com/en-us/library/aa142865(v=exchg.65).aspx which says that it could be used for DELETE.
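A sketch of how the restcookbook pattern could look for DELETE (the `JobQueue` class and the URLs are placeholders, not existing code): the server queues the operation, answers 202 Accepted with a status URL, and the client polls until it gets redirected.

```php
<?php
// Hypothetical sketch of the "asynchronous operation" REST pattern.
// JobQueue is a stand-in for whatever queueing mechanism ends up existing.

function handleDelete(string $path, JobQueue $queue): void {
    $jobId = $queue->push('delete', ['path' => $path]);

    http_response_code(202); // Accepted: queued, not yet performed
    header('Location: /status/' . $jobId);
    echo json_encode(['status' => 'queued', 'job' => $jobId]);
}

function handleStatus(string $jobId, JobQueue $queue): void {
    $job = $queue->find($jobId);
    if ($job->isDone()) {
        // Operation finished; point the client at the outcome.
        http_response_code(303); // See Other
        header('Location: ' . $job->resultUrl());
    } else {
        http_response_code(200);
        echo json_encode(['status' => 'pending']);
    }
}
```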
I hacked Sabre locally for a quick test:
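A plausible sketch of such a hack (hypothetical, not the actual patch): a Sabre plugin that intercepts DELETE and short-circuits with a 202.

```php
<?php
use Sabre\DAV\Server;
use Sabre\DAV\ServerPlugin;
use Sabre\HTTP\RequestInterface;
use Sabre\HTTP\ResponseInterface;

// Hypothetical reconstruction: answer DELETE with 202 Accepted
// instead of deleting synchronously.
class AsyncDeletePlugin extends ServerPlugin {
    public function initialize(Server $server) {
        // Lower priority number = runs before Sabre's own DELETE handler.
        $server->on('method:DELETE', [$this, 'asyncDelete'], 90);
    }

    public function asyncDelete(RequestInterface $request, ResponseInterface $response) {
        // queueDeletion() is a placeholder for whatever enqueues the job.
        queueDeletion($request->getPath());

        $response->setStatus(202); // Accepted: deletion will happen later
        return false;              // stop Sabre from handling DELETE itself
    }
}
```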
Large file uploads also require this. The assembly step can take a long time: not only do the file chunks need to be assembled, but the antivirus scan will also kick in, or any other postprocessing. IMO we should show the upload as completed and mark the file as 'in postprocessing', probably even exposing this in the web interface. A PROPFIND would be able to get the metadata, but actually accessing the file should cause a 403 Forbidden together with a Retry-After header?

Marking a file as 'in postprocessing' may lead to a new lifetime column, e.g. to also mark files as deleted. Hm, what do we have: receiving chunks, assembling the file, antivirus scan, content extraction (for workflow), indexing (for search), thumbnail generation, deleted. Those can roughly be separated into where the file is stored and what is done with its content. In that light, for federated shares a status like 'cached locally' would make sense. But I don't know if it makes sense to fit all of these into a single column. It does make sense to have a common pipeline for files that applications can then hook into... hm, need to think on this further.
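A sketch of what such a lifecycle state could look like (names are illustrative, not an agreed-upon design): PROPFIND still sees the metadata, while a download of a not-yet-available file gets a 403 plus Retry-After.

```php
<?php
// Hypothetical per-file lifecycle states, e.g. a `lifecycle` column
// on the filecache row. All names are illustrative.
const LIFECYCLE_RECEIVING      = 'receiving-chunks';
const LIFECYCLE_ASSEMBLING     = 'assembling';
const LIFECYCLE_POSTPROCESSING = 'postprocessing'; // AV scan, indexing, thumbnails...
const LIFECYCLE_AVAILABLE      = 'available';
const LIFECYCLE_DELETED        = 'deleted';

// PROPFIND can still expose the metadata row, but a GET on a file
// that is not yet available is refused with a hint when to retry.
function serveFile(array $fileRow): void {
    if ($fileRow['lifecycle'] !== LIFECYCLE_AVAILABLE) {
        http_response_code(403);
        header('Retry-After: 30'); // seconds; a suggestion for the client
        return;
    }
    readfile($fileRow['storage_path']);
}
```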
I think the key point here is to have 2 different processes:
- process A: receives the request, registers the operation in the queue, and returns immediately
- process B: picks the operation up and performs the actual work in the background
If I remember correctly, there is a trick we can use to spawn process B in an async way without CLI access, although I don't remember if there are caveats to take into account. Given what we have, we'll need to expose at least one additional endpoint for each async operation we want, plus at least one additional endpoint to check the operation status. For example, for uploads we'd have the sync upload (we can use whatever we're doing right now and the same endpoint), and the async upload, which will trigger the sync one at some point. We'll need additional columns / tables in the DB to track the status of the sync operations so we can poll for changes and check when the sync operation is finished. Although this doesn't seem too intrusive, we'll need to take into account that the sync operation needs to report its status somehow so users can check it periodically.
Note that these endpoints don't need to rely on Webdav, so worst case we can use these async operations ourselves even though 3rd-party software would still use the sync ones through Webdav.
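A sketch of the pieces described above, with hypothetical table and function names: process A registers the operation and returns, process B runs the existing synchronous code path, and a status endpoint lets anyone (including non-Webdav clients) poll.

```php
<?php
// Hypothetical status table backing the polling endpoint.
// Illustrative: oc_async_operations(id, type, payload, status, updated_at)

// Process A: the request handler registers the operation and returns.
function enqueueUpload(PDO $db, string $targetPath, string $chunkDir): int {
    $db->prepare(
        "INSERT INTO oc_async_operations (type, payload, status)
         VALUES ('upload', ?, 'queued')"
    )->execute([json_encode(['target' => $targetPath, 'chunks' => $chunkDir])]);
    return (int)$db->lastInsertId();
}

// Process B: the worker runs the existing synchronous code path,
// updating the status as it goes.
function runOperation(PDO $db, int $id): void {
    $db->prepare("UPDATE oc_async_operations SET status = 'running' WHERE id = ?")
       ->execute([$id]);
    // ... perform the regular sync upload/assembly here ...
    $db->prepare("UPDATE oc_async_operations SET status = 'finished' WHERE id = ?")
       ->execute([$id]);
}

// Status endpoint: clients poll this until the operation is finished.
function operationStatus(PDO $db, int $id): string {
    $stmt = $db->prepare("SELECT status FROM oc_async_operations WHERE id = ?");
    $stmt->execute([$id]);
    return (string)$stmt->fetchColumn();
}
```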
@butonic FYI we had the asynchronous PUT implemented in a private fork of the client and server some time ago. It worked by returning (as a header) a "poll URL" from the PUT of the latest chunk. The client would (after having uploaded all chunks) check the poll URL every few seconds to see if the file had been uploaded to the backend. Contact @ogoffart or me if you want more info and/or sources.
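From the client side, that flow might have looked roughly like this (the `OC-Poll-URL` header name is made up for illustration; the fork's actual protocol may differ):

```php
<?php
// Hypothetical client-side sketch of the "poll URL" scheme: the PUT
// of the last chunk returns a header pointing at a status resource.
function uploadLastChunk(string $url, string $chunkFile): string {
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_CUSTOMREQUEST  => 'PUT',
        CURLOPT_POSTFIELDS     => file_get_contents($chunkFile),
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_HEADER         => true, // keep headers to find the poll URL
    ]);
    $raw = curl_exec($ch);
    curl_close($ch);

    preg_match('/^OC-Poll-URL:\s*(\S+)/mi', $raw, $m); // illustrative header
    return $m[1];
}

function waitUntilProcessed(string $pollUrl): void {
    do {
        sleep(5); // check every few seconds, as the fork's client did
        $status = json_decode(file_get_contents($pollUrl), true);
    } while ($status['status'] === 'pending');
}
```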
All these ideas are pointless as long as we have no active job execution mechanism in place.
There is also an additional challenge: while blocking access to a pending file is one thing, what happens with external storage? It seems that we'd need to first upload the pending file to some invisible local temporary space in which it is assembled/virus-scanned, etc., and then upload it to the final storage. But that would cause delays. Or upload it as a temporary part file like we already do; part files are invisible to the clients.
We could also use "part folders" for some operations: #13756
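A sketch of the part-file approach (paths and helper names are illustrative): stream into an invisible `.part` name on the target storage, run postprocessing there, and only rename into place on success, so clients never see a half-finished file.

```php
<?php
// Hypothetical sketch of the part-file commit: write to an invisible
// ".part" name on the target storage, postprocess, then rename atomically.
function commitUpload(string $finalPath, $sourceStream): void {
    $partPath = $finalPath . '.ocTransferId' . uniqid() . '.part';

    $dest = fopen($partPath, 'wb');
    stream_copy_to_stream($sourceStream, $dest);
    fclose($dest);

    // antivirusScan() is a placeholder for whatever postprocessing runs.
    if (!antivirusScan($partPath)) {
        unlink($partPath);            // never expose the rejected file
        throw new RuntimeException('upload rejected by scanner');
    }

    rename($partPath, $finalPath);    // the only moment clients see the file
}
```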
Some file operations, especially deleting from external storage, can take a long time, as the server needs to download the files to the trash bin in the background.
Also there were talks about async PUT: #12097
So I'm opening this ticket to discuss the possibility of having asynchronous file operations.
The good part is that we already have file locking, so it might be possible to leverage this to avoid concurrency issues (see the sketch below).
Also, we need to make sure we stay compatible with Webdav, so async operations would have to be custom Webdav extensions.
@DeepDiver1975
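As a sketch of how the existing locking could bracket an async operation (`LockManager` and `JobQueue` are stand-ins, not the real APIs): the locks are taken when the request is accepted and released only when the background worker finishes, so concurrent requests fail fast instead of racing.

```php
<?php
// Hypothetical sketch: reuse file locking to guard a queued operation.
function acceptAsyncMove(LockManager $locks, JobQueue $queue,
                         string $source, string $target): int {
    // Taking the locks up front makes the queued state visible to
    // everyone else: concurrent writers get a "locked" error immediately.
    $locks->acquire($source, LockManager::EXCLUSIVE);
    $locks->acquire($target, LockManager::EXCLUSIVE);

    return $queue->push('move', ['from' => $source, 'to' => $target]);
}

function workerMove(LockManager $locks, string $source, string $target): void {
    try {
        rename($source, $target);
    } finally {
        // Release only after the operation really finished (or failed).
        $locks->release($source);
        $locks->release($target);
    }
}
```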