This repository has been archived by the owner on Sep 10, 2022. It is now read-only.

How does a client handle backtracking? #53

Closed
guoye-zhang opened this issue Jun 21, 2022 · 7 comments

Comments

@guoye-zhang
Contributor

If the request body is dynamically generated and the server backtracks to an offset that's no longer available, what should a client do?

If the server backtracks to 0, could the client upload a different thing? (My thinking is no)

@gregw

gregw commented Jun 21, 2022

A client that sends a dynamically generated body which cannot be regenerated exactly MUST NOT send an upload-token.

Also, a resumed request must send bytes identical to any already sent.
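The identical-bytes requirement can be enforced with a small client-side check, sketched here in Python (the helper name and digest scheme are illustrative, not part of any tus specification): the client records a digest of every byte it has sent, and on resume verifies that the regenerated body reproduces the same prefix before transmitting.

```python
import hashlib

def verify_resume_prefix(sent_digest: str, regenerated: bytes, sent_length: int) -> bool:
    """Check that the first sent_length bytes of a regenerated body
    match what was already transmitted, by comparing SHA-256 digests."""
    if len(regenerated) < sent_length:
        return False
    return hashlib.sha256(regenerated[:sent_length]).hexdigest() == sent_digest

# The client keeps a digest of everything sent so far:
sent = b"dynamically generated body"[:10]
digest = hashlib.sha256(sent).hexdigest()

# On resume, the regenerated body must reproduce the same prefix:
assert verify_resume_prefix(digest, b"dynamically generated body", 10)
assert not verify_resume_prefix(digest, b"different body entirely!!!", 10)
```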

@gregw

gregw commented Jun 21, 2022

I also think a note in the security implications about non-idempotent requests would be good.

@guoye-zhang
Contributor Author

I think how the content is generated shouldn’t be the focus. It works fine as long as the client can reproduce the exact same bytes, so that requirement alone should be sufficient.

We should also mention upload cancellation in case that the client cannot resume.

@Acconut
Member

Acconut commented Jul 19, 2022

I can share some details on how we handle backtracking in tus-js-client and tus-java-client. Basically, we divide data sources into two categories:

  1. Seekable data sources, where we can easily seek back to any position of the data source without any additional buffering. Examples are files on disk or in-memory blobs. Backtracking for them is easy and non-problematic.
  2. Non-seekable data sources, which are generated dynamically and are not permanently stored on disk or in-memory. As such, seeking backwards is not possible. Examples are web cam streams, streaming output from other programs etc.
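The two categories map naturally onto Python's file-object protocol; a minimal sketch (the `LiveStream` class is a hypothetical stand-in for a webcam feed or program output):

```python
import io

def is_seekable(source) -> bool:
    """Category 1 sources (files on disk, in-memory blobs) can seek
    back to any position; category 2 sources (live streams) cannot."""
    return hasattr(source, "seekable") and source.seekable()

# An in-memory blob can backtrack to any offset without extra buffering:
blob = io.BytesIO(b"stored upload data")
blob.read(6)
blob.seek(0)
assert blob.read(6) == b"stored"
assert is_seekable(blob)

class LiveStream:
    """Hypothetical non-seekable source, e.g. a webcam feed."""
    def seekable(self):
        return False

assert not is_seekable(LiveStream())
```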

For non-seekable data sources, the client forces the end user to specify a maximum request payload size, which is the maximum number of bytes in a single Upload Transfer Procedure. If the entire upload size exceeds this payload size, the upload has to be split across multiple consecutive Upload Transfer Procedures, which is easily possible.

The key is that the client will buffer all of the data it sends in this Upload Transfer Procedure (either in memory or on disk). If the procedure gets interrupted due to connection issues, the client can simply backtrack to any position in this Upload Transfer Procedure and resume from there. If the Upload Transfer Procedure completes, the client discards the previous buffer and reads the next chunk of data to fill the buffer for the next Upload Transfer Procedure. We can safely discard the previous buffer because the Upload-Offset header in the response guarantees that the server has successfully saved the upload up to this offset.

The maximum request payload size lets the client balance smaller buffer usage against larger request sizes (for less overhead).

@guoye-zhang
Contributor Author

Yes, that makes a lot of sense. What I'm thinking about is implementing a rolling window buffer (maybe the last 1MB, but we'll adjust based on telemetry) in the case that the content is dynamically generated. Having chunked uploads is definitely more reliable, since you get real confirmation from the server that a chunk has been received, but in our situation, if we aren't aware of server support, we can't start with chunked uploads.
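A rolling window like that could be sketched as follows (the class name and API are assumptions, and the capacity is shrunk for the example): the client retains only the trailing bytes of what it has sent, so it can replay from any offset inside the window but must give up if the server backtracks further.

```python
class RollingWindow:
    """Hypothetical rolling buffer that retains only the last
    `capacity` bytes sent (e.g. the last 1 MB in practice)."""
    def __init__(self, capacity=1024 * 1024):
        self.capacity = capacity
        self.end = 0      # total bytes sent so far
        self.buf = b""    # trailing window of the sent data

    def append(self, data: bytes):
        self.buf = (self.buf + data)[-self.capacity:]
        self.end += len(data)

    def replay_from(self, offset: int) -> bytes:
        start = self.end - len(self.buf)
        if offset < start:
            raise ValueError("offset outside the window; cannot resume")
        return self.buf[offset - start:]

w = RollingWindow(capacity=8)
w.append(b"0123456789")            # 10 bytes sent, last 8 retained
assert w.replay_from(5) == b"56789"   # inside the window: resumable

try:
    w.replay_from(1)               # before the window: upload must fail
    assert False
except ValueError:
    pass
```

This is the best-effort trade-off: a server backtracking within the last megabyte is recoverable, anything earlier forces the client to abandon or restart the upload.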

@Acconut
Member

Acconut commented Jul 20, 2022

Your approach is more optimistic and sounds like a best-effort approach without strict guarantees. But that might be OK, depending on your application and situation.

Maybe in the future it will be possible to receive new Upload-Offsets using multiple intermediate 1XX responses, so the client has guarantees about the uploaded data without having to use multiple Upload Transfer Procedures. However, I don't know if multiple intermediate responses are allowed right now in HTTP.

I think the tus v2 specification should not force clients to a specific behavior regarding backtracking, so they can choose how to handle such situations. But we can add recommendations so people know how to handle it.

@guoye-zhang
Contributor Author

Yes, multiple intermediate responses are allowed. It could work, sort of like an HTTP-level ACK.

Agreed, we don't need to specify how to handle backtracking, just need to mention that backtracking is expected and should be handled gracefully.
