Read remaining bytes when cleaning dropped payload #2764 #2772

squidpickles · 2022-06-02T21:59:58Z

PR Type

Bug Fix

PR Checklist

Tests for the changes have been added / updated.
Documentation comments have been added / updated.
A changelog entry has been made for the appropriate packages.
Format code with the latest stable rustfmt.
(Team) Label with affected crates and semver status.

Overview

Existing behavior did not completely read client data if a Payload was dropped without consuming all of its data. This calls read_available() before each Payload chunk is discarded. This does add a read_available_projected() so it can be called when only a pinned reference to the dispatcher is available.

Closes #2764

squidpickles · 2022-06-06T22:57:35Z

I noticed curl stops sending bytes when the request is over a certain size, and the server returns an error response. (curl message HTTP error before end of send, stop sending)

I can see that we're reading zero bytes in the added read here https://github.com/squidpickles/actix-web/blob/30004d5b9f7fb7f03ed98f5a6f0f26ee67c17186/actix-http/src/h1/dispatcher.rs#L710 but it's not clear whether the true return (called should_disconnect elsewhere) is correct in all cases. Investigating... (also not clear how to write a unit test that mimics this client behavior)

squidpickles · 2022-06-10T23:07:01Z

Still working on this, partly because I realized I'm not totally clear what the desired behavior should be here.

Right now (in the released code), if the Payload is dropped before it has been read completely, the Dispatcher will attempt to "clean" the connection by consuming any bytes in the read buffer. The trouble is that if there are more than decoder::MAX_BUFFER_SIZE bytes sent by the client, the cleaning process will fail (because PayloadDecoder::decode() will be called with bytes remaining and an empty read buffer), and it will jump to the 500 error and connection close.

I added an additional read into the cleaning loop if the read buffer is empty, so now the bytes will be consumed even if there are more than a single read buffer's worth.

However, the situation is complicated because a handler that does not consume the whole Payload will return a response to the client before all bytes are read from the wire. Some clients, such as cURL, will stop sending data if they get an early error response.

I think the main thing I would change here is the 500 error response, and the error logging. It seems best not to log a server error when it's a relic of client behavior. In this case, the 500 is triggered by a client either disconnecting or stopping sending bytes before the cleaning process completes. The handler has already sent a response if it was ever going to, so nothing further really needs to be sent to the client. The only case where it should be logging an error is if there's some reason the server can't read, but the client is still sending bytes. I'm not sure that's something we'd catch here and not elsewhere, so probably not worth treating as a potential error in this situation.

So my proposal is to disconnect when client send terminates early (fewer bytes are sent than the content-length header suggests) as the server does now, but not to send an additional 500 response, and not to log an error. All while preserving the extra read added in this PR, so larger requests still get cleaned properly.

squidpickles · 2022-06-11T00:11:04Z

Ok, well, it looks like that extra read hangs the worker. I haven't worked out the polling mechanism well enough to understand why. Going to leave this PR open and maybe someone with better understanding can comment, but for now, I'll work around it and explicitly consume the entire Payload before returning from my handlers.

squidpickles and others added 3 commits June 2, 2022 14:51

Read remaining bytes when cleaning dropped payload actix#2764

b381052

Added change note for payload read buffer flush actix#2764

9863237

Merge branch 'actix:master' into flush-payload

30004d5

squidpickles added 2 commits June 10, 2022 16:09

Drop connection without error on early read termination actix#2764

a9e89b0

Removed extra read, since it hangs the worker actix#2764

7998c20

robjtede mentioned this pull request Jun 11, 2022

revert broken fix in #2624 #2779

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read remaining bytes when cleaning dropped payload #2764 #2772

Read remaining bytes when cleaning dropped payload #2764 #2772

squidpickles commented Jun 2, 2022

squidpickles commented Jun 6, 2022

squidpickles commented Jun 10, 2022

squidpickles commented Jun 11, 2022

Read remaining bytes when cleaning dropped payload #2764 #2772

Are you sure you want to change the base?

Read remaining bytes when cleaning dropped payload #2764 #2772

Conversation

squidpickles commented Jun 2, 2022

PR Type

PR Checklist

Overview

squidpickles commented Jun 6, 2022

squidpickles commented Jun 10, 2022

squidpickles commented Jun 11, 2022