Fixed #33699 -- Made ASGI request read body on-demand. #15704

Flauschbaellchen · 2022-05-18T06:14:59Z

Prior to this change, the request body was written into a spooled temporary file,
as the HttpRequest class depends on a byte/io-like stream to process and
parse its content as a whole.
As the ASGI request arrives in separate chunks,
those chunks were first concatenated using this file.

However, doing so increased latency on the server side,
as the file always had to be written completely and then re-read before it could be parsed.
This was especially bad for big file uploads, resulting in (Gateway)
Timeouts if the Django server was running behind a reverse proxy.

This change fixes this issue by wrapping the ASGI request and providing
a byte/io-like stream interface to it.
This way, reading the request's content is delayed until it needs to be
parsed, and it is then delivered directly to the upload handlers, resulting
in reduced latency.
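
To illustrate the idea described above, here is a minimal, self-contained sketch of a stream that pulls ASGI messages only when it is read. It is not the code from this PR: `LazyBodyStream`, `receive_sync`, `fake_receive` and the local `RequestAborted` are hypothetical names, and the blocking `receive_sync` callable stands in for something like `async_to_sync(receive)`.

```python
import io


class RequestAborted(Exception):
    """Local stand-in for Django's RequestAborted (assumption for this sketch)."""


class LazyBodyStream(io.RawIOBase):
    """Hypothetical wrapper: fetches ASGI body messages only when read."""

    def __init__(self, receive_sync):
        # receive_sync is assumed to be a blocking callable that returns
        # the next ASGI message dict.
        self._receive = receive_sync
        self._buffer = b""
        self._has_more = True

    def readable(self):
        return True

    def readinto(self, b):
        # Pull messages until there is data to hand out or the body ends.
        while not self._buffer and self._has_more:
            message = self._receive()
            if message["type"] == "http.disconnect":
                self._has_more = False  # never call receive again
                raise RequestAborted()
            self._buffer = message.get("body", b"")
            self._has_more = message.get("more_body", False)
        n = min(len(b), len(self._buffer))
        b[:n] = self._buffer[:n]
        self._buffer = self._buffer[n:]
        return n  # 0 signals end of stream


# Usage: two chunks, delivered only when the consumer asks for them.
messages = iter(
    [
        {"type": "http.request", "body": b"hello ", "more_body": True},
        {"type": "http.request", "body": b"world", "more_body": False},
    ]
)
receive_calls = []


def fake_receive():
    receive_calls.append(1)
    return next(messages)


stream = LazyBodyStream(fake_receive)
calls_before_read = len(receive_calls)  # nothing has been fetched yet
first = stream.read(3)
rest = stream.read()
```

Because the class derives from `io.RawIOBase`, only `readinto()` has to be written; `read()` and `readall()` are inherited, so no temporary file is needed to present the chunks as one stream.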

Flauschbaellchen · 2022-05-18T06:16:12Z

This PR references the discussion on https://groups.google.com/g/django-developers/c/fu6ZSmu-YJE.

github-actions · 2022-05-18T06:24:08Z

Hello @Flauschbaellchen! Thank you for your contribution 💪

As it's your first contribution be sure to check out the patch review checklist.

If you're fixing a ticket from Trac make sure to set the "Has patch" flag and include a link to this PR in the ticket!

If you have any design or process questions then you can ask in the Django forum.

Welcome aboard ⛵️!

carltongibson · 2022-05-18T06:59:38Z

Hi @Flauschbaellchen — thanks for this! First thing is to address the CI failures so we've got a clean slate — Looks like asgi.tests.ASGITest.test_disconnect (plus the lint errors).

Flauschbaellchen · 2022-05-18T13:10:37Z

@carltongibson Can I ask you to take a look at the asgi.tests.ASGITest.test_disconnect test and help me fix it?
As reading is now delayed until the HttpRequest parses the body, ASGIStream's receive() is currently never called within the test, and thus no TimeoutError is raised.
I think the AsyncRequestFactory/ApplicationCommunicator needs to be adjusted, but I cannot see what I'd need to change.

carltongibson · 2022-05-18T13:17:40Z

@Flauschbaellchen — OK, let me take a look. (🤹 — Might be next week.)

carltongibson · 2022-05-19T08:22:28Z

Hey @Flauschbaellchen — I'm picking this up now.

Can I ask, could you put together a minimal app showing the upload/download example you raised in the discussion? You're creating a 1GB file; uploading; waiting for what? The same file sent back? etc. — If you could put together the minimal Django app for that, we can run it with gunicorn/uvicorn/Daphne/etc. and see the differences.
(There's a wider need to set up such benchmarks, so this would tie into that too.)

Thanks

carltongibson

Can I ask you to take a look into the asgi.tests.ASGITest.test_disconnect test and help me out to fix this?

The issue is that the http.disconnect handling in _receive_more_data() is not triggered until the input queue is read from (at least once). We'll need to consider how to handle this, maybe with some kind of lookahead... 🤔 But I need to ponder more.
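
The behavior described here can be reproduced in isolation. The following sketch uses hypothetical names (`lazy_body`, `fake_receive`, and a local `RequestAborted` stand-in) and does not use Django or asgiref; it only shows that with on-demand reading, an early disconnect goes unnoticed until the first read.

```python
class RequestAborted(Exception):
    """Local stand-in for Django's RequestAborted (assumption for this sketch)."""


def lazy_body(receive_sync):
    """Yield body chunks, calling receive_sync only when a chunk is requested."""
    more = True
    while more:
        message = receive_sync()
        if message["type"] == "http.disconnect":
            raise RequestAborted()
        more = message.get("more_body", False)
        yield message.get("body", b"")


received = []


def fake_receive():
    # The client disconnected before sending any body.
    received.append("called")
    return {"type": "http.disconnect"}


body = lazy_body(fake_receive)  # the request is set up, nothing is read yet
calls_before_read = len(received)  # the disconnect has not been observed

try:
    next(body)  # the first read is what finally surfaces the disconnect
    aborted = False
except RequestAborted:
    aborted = True
```

A test that never reads the body therefore never reaches the disconnect branch, which matches the failure seen in test_disconnect.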

carltongibson · 2022-05-19T09:21:20Z

django/core/handlers/asgi.py

+        message = async_to_sync(self.receive)()
+        if message["type"] == "http.disconnect":
+            # Early client disconnect.
+            self._has_more = False  # safeguard against trying to call receive again
+            raise RequestAborted()


This isn't triggered until the input queue is read at least once.

Flauschbaellchen · 2022-05-19

As this is the intended behavior, maybe a better approach would be to test ASGIStream separately for this case, as a unit test instead of the current integration test?

carltongibson · 2022-05-19

Yes, happy to look at a refactor.

But that integration test should pass, no? (I do expect a timeout after a disconnect...?)

carltongibson · 2022-05-24

We're not currently handling this correctly, I think. Follow-up at https://code.djangoproject.com/ticket/33738

carltongibson · 2023-03-09T10:13:14Z

Closing as per ticket.

carltongibson changed the title ~~Process ASGI request input directly as a stream~~ Fixed #33699 -- Made ASGI request consumed body on-demand. May 18, 2022

carltongibson changed the title ~~Fixed #33699 -- Made ASGI request consumed body on-demand.~~ Fixed #33699 -- Made ASGI request read body on-demand. May 18, 2022

Flauschbaellchen force-pushed the asgi-request-body-streaming branch from 74b99e7 to 2ef8e76 Compare May 18, 2022 13:05

carltongibson self-requested a review May 18, 2022 13:17

carltongibson reviewed May 19, 2022

carltongibson closed this Mar 9, 2023
