Optimized @async_unsafe. #14911

adamchainz · 2021-09-29T08:52:23Z

Switched the order of the checks to reduce the overhead. Async unsafe methods are normally called syncrhonously, so we can avoid the overhead of checking the environment variable in the regular path.

adamchainz · 2021-09-29T08:52:36Z

Before:

In [1]: from django.utils.asyncio import async_unsafe

In [2]: ai = async_unsafe(int)

In [3]: %timeit ai()
1.37 µs ± 13.6 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

After:

In [1]: from django.utils.asyncio import async_unsafe

In [2]: ai = async_unsafe(int)

In [3]: %timeit ai()
395 ns ± 1.68 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

Calling int() takes ~50ns, so we can take that from each number, giving ~1300ns before and ~350ns after, ~4 times faster.

A small boost but it multiplies as async_unsafe decorated functions are called several times per query.

kezabelle

Hmmmm. I never thought fetching from os.environ would be the slow part, especially compared to try/except (which won't be zero-cost until .... 3.11?).

Nevertheless, I can replicate the difference.

In [1]: import os
In [2]: %timeit os.environ.get('DJANGO_ALLOW_ASYNC_UNSAFE') # pssst, it's not set...
1.06 µs ± 2.17 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

Before:

[... prelude]
In [3]: %timeit ai()
1.79 µs ± 23.6 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

After:

[...prelude]
In [3]: %timeit ai()
454 ns ± 2.26 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

On that basis, the proposed change makes sense to me, but I'm not well-versed enough in the async story in general to say it's correct (beyond the tests passing), so I'll leave it without approval.

(eg: is there any semantic difference in asking for a loop regardless of the DJANGO_ALLOW_ASYNC_UNSAFE var? is that side-effect free? I'm not qualified to say tbh)

adamchainz · 2021-09-29T12:21:54Z

Good question on the semantics. I think it's side effect free to ask for the running loop, but this is obscured a bit by some thread local and caching logic in https://github.com/python/cpython/blob/main/Modules/_asynciomodule.c

carltongibson

I agree get_running_loop should be side-effect free. (Ultimately it just looks up the pointer set in set_event_loop().)

I never thought fetching from os.environ would be the slow part...

Indeed 🧐

Are the import changes related to the optimisation? 🤔

adamchainz · 2021-09-30T07:42:17Z

Are the import changes related to the optimisation? 🤔

They're another micro-optimization to require fewer attribute accesses.

carltongibson

OK, fine, seems sensible 👍

(I'll process this in a bit.)

tim-mccurrach · 2021-09-30T10:15:11Z

Whilst we're micro-optimising could we use _get_running_loop to shave a few more microseconds off. Then inner would become:

def inner(*args, **kwargs):
    if _get_running_loop() and not os.environ.get('DJANGO_ALLOW_ASYNC_UNSAFE'):
        raise SynchronousOnlyOperation(message)
    return func(*args, **kwargs)

Or would the use of the private function be too subject-to-change for us to use?
Unfortunately, I'm not able to test this for time right now, but I suspect it would make it marginally quicker (famous last words...).

Switched the order of the checks to reduce the overhead. Async unsafe methods are *normally* called syncrhonously, so we can avoid the overhead of checking the environment variable in the regular path.

carltongibson · 2021-09-30T10:17:23Z

@tim-mccurrach I think that would be a bridge too far.

tim-mccurrach · 2021-09-30T10:17:56Z

@tim-mccurrach I think that would be a bridge too far.

haha. Fair enough :)

tim-mccurrach · 2021-09-30T10:31:24Z

@carltongibson
Although having said that, quickly testing just now, it does seem to cut the time in half again:

>>> timeit.repeat("ai()",repeat=7, number=100000, setup="from __main__ import ai")
[0.0292515020000792, 0.023990991999653488, 0.020490372999574902, 0.020217061000039394, 0.0210146330000498, 0.020252889999937906, 0.020329448000211414]
>>> timeit.repeat("ai2()",repeat=7, number=100000, setup="from __main__ import ai2")
[0.05905059799988521, 0.04216317100008382, 0.042790552000042226, 0.04461314699983632, 0.04397789999984525, 0.04326298099977066, 0.04322846100012612]

(ai2 is the function as it stands in this PR).

It's also (perhaps more importantly) more readable (IMO).

But I agree, it may well be a bridge too far 🤷

carltongibson · 2021-09-30T10:33:34Z

@tim-mccurrach given that Django 4.1 will support 4 or 5 versions of Python using private APIs is not a good idea.

tim-mccurrach · 2021-09-30T10:34:48Z

@tim-mccurrach given that Django 4.1 will support 4 or 5 versions of Python using private APIs is not a good idea.

👍 Yes, that seems sensible.

adamchainz requested a review from kezabelle September 29, 2021 08:52

kezabelle reviewed Sep 29, 2021

View reviewed changes

felixxm requested a review from carltongibson September 30, 2021 05:03

carltongibson reviewed Sep 30, 2021

View reviewed changes

carltongibson approved these changes Sep 30, 2021

View reviewed changes

carltongibson force-pushed the micro_optimize_async_unsafe branch from 2f17cf5 to 364d86c Compare September 30, 2021 09:56

Optimized @async_unsafe.

37d9ea5

Switched the order of the checks to reduce the overhead. Async unsafe methods are *normally* called syncrhonously, so we can avoid the overhead of checking the environment variable in the regular path.

carltongibson force-pushed the micro_optimize_async_unsafe branch from 364d86c to 37d9ea5 Compare September 30, 2021 10:16

carltongibson merged commit 37d9ea5 into django:main Sep 30, 2021

adamchainz deleted the micro_optimize_async_unsafe branch September 30, 2021 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimized @async_unsafe. #14911

Optimized @async_unsafe. #14911

adamchainz commented Sep 29, 2021

adamchainz commented Sep 29, 2021

kezabelle left a comment

adamchainz commented Sep 29, 2021

carltongibson left a comment

adamchainz commented Sep 30, 2021

carltongibson left a comment

tim-mccurrach commented Sep 30, 2021 •

edited

carltongibson commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

carltongibson commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

Optimized @async_unsafe. #14911

Optimized @async_unsafe. #14911

Conversation

adamchainz commented Sep 29, 2021

adamchainz commented Sep 29, 2021

kezabelle left a comment

Choose a reason for hiding this comment

adamchainz commented Sep 29, 2021

carltongibson left a comment

Choose a reason for hiding this comment

adamchainz commented Sep 30, 2021

carltongibson left a comment

Choose a reason for hiding this comment

tim-mccurrach commented Sep 30, 2021 • edited

carltongibson commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

carltongibson commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021

tim-mccurrach commented Sep 30, 2021 •

edited