Generator size: borrowed variables are assumed live across following yield points #59087

Matthias247 · 2019-03-11T06:16:26Z

Maybe a duplicate of #52924, but maybe also something else.

I observed that the sizes of Futures generated by async fns can grow exponentially.
The following code shows an async fn, which produces a 1kB future. Each layering in another async fn doubles it's size:

#![feature(async_await)]

async fn i_am_1kb() -> bool
{
    let x: [u8; 1*1024] = [0; 1*1024];
    async{}.await;
    let _sum: u8 = x.iter().sum();
    true
}

fn main() {
    let fut1 = i_am_1kb();
    dbg!(std::mem::size_of_val(&fut1));

    let composed_1 = async {
        let inner = i_am_1kb();
        inner.await;
    };
    dbg!(std::mem::size_of_val(&composed_1));

    let composed_2 = async {
        let inner = i_am_1kb();
        dbg!(std::mem::size_of_val(&inner));
        inner.await;
    };
    dbg!(std::mem::size_of_val(&composed_2));

    let composed_3 = async {
        let inner = async {
            let inner = async {
                i_am_1kb().await;
            };
            dbg!(std::mem::size_of_val(&inner));
            inner.await;
        };
        dbg!(std::mem::size_of_val(&inner));
        inner.await;
    };
    dbg!(std::mem::size_of_val(&composed_3));
}

Output:

[src/main.rs:16] std::mem::size_of_val(&fut1) = 1032
[src/main.rs:22] std::mem::size_of_val(&composed_1) = 1036
[src/main.rs:29] std::mem::size_of_val(&composed_2) = 2072
[src/main.rs:44] std::mem::size_of_val(&composed_3) = 4168

It doesn't matter whether the statement between the future generation and await! references the future or not. A simply println("") will have the same effect.
Only if the future is directly awaited (as in composed_1) the size will stay constant.

cc @cramertj , @nikomatsakis , @Nemo157

The text was updated successfully, but these errors were encountered:

cramertj · 2019-03-11T17:49:04Z

Closing as a sub-issue of #52924. (I've edited the top message in that thread to reference this one)

cramertj · 2019-03-11T17:52:21Z

cc @tmandry @Zoxc

Nemo157 · 2019-03-11T21:35:24Z

There definitely seems to be something causing locals to be unnecessarily put into the generator struct instead of staying as true-locals. Using a simplified function (with the same i_am_1kb as above)

async fn composed() {
    let inner = i_am_1kb();
    { let _foo = &inner; }
    await!(inner);
}

and running cargo rustc -- -Z dump-mir=generator to dump the mir, the liveness analysis shows that inner is correctly considered dead after being moved to pinned in the await! macro (and so is not alive over a yield), but it is still being put in the generator.

Zoxc · 2019-03-11T22:42:43Z

@Nemo157 The generator transformation conservatively assumes that any borrow can be converted to a raw pointer and the locals can be accessed with that until their storage slot is dead. That's why inner is considered live during the await! here.

cramertj · 2019-03-11T22:52:23Z

Yeah, @eddyb and I discussed this at the all-hands, and that making a type implement Copy is actually a potential perf regression here, which is weird. For non-Copy types, you can assume no accesses after moving out of them, but for Copy types you can't necessarily do this.

Nemo157 · 2019-03-11T22:55:02Z

Ok, so an optimisation to fix this example would be to add a less conservative check that can see when the borrow definitely wasn’t converted to a raw pointer. That seems relatively straightforward to check when the borrow never enters any unsafe code.

cramertj · 2019-03-11T23:04:50Z

add a less conservative check that can see when the borrow definitely wasn’t converted to a raw pointer

Note that for this to be very useful at all it would have to be able to see through functions (via MIR inlining) and intrinsics (e.g. size_of_val above).

Nemo157 · 2019-03-11T23:41:17Z

Can it not trust the lifetimes on safe functions signatures? size_of_val does not have a lifetime dependency so should be relied on to not stash a raw pointer to the reference away somewhere. (I guess this is an UCG question whether safe function boundaries are barriers that require safety to be upheld, and whether future unsafe code can allow prior safe code to violate lifetimes, i.e. is something like this sound or not).

cramertj · 2019-03-11T23:44:25Z

No, lifetimes in function signatures cannot necessarily be used to determine the scope of accesses to the resulting pointer. Without a memory model it's completely unclear when accesses would or wouldn't be allowed to the underlying memory. @RalfJung's work on stacked borrows is the only thing I'm aware of that would allow proper analysis, and in general anywhere there's a ref-to-ptr conversion, all bets are sort of off.

tmandry · 2019-03-11T23:52:35Z

I want to emphasize that this is still a problem even when the variable is not borrowed:

It doesn't matter whether the statement between the future generation and await! references the future or not. A simply println("") will have the same effect.

So there is likely progress to be made here without doing the analysis being discussed by @Nemo157 / @cramertj.

Nemo157 · 2019-03-12T00:07:27Z

From skimming the MIR of

async fn composed() {
    let inner = i_am_1kb();
    { foo(); fn foo() { } }
    await!(inner);
}

it looks like that could be related to the unwind edge from the function call (and println! expands to a few function calls). Another optimization related to the one mentioned in #52924 that could fix this would be to suppress moving values where their lifetimes only intersect during the move, essentially re-using the same stack slot for both inner and pinned (from inside await!) and turning the move into a no-op.

(I tried a couple of other random snippets of code and couldn't see anything else done by println!() that caused the doubled size).

EDIT: Actually, because of how drop chains work it looks like it's going to be more complex than that since inner and pinned have overlapping lifetimes, I have a simpler example for which I'll try and create a chart of the MIR and open a separate issue about this.

Nemo157 · 2019-03-12T02:47:01Z

Opened #59123 about the unwinding and drop interaction.

nikomatsakis · 2019-03-12T17:17:57Z

We discussed this issue and decided to label it as deferred for the purposes of stabilization -- it's a bit too broad. We might consider trying to fix specific instances of this problem. Certainly, to start, we would want to fix #52924 and revisit.

cramertj · 2019-03-19T18:27:40Z

#57478 contains a similar issue.

tmandry · 2019-07-08T19:29:59Z

The size growth in this issue now goes away when the size_of_val calls are replaced with an empty println!() (this was #59123).

However, I think we should leave this issue open to track the general behavior that borrowing a future and then awaiting it causes us to double-allocate space for it, with #62321 tracking the specific case of size_of_val.

tmandry · 2019-07-08T22:44:58Z

Here's the solution I mentioned in #62321 (comment):

Smarter dataflow analysis that tells us when no more borrows or pointers of any kind exist (e.g. similar to what the borrow checker is doing, but stricter so we don't make unsafe code UB).
If we want to handle cases like foo(&x), we'll need to inline those functions in MIR so we can apply our analysis (1) to them.

As noted by @cramertj and @withoutboats, there is an alternative that resolves most cases we care about: get rid of the move in await. This requires

A drop with guaranteed move elision: either std::mem::drop, or an internal one we can lower to (Drop with guaranteed move elision #62508)
Recognizing our future is StorageDead after it's dropped (Resolve the interaction between StorageDead, drops, and unwinding #61015), so we don't break the existing optimization (PR Generator optimization: Overlap locals that never have storage live at the same time #60187)

tmandry · 2019-07-08T23:05:23Z

I meant to say: Right now I prefer the approach of getting rid of the move in await, unless we hit some unforeseen problem with it.

Resolving the general problem of borrow-then-move would be nice, but it doesn't seem like the most bang for our buck at this point.

eddyb · 2019-07-09T03:40:29Z

It would be so much easier to do that drop in place and have borrowck understand the value is no longer accessible, if we were desugaring await in MIR building.
We can probably use an intrinsic for this. I think move_init_val has similar special handling?

This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com>

close #16540 *: enable linters about async and futures We should be pedantic about writing async code, as it's easy to write suboptimal or even bloat code. See: rust-lang/rust#69826 *: remove unnecessary async blocks to save memory This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>

This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com>

close #16540 *: enable linters about async and futures We should be pedantic about writing async code, as it's easy to write suboptimal or even bloat code. See: rust-lang/rust#69826 *: remove unnecessary async blocks to save memory This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com> Co-authored-by: Neil Shen <overvenus@gmail.com>

close tikv#16540 *: enable linters about async and futures We should be pedantic about writing async code, as it's easy to write suboptimal or even bloat code. See: rust-lang/rust#69826 *: remove unnecessary async blocks to save memory This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com> Signed-off-by: dbsid <chenhuansheng@pingcap.com>

This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com>

close #16540 *: enable linters about async and futures We should be pedantic about writing async code, as it's easy to write suboptimal or even bloat code. See: rust-lang/rust#69826 *: remove unnecessary async blocks to save memory This commit favors FutureExt::map over async blocks to mitigate the issue of async block doubled memory usage. Through the sysbench oltp_read_only test, it was observed that this adjustment resulted in approximately 26% reduction in memory usage. See: rust-lang/rust#59087 Signed-off-by: Neil Shen <overvenus@gmail.com> Co-authored-by: Neil Shen <overvenus@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>

jonas-schievink added T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. A-async-await Area: Async & Await labels Mar 11, 2019

cramertj closed this as completed Mar 11, 2019

cramertj mentioned this issue Mar 11, 2019

Generators are too big #52924

Closed

cramertj reopened this Mar 11, 2019

cramertj changed the title ~~async/await: Not directly awaiting an async fn doubles it's size~~ Generator size: borrowed variables are assumed live across following yield points Mar 11, 2019

nikomatsakis added the AsyncAwait-Triaged Async-await issues that have been triaged during a working group meeting. label Mar 12, 2019

Matthias247 mentioned this issue Jul 4, 2019

Don't store locals that have been moved from in generators #61922

Merged

tmandry mentioned this issue Jul 8, 2019

size_of_val in a generator can make the generator bigger #62321

Open

tmandry mentioned this issue Jul 8, 2019

Drop with guaranteed move elision #62508

Open

tmandry mentioned this issue Jul 9, 2019

[Stabilization] async/await MVP #62149

Closed

4 tasks

jebrosen mentioned this issue Sep 4, 2019

Make prepare() return a Future that satisfies Send. sfackler/rust-postgres#477

Merged

tmandry mentioned this issue Dec 31, 2019

Cannot await in scope that contains call to format! #64960

Closed

jonas-schievink mentioned this issue Mar 21, 2020

Tracking Issue for Coroutine Memory Usage #69826

Open

7 tasks

jonas-schievink added A-coroutines Area: Coroutines I-heavy Issue: Problems and improvements with respect to binary size of generated code. C-enhancement Category: An issue proposing an enhancement or a PR with one. labels Mar 21, 2020

csmoe mentioned this issue Mar 8, 2023

async block occupies twice the size of its captures (causing explosion in size). #108906

Open

pnkfelix mentioned this issue Mar 8, 2023

[WIP] mir-opt for generators/futures: copy propagate upvar into locals #108590

Closed

6 tasks

workingjubilee added the C-optimization Category: An issue highlighting optimization opportunities or PRs implementing such label Oct 8, 2023

stepantubanov mentioned this issue Feb 7, 2024

Stack usage aembke/fred.rs#215

Closed

overvenus mentioned this issue Feb 20, 2024

Reduce trivial async block to reduce future size and save memory usage. tikv/tikv#16540

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generator size: borrowed variables are assumed live across following yield points #59087

Generator size: borrowed variables are assumed live across following yield points #59087

Matthias247 commented Mar 11, 2019 •

edited

Loading

cramertj commented Mar 11, 2019 •

edited

Loading

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

Zoxc commented Mar 11, 2019

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

cramertj commented Mar 11, 2019

tmandry commented Mar 11, 2019 •

edited

Loading

Nemo157 commented Mar 12, 2019 •

edited

Loading

Nemo157 commented Mar 12, 2019

nikomatsakis commented Mar 12, 2019 •

edited

Loading

cramertj commented Mar 19, 2019

tmandry commented Jul 8, 2019 •

edited

Loading

tmandry commented Jul 8, 2019

tmandry commented Jul 8, 2019

eddyb commented Jul 9, 2019

Generator size: borrowed variables are assumed live across following yield points #59087

Generator size: borrowed variables are assumed live across following yield points #59087

Comments

Matthias247 commented Mar 11, 2019 • edited Loading

cramertj commented Mar 11, 2019 • edited Loading

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

Zoxc commented Mar 11, 2019

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

cramertj commented Mar 11, 2019

Nemo157 commented Mar 11, 2019

cramertj commented Mar 11, 2019

tmandry commented Mar 11, 2019 • edited Loading

Nemo157 commented Mar 12, 2019 • edited Loading

Nemo157 commented Mar 12, 2019

nikomatsakis commented Mar 12, 2019 • edited Loading

cramertj commented Mar 19, 2019

tmandry commented Jul 8, 2019 • edited Loading

tmandry commented Jul 8, 2019

tmandry commented Jul 8, 2019

eddyb commented Jul 9, 2019

Matthias247 commented Mar 11, 2019 •

edited

Loading

cramertj commented Mar 11, 2019 •

edited

Loading

tmandry commented Mar 11, 2019 •

edited

Loading

Nemo157 commented Mar 12, 2019 •

edited

Loading

nikomatsakis commented Mar 12, 2019 •

edited

Loading

tmandry commented Jul 8, 2019 •

edited

Loading