
AsyncRead and AsyncWrite #5

Open
nrc opened this issue Dec 7, 2021 · 45 comments
Labels
A-stdlib Area: a standard library for async Rust

Comments

@nrc
Owner

nrc commented Dec 7, 2021

Tracking issue for work on AsyncRead and AsyncWrite. The eventual goal is that we have a single, standardised version of these traits in std which are used by all (or at least most mainstream) async runtimes.

Known technical issues:

  • should these traits use poll_read/poll_write functions or async fn read/async fn write
  • how to handle writing to uninitialised memory in AsyncRead
  • how to simultaneously read and write
  • how to do vectored IO
  • working in no_std scenarios (I believe this only requires moving the Error trait to libcore, which is work in progress)
  • ensuring these traits work well as trait objects
  • using async drop for shutdown?

There are also several closely related traits, such as AsyncSeek and AsyncBufRead, and, in various libraries, extension traits.
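For illustration, the two candidate shapes from the first bullet might look like this (a minimal sketch; the trait and type names here are placeholders, not a proposal):

```rust
use std::io::Result;
use std::pin::Pin;
use std::task::{Context, Poll};

// Shape 1: poll-based, the style used by futures-io and tokio today.
trait PollRead {
    fn poll_read(
        self: Pin<&mut Self>,
        cx: &mut Context<'_>,
        buf: &mut [u8],
    ) -> Poll<Result<usize>>;
}

// Shape 2: an `async fn` directly in the trait (this now compiles on
// stable Rust, but was not available when this issue was opened).
trait AsyncFnRead {
    async fn read(&mut self, buf: &mut [u8]) -> Result<usize>;
}

// A trivial poll-based source that is always ready and always at EOF.
struct Empty;

impl PollRead for Empty {
    fn poll_read(
        self: Pin<&mut Self>,
        _cx: &mut Context<'_>,
        _buf: &mut [u8],
    ) -> Poll<Result<usize>> {
        Poll::Ready(Ok(0))
    }
}
```

The poll form is dyn-safe and executor-agnostic today; the `async fn` form reads like the sync traits but raises the dyn-safety questions discussed below.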

Current implementations:

Smol re-exports the futures versions.

@nrc nrc added the A-stdlib Area: a standard library for async Rust label Dec 7, 2021
@nrc
Owner Author

nrc commented Dec 7, 2021

Some resources:

@TennyZhuang

TennyZhuang commented Dec 7, 2021

  • should these traits use poll_read/poll_write functions or async fn read/async fn write

async fn is better, but it is blocked on async fn in traits, which is unlikely to be stabilized in the short term. 😞

@ibraheemdev

ibraheemdev commented Dec 7, 2021

It's actually blocked on dyn-safe (inline) async traits.

@nrc
Owner Author

nrc commented Dec 8, 2021

I think we have a solution to dyn-safe async traits that doesn't require 'inline' async.

@nrc
Owner Author

nrc commented Dec 9, 2021

ReadBuf (RFC 2930) has landed on nightly now (rust-lang/rust#81156)
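For context, the core idea of ReadBuf is to track a "filled" and an "initialized" region over possibly-uninitialized memory, so readers never have to zero a buffer first and callers never observe uninitialized bytes. A minimal stand-alone sketch of that idea (not the actual std type, whose API is richer):

```rust
use std::mem::MaybeUninit;

// A stripped-down analogue of RFC 2930's ReadBuf: a view over
// possibly-uninitialized storage plus counters for how much of it has
// been filled with data and how much has ever been initialized.
struct ReadBufSketch<'a> {
    buf: &'a mut [MaybeUninit<u8>],
    filled: usize,
    initialized: usize,
}

impl<'a> ReadBufSketch<'a> {
    fn new(buf: &'a mut [MaybeUninit<u8>]) -> Self {
        ReadBufSketch { buf, filled: 0, initialized: 0 }
    }

    // Append data, initializing exactly as much memory as needed.
    fn append(&mut self, data: &[u8]) {
        assert!(data.len() <= self.buf.len() - self.filled);
        for (dst, &src) in self.buf[self.filled..].iter_mut().zip(data) {
            dst.write(src);
        }
        self.filled += data.len();
        self.initialized = self.initialized.max(self.filled);
    }

    // The filled region is known-initialized, so this view is safe to
    // hand out to callers.
    fn filled(&self) -> &[u8] {
        // SAFETY: bytes in 0..self.filled were written via `append`.
        unsafe {
            std::slice::from_raw_parts(self.buf.as_ptr() as *const u8, self.filled)
        }
    }
}
```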

@yoshuawuyts

Async-std: Read Write
Smol re-exports the futures versions.

Note that async-std also re-exports the futures-io traits, but does so using an alias. This does mean the traits are compatible though. In hindsight we probably should've kept the Async{Read,Write} terminology, but we didn't know that at the time.

@yoshuawuyts

Another known blocker that I haven't seen mentioned yet: the working group needs to make a decision on the feasibility of async overloading. This will have consequences for the shape and location of the async IO traits.

Async overloading is being tracked on the roadmap, but does not yet have an initiative owner.

@nrc
Owner Author

nrc commented Dec 16, 2021

Note that async-std also re-exports the futures-io traits

From the docs, they appear to have different definitions? The async-std versions have significantly more methods. Not sure if it is a re-export of an earlier version of the futures definition or something more complex than that.

@yoshuawuyts

yoshuawuyts commented Dec 16, 2021

From the docs, they appear to have different definitions? The async-std versions have significantly more methods.

Yeah, that touches on another thing we probably shouldn't have done: the methods shown on async_std::io::{Read,Write} don't actually exist on those traits; they are only compiled in for the docs, to create the appearance that they do. The way they are actually made available is by importing async_std::io::prelude::*, which includes ReadExt and WriteExt. It's very much a hack, but it allowed us to keep using the futures-io types while still providing a cohesive feel.

The reason why we did this is because we wanted to push the "async-std is an async version of std" idea as far as we could. We wanted to prove that it is indeed possible to map std's abstractions and usage patterns 1:1 to async Rust. And it worked; we now know that it can indeed be done, modulo some missing language features like "async closures", "async drop" and "async traits". But perhaps if we had to try this again, we would have just exposed it as async_std::io::{AsyncRead,AsyncReadExt}. Though obviously that's speaking with the benefit of hindsight.
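The extension-trait pattern described above can be sketched as follows (a sync stand-in to avoid needing an executor; names are illustrative, not the async-std API):

```rust
use std::io::Result;

// The base trait stays minimal, in the futures-io style.
trait Read {
    fn read(&mut self, buf: &mut [u8]) -> Result<usize>;
}

// Convenience methods live in an extension trait with a blanket impl,
// so importing a prelude makes them appear on every `Read` type.
trait ReadExt: Read {
    fn read_to_end_sketch(&mut self, out: &mut Vec<u8>) -> Result<usize> {
        let mut buf = [0u8; 64];
        let mut total = 0;
        loop {
            let n = self.read(&mut buf)?;
            if n == 0 {
                return Ok(total);
            }
            out.extend_from_slice(&buf[..n]);
            total += n;
        }
    }
}

// The blanket impl: every `Read` gets the extension methods for free.
impl<T: Read + ?Sized> ReadExt for T {}

// A toy reader to exercise the blanket impl.
struct SliceReader<'a>(&'a [u8]);

impl Read for SliceReader<'_> {
    fn read(&mut self, buf: &mut [u8]) -> Result<usize> {
        let n = self.0.len().min(buf.len());
        buf[..n].copy_from_slice(&self.0[..n]);
        self.0 = &self.0[n..];
        Ok(n)
    }
}
```

The docs then show `read_to_end_sketch` as if it lived on `Read` itself, which is exactly the "appearance" hack described above.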

@nrc
Owner Author

nrc commented Feb 15, 2022

A proposed design: https://www.ncameron.org/blog/async-read-and-write-traits/

I should flesh out the alternatives with examples and/or why they don't meet the stated goals.

@uazu

uazu commented Feb 15, 2022

Just one comment on the proposed design (which looks fine to me although I'm not really the target audience):

  • For the examples where let mut buf = [0; 1024]; is done after the ready call, I'm trying to figure out how this saves memory, since usually stack space is reserved based on the maximum extent required. Does this depend on buf not being held across any .await, which means it goes onto the real stack instead of into the coroutine context structure? So the aim is to not hold the buffer across any .await?

@nrc
Owner Author

nrc commented Feb 16, 2022

For the examples where let mut buf = [0; 1024]; is done after the ready call, ...

These are simplified examples, in real life we might use one shared buffer or allocate space on the heap to be used in other functions, etc.
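The underlying point about buffers and .await can be observed directly: a local that is live across an .await is stored in the future's state machine, while one declared after the last .await stays on the real stack during poll. A small sketch (exact sizes are compiler-dependent, but the contrast holds):

```rust
// A stand-in for a readiness call; completes immediately.
async fn ready() {}

// Holding the buffer across an `.await` forces it into the future's
// state machine, so the future itself is at least 1024 bytes.
async fn buf_across_await() -> u8 {
    let buf = [0u8; 1024];
    ready().await;
    buf[0]
}

// Declaring the buffer after the last `.await` keeps it out of the
// state machine; it lives on the poll stack instead.
async fn buf_after_await() -> u8 {
    ready().await;
    let buf = [0u8; 1024];
    buf[0]
}
```

Comparing `std::mem::size_of_val` of the two returned futures shows the difference without ever polling them.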

@bestouff

Shouldn't AsyncRead and AsyncReady be renamed Read and Ready in the "Complete version" code block?

@nrc
Owner Author

nrc commented Feb 16, 2022

Shouldn't AsyncRead and AsyncReady be renamed Read and Ready in the "Complete version" code block?

Whoops! They absolutely should. Fixed now, thanks!

@nrc
Owner Author

nrc commented Feb 18, 2022

Feedback from Fuchsia:

The proposal suggests that for optimal performance, Ready should be used followed by a read call later. I think that might be quite challenging for us to implement because it would require locking between the ready notification and the read (to prevent the kernel discarding pages under memory pressure) and AFAICT, there's no indication of how much should be locked in the ready call.

And I wonder if we could add a byte count to Interest.

Discussion ongoing on Zulip

@nrc
Owner Author

nrc commented Feb 18, 2022

Some discussion on a possible alternative where we have a type like smol's Async and some of the API is on that type rather than the io traits (I believe the benefit here is that we keep the differences between the sync and async APIs restricted to a single location).

@nrc
Owner Author

nrc commented Feb 18, 2022

The read_with style helpers from smol::Async may also be helpful for making the memory-optimal path more ergonomic in some cases.

@nrc
Owner Author

nrc commented Mar 17, 2022

I've started to write up the designs from the blog posts in https://github.com/nrc/portable-interoperable/tree/master/io-traits

@yoshuawuyts

Filed #7 related to this issue.

@carllerche

The current version of the traits described in the README includes vectored methods. In practice, I have found this to be a mistake, because it is not possible to have a good default implementation. The user of Read/Write must write a different implementation depending on whether the I/O handle supports vectored ops. What this means in practice is that a library like Hyper, which takes a T: Read + Write, must assume the T does not support vectored ops and avoid calling those methods.

I think, for vectored ops, it should be a separate trait. Converting a T: Read + Write -> VectoredRead + VectoredWrite means wrapping it with a buffer.
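The two-trait split being suggested might be sketched like this (sync stand-ins for brevity; names are illustrative):

```rust
use std::io::{IoSlice, Result};

// Base trait: plain writes only.
trait Write {
    fn write(&mut self, buf: &[u8]) -> Result<usize>;
}

// Vectored writes live in their own trait, implemented only where the
// I/O handle genuinely supports them, so a `T: Write + VectoredWrite`
// bound is a real capability rather than a runtime boolean check.
trait VectoredWrite: Write {
    fn write_vectored(&mut self, bufs: &[IoSlice<'_>]) -> Result<usize>;
}

// A toy in-memory writer that opts in to both traits.
struct VecWriter(Vec<u8>);

impl Write for VecWriter {
    fn write(&mut self, buf: &[u8]) -> Result<usize> {
        self.0.extend_from_slice(buf);
        Ok(buf.len())
    }
}

impl VectoredWrite for VecWriter {
    fn write_vectored(&mut self, bufs: &[IoSlice<'_>]) -> Result<usize> {
        // A real handle would issue one writev syscall; the toy just
        // loops over the slices.
        let mut n = 0;
        for b in bufs {
            n += self.write(b)?;
        }
        Ok(n)
    }
}
```

A generic consumer can then require `T: VectoredWrite` when it actually needs vectored ops, instead of probing at runtime.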

@NobodyXu

Do you think the

fn is_vectored_writeable() -> bool;

interface found on the existing AsyncWrite/AsyncRead/Write/Read traits solves the problem?

@carllerche

Would it work? Possibly. However, once you start adding boolean checks to see if a trait impl supports a feature or not, this seems to strongly suggest there should be two traits.

@NobodyXu

That's true.

But if it is split into two traits, then what if the user has an IoSlice and doesn't care about efficiency anyway?

@NobodyXu

@Noah-Kennedy I wonder if it is possible for async fn read to work with io-uring without copying.

Suppose that:

  • The future returned by async fn read under io-uring is lazy and does not actually issue any SQE until it is polled (and pinned).
  • The future returned by async fn read is not Unpin (it has PhantomPinned), thus it must be pinned. And a pinned object must either be leaked or dropped.

With these two assumptions, can we implement async fn read in io-uring without copying into an internal owned buffer?

@Noah-Kennedy

Nope, because dropping a future for an in-flight op would still be unsound.

@Noah-Kennedy

You can still make this work via IORING_OP_POLL_ADD, which lets you do readiness-based IO via uring.

@NobodyXu

Nope, because when dropping a future for an in-flight op, it would be unsound still.

Can we add a drop implementation that cancels and waits for cancellation/IO completion?

It would be great if we had async drop though.

@Noah-Kennedy

No, because this would block the runtime.

@NobodyXu

No, because this would block the runtime.

Thanks, sounds like this can only be fixed with async drop.

@Noah-Kennedy

Yup

@nrc
Owner Author

nrc commented Jul 14, 2022

Even async Drop does not fix it. See https://ncameron.org/blog/async-io-with-completion-model-io-systems/ (tl;dr: destructors are not guaranteed to run in Rust)

@Thomasdezeeuw

Even async Drop does not fix it. See https://ncameron.org/blog/async-io-with-completion-model-io-systems/ (tl;dr: destructors are not guaranteed to run in Rust)

I didn't have time to read the post, but based on the tl;dr: it's okay if the destructors aren't run. The problem is if they are run while the OS is still using the buffer. Leaking the buffer would be fine; not ideal of course, but at least it won't be unsound.

I've been thinking about using a separate "clean up" future for this. It would be implemented as a queue to which items can be sent; on receiving an item, it would wait for it to complete and call the destructor, essentially spawning a future as the "async drop". For io-uring this would, for example, be sent a function/future that cancels the I/O operation and drops/deallocates the buffer. Of course this design doesn't work with AsyncRead/AsyncWrite's current borrowed buffers (&mut [u8]), but it would with owned versions.
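That queue idea might be sketched roughly like this (illustrative only; the names and the plain-Vec polling strategy are assumptions, and a real runtime would use a channel and per-future wakers):

```rust
use std::future::Future;
use std::pin::Pin;
use std::task::Context;

// A queue of boxed cleanup futures (e.g. cancel-the-op-then-free-the-
// buffer) that the runtime drives to completion in the background,
// independently of the future that originally owned the I/O operation.
struct CleanupQueue {
    pending: Vec<Pin<Box<dyn Future<Output = ()>>>>,
}

impl CleanupQueue {
    fn new() -> Self {
        CleanupQueue { pending: Vec::new() }
    }

    // Called from a synchronous `Drop` impl: hand the async cleanup off
    // instead of blocking the runtime on it.
    fn submit(&mut self, fut: impl Future<Output = ()> + 'static) {
        self.pending.push(Box::pin(fut));
    }

    // Driven by the runtime; retains only futures that are still pending.
    fn poll_all(&mut self, cx: &mut Context<'_>) {
        self.pending
            .retain_mut(|fut| fut.as_mut().poll(cx).is_pending());
    }
}
```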

@NobodyXu

Even async Drop does not fix it. See https://ncameron.org/blog/async-io-with-completion-model-io-systems/ (tl;dr: destructors are not guaranteed to run in Rust)

If the returned future is not Unpin and has to be pinned, then it should be guaranteed that it is either leaked or dropped, right?

@Thomasdezeeuw

If the returned future is not Unpin and has to be pinned, then it should be guaranteed that it is either leaked or dropped, right?

Pin doesn't guarantee anything regarding whether the value is dropped or not; it only guarantees that it stays in the same memory location.

@NobodyXu

NobodyXu commented Jul 14, 2022

If the returned future is not Unpin and has to be pinned, then it should be guaranteed that it is either leaked or dropped, right?

Pin doesn't guarantee anything regarding whether the value is dropped or not; it only guarantees that it stays in the same memory location.

According to Pin::new_unchecked:

This constructor is unsafe because we cannot guarantee that the data pointed to by pointer is pinned, meaning that the data will not be moved or its storage invalidated until it gets dropped.

Also from std::pin module's drop guarantee:

To make this work, not just moving the data is restricted; deallocating, repurposing, or otherwise invalidating the memory used to store the data is restricted, too. Concretely, for pinned data you have to maintain the invariant that its memory will not get invalidated or repurposed from the moment it gets pinned until when drop is called. Only once drop returns or panics, the memory may be reused.

And this:

Notice that this guarantee does not mean that memory does not leak! It is still completely okay to not ever call drop on a pinned element (e.g., you can still call mem::forget on a Pin<Box>). In the example of the doubly-linked list, that element would just stay in the list. However you must not free or reuse the storage without calling drop.

Essentially, you can only forget pinned data if it is Unpin, or if it is allocated on the heap or in a global variable.
For a stack variable, you must call drop on it.
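Both halves of this can be demonstrated directly: stack-pinned data is dropped when its scope ends (the drop guarantee forces it, since the storage is about to be invalidated), while forgetting heap-pinned data merely leaks the allocation, which the pin docs explicitly permit. A small sketch using std::pin::pin!:

```rust
use std::cell::Cell;
use std::marker::PhantomPinned;
use std::pin::pin;

// A !Unpin type that records when its destructor runs.
struct Tracked<'a> {
    dropped: &'a Cell<bool>,
    _pin: PhantomPinned,
}

impl Drop for Tracked<'_> {
    fn drop(&mut self) {
        self.dropped.set(true);
    }
}

fn stack_pin_is_dropped(flag: &Cell<bool>) {
    // `pin!` pins on the stack; when this scope ends the storage would
    // be invalidated, so `drop` is guaranteed to run here.
    let _p = pin!(Tracked { dropped: flag, _pin: PhantomPinned });
}

fn heap_pin_can_be_leaked(flag: &Cell<bool>) {
    // Forgetting a `Pin<Box<_>>` is allowed: the heap storage is never
    // freed or reused, so the drop guarantee is not violated. The
    // destructor simply never runs.
    let p = Box::pin(Tracked { dropped: flag, _pin: PhantomPinned });
    std::mem::forget(p);
}
```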

@Noah-Kennedy

@Thomasdezeeuw been thinking about that as well actually

@NobodyXu

NobodyXu commented Jul 19, 2022

@Noah-Kennedy @nrc IMO it might be a good idea to standardise bytes, expose its internal vtable, add functions for creating Bytes and BytesMut from provided/managed buffers of io-uring, and the ability to detect whether they contain provided/managed buffers.

Then we can add the following interfaces:

trait Read {
    async fn get_managed_buffers(&mut self, n: NonZeroUsize) -> Result<BytesMut>;
    async fn read_owned(&mut self, n: NonZeroUsize, owned_buf: BytesMut) -> Result<BytesMut>;
    async fn read_into_provided_buf(&mut self, n: NonZeroUsize) -> Result<BytesMut>;
}
trait Write {
    async fn get_managed_buffers(&mut self, n: NonZeroUsize) -> Result<BytesMut>;
    async fn write_owned(&mut self, owned_buf: Bytes) -> Result<usize>;
}

That will enable efficient use of io-uring, since it is zero-copy, and we could also support provided buffers easily.

Though stabilising bytes alone would take a ton of effort.

@nrc
Owner Author

nrc commented Jul 19, 2022

@NobodyXu The trouble with this approach is that it is a long way from the sync versions of Read/Write, and is a pretty un-ergonomic API. There is more on the requirements, etc. here: https://github.com/nrc/portable-interoperable/tree/master/io-traits#requirements

@NobodyXu

@NobodyXu The trouble with this approach is that it is a long way from the sync versions of Read/Write, and is a pretty un-ergonomic API. There is more on the requirements, etc. here: https://github.com/nrc/portable-interoperable/tree/master/io-traits#requirements

Yeah, but I don't see any other way to support completion-based systems efficiently, since they are most efficient with managed/provided buffers.

Passing a reference to a buffer requires copying into an internal buffer, and even if we fix that, it still cannot match the performance of provided buffers, due to the optimizations that can be applied to provided/managed buffers.

Perhaps we can provide separate ReadCompletion and WriteCompletion traits to avoid breaking the symmetry?

@nrc
Owner Author

nrc commented Jul 19, 2022

The proposal in the document is to support completion systems via BufRead and possibly a new OwnedRead trait
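The OwnedRead idea, very roughly: the buffer is passed by value and handed back, so a completion-based runtime can keep it alive for the whole operation even if the caller's future is dropped. The trait name comes from the proposal, but the signature and everything else below is an assumption for illustration only:

```rust
use std::future::Future;
use std::io::Result;
use std::pin::pin;
use std::task::{Context, Poll, RawWaker, RawWakerVTable, Waker};

// Hypothetical owned-buffer read trait (signature is an assumption).
trait OwnedRead {
    async fn read_owned(&mut self, buf: Vec<u8>) -> (Result<usize>, Vec<u8>);
}

// A toy in-memory source to exercise the shape.
struct SliceSource<'a>(&'a [u8]);

impl OwnedRead for SliceSource<'_> {
    async fn read_owned(&mut self, mut buf: Vec<u8>) -> (Result<usize>, Vec<u8>) {
        let n = self.0.len().min(buf.capacity());
        buf.clear();
        buf.extend_from_slice(&self.0[..n]);
        self.0 = &self.0[n..];
        // The buffer travels back to the caller by value.
        (Ok(n), buf)
    }
}

// Drive a future for one poll with a no-op waker; enough here because
// the toy implementation never suspends.
fn poll_once<F: Future>(fut: F) -> Option<F::Output> {
    fn clone(_: *const ()) -> RawWaker {
        RawWaker::new(std::ptr::null(), &VTABLE)
    }
    fn noop(_: *const ()) {}
    static VTABLE: RawWakerVTable = RawWakerVTable::new(clone, noop, noop, noop);
    let waker = unsafe { Waker::from_raw(RawWaker::new(std::ptr::null(), &VTABLE)) };
    let mut cx = Context::from_waker(&waker);
    match pin!(fut).poll(&mut cx) {
        Poll::Ready(v) => Some(v),
        Poll::Pending => None,
    }
}
```

Because the runtime owns the buffer for the duration of the operation, cancelling by dropping the caller's future no longer invalidates memory the kernel may still be writing to.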

@NobodyXu

The proposal in the document is to support completion systems via BufRead and possibly a new OwnedRead trait

Sorry that I missed that part.

@Noah-Kennedy

@NobodyXu when you say "provided buffer", are you referring to owned buffers or kernel-managed buffer groups?

@NobodyXu

@NobodyXu when you say "provided buffer", are you referring to owned buffers or kernel-managed buffer groups?

I am referring to kernel-managed buffer groups.

@Noah-Kennedy

I think that this is something which, while very, very important, is probably out of scope for the current standardization efforts.

@NobodyXu

I think that this is something which, while very, very important, is probably out of scope of current standardization efforts.

Hmmm yeah, I can see that the existing proposal is already quite complex.

Development

No branches or pull requests

10 participants