QUIC API Overview

This document sets out the objectives of the QUIC API design process, describes the new and changed APIs, and the design constraints motivating those API designs and the relevant design decisions.

QUIC API Overview

Overview and Implementation Status

A listing of all SSL object APIs and their implications for QUIC, including current implementation status, can be found in quic-api-ssl-funcs.md.

Non-SSL object APIs which are new or changed, or otherwise discussed in this document are listed below, along with their implementation status. SSL object APIs are not listed here; see quic-api-ssl-funcs.md for details on SSL object APIs.

Semantics	API	Status
TBD	`BIO_s_connect`	TODO
TBD	`BIO_set_conn_hostname`	TODO
TBD	`BIO_new_bio_pair`	TODO
New	`BIO_s_dgram_pair`	Done
Unchanged	`BIO_dgram_get_mtu`	Done
Unchanged	`BIO_dgram_set_mtu`	Done
New	`BIO_sendmmsg`	Done
New	`BIO_recvmmsg`	Done
New	`BIO_dgram_set_no_trunc`	Done
New	`BIO_dgram_get_no_trunc`	Done
New	`BIO_dgram_set_caps`	Done
New	`BIO_dgram_get_caps`	Done
New	`BIO_dgram_get_effective_caps`	Done
New	`BIO_dgram_get_local_addr_cap`	Done
New	`BIO_dgram_set_local_addr_enable`	Done
New	`BIO_dgram_get_local_addr_enable`	Done
New	`BIO_get_rpoll_descriptor`	Done
New	`BIO_get_wpoll_descriptor`	Done
New	`BIO_err_is_non_fatal`	Done

Objectives

The objectives of the QUIC API design are:

to provide an API suitable for use with QUIC, now and in the future;
to reuse the existing libssl APIs to the extent feasible;
to enable existing applications to adapt to using QUIC with only minimal API changes.

SSL Objects

Structure of Documentation

Each API listed below has an information table with the following fields:

Semantics: This can be one of:
- Unchanged: The semantics of this existing libssl API call are unchanged.
- Changed: The semantics are changed for QUIC.
- New: The API is new for QUIC.
SSL_get_error: Can this API, when used with QUIC, change the state returned by SSL_get_error? This can be any combination of:
- Never: Does not interact with SSL_get_error.
- Error: Non-WANT_READ/WANT_WRITE errors can be raised.
- Want: WANT_READ/WANT_WRITE can be raised.
Can Tick?: Whether this function is allowed to perform event processing for the QUIC state machine and potentially perform network I/O.
CSHL: Connection/Stream/Handshake Layer classification. This can be one of:
- HL: This is a handshake layer related call. It should be supported on a QUIC connection SSL object, forwarding to the handshake layer SSL object. QUIC stream SSL objects do not allow these calls to be forwarded.
- HL-Forbidden: This is a handshake layer related call, but it is inapplicable to QUIC, so it is not supported.
- C: Not handshake-layer related. QUIC connection SSL object usage only. Fails on a QUIC stream SSL object.
- CS: Not handshake-layer related. Can be used on any QUIC SSL object.
- S: Requires a QUIC stream SSL object or a QUIC connection SSL object with a default stream attached.

Existing APIs

`SSL_set_connect_state`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	HL

`SSL_set_accept_state`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	HL

Note: Attempting to proceed in this state will not function for now because we do not implement server support at this time. However, the semantics of this function as such are unchanged.

`SSL_is_server`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	HL

`SSL_connect`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error/Want	Yes	HL

Simple composition of SSL_set_connect_state and SSL_do_handshake.

`SSL_accept`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error/Want	Yes	HL

Simple composition of SSL_set_accept_state and SSL_do_handshake.

`SSL_do_handshake`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error/Want	Yes	HL

Note: Idempotent if handshake already completed.

Blocking Considerations: Blocks until handshake completed if in blocking mode.

`SSL_read`, `SSL_read_ex`, `SSL_peek`, `SSL_peek_ex`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error/Want	Yes	CS

Blocking Considerations: Blocks until at least one byte is available or an error occurs if in blocking mode (including the peek functions).

If the read part of the stream has been finished by the peer, calls to SSL_read will fail with SSL_ERROR_ZERO_RETURN.

If a stream has terminated in a non-normal fashion (for example because the stream has been reset, or the connection has terminated), calls to SSL_read will fail with SSL_ERROR_SSL.

SSL_get_stream_read_state can be used to clarify the stream state when an error occurs.

`SSL_write`, `SSL_write_ex`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error/Want	Yes	CS

We have to implement all of the following modes:

SSL_MODE_ENABLE_PARTIAL_WRITE on or off
SSL_MODE_ACCEPT_MOVING_WRITE_BUFFER on or off
Blocking mode on or off

Blocking Considerations: Blocks until libssl has accepted responsibility for (i.e., copied) all data provided, or an error occurs, if in blocking mode. In other words, it blocks until it can buffer the data. This does not necessarily mean that the data has actually been sent.

SSL_get_stream_write_state can be used to clarify the stream state when an error occurs.

`SSL_pending`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	CS

`SSL_has_pending`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	CS

TBD. Options:

Semantics unchanged or approximated (essentially, SSL_pending() || any RXE queued || any URXE queued).
Change semantics to only determine the return value based on if there is data in the stream receive buffer.

`SSL_shutdown`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Error	Yes	CS

See SSL_shutdown_ex below for discussion of how this will work for QUIC.

Calling SSL_shutdown is always exactly identical in function to calling SSL_shutdown_ex with flags set to 0 and args set to NULL.

`SSL_clear`

Semantics	`SSL_get_error`	Can Tick?	CSHL
TBD	TBD	No	C

There are potential implementation hazards:

SSL_clear() resets the SSL object to allow for another connection. The reset operation however keeps several settings of the last sessions (some of these settings were made automatically during the last handshake). It only makes sense for a new connection with the exact same peer that shares these settings, and may fail if that peer changes its settings between connections.

TBD: How should SSL_clear be implemented? Either:

Modernised implementation which resets everything, handshake layer re-instantiated (safer);
Preserve SSL_clear semantics at the handshake layer, reset all QUIC state (QUIC_CHANNEL torn down, CSM reset).

TBD: Semantics of this on stream objects.

`SSL_free`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Changed	Never	No	CS

QUIC stream SSL objects. When used on a QUIC stream SSL object, parts of the stream state may continue to exist internally, managed inside the QUIC connection SSL object, until they can be correctly torn down, or until the QUIC connection SSL object is freed.

If a QUIC stream SSL object is freed for a stream which has not reached a terminal state for all of its parts (both send and receive, as applicable), the stream is automatically reset (non-normal termination) with an application error code of 0. To explicitly reset a stream with a different application error code, call SSL_stream_reset before calling this function.

If the peer continues to send data on the stream before it processes the notification of the stream's termination, that incoming data will be discarded. However, the peer will be reliably notified of the non-normal termination of the stream assuming that the connection remains healthy.

When freeing a QUIC stream SSL object which was terminated in a non-normal fashion, or which was terminated automatically due to a call to this function, any data which was appended to the stream via SSL_write may or may not have already been transmitted, and even if already transmitted, may or may not be retransmitted in the event of loss.

When freeing a QUIC stream SSL object which was terminated normally (for example via SSL_stream_conclude), data appended to the stream via SSL_write will still be transmitted or retransmitted as necessary, assuming that the QUIC connection SSL object is not freed and that the connection remains healthy.

QUIC connection SSL objects. SSL_free is largely unchanged for QUIC connection SSL objects on the client side. When freeing a QUIC connection SSL object being used in client mode, there is immediate termination of any QUIC network I/O processing as the resources needed to handle the connection are immediately freed. This means that, if a QUIC connection SSL object which has not been shutdown properly is freed using this function:

Any data which was pending transmission or retransmission will not be transmitted, including in streams which were terminated normally;
The connection closure process will not function correctly or in an RFC-compliant manner. Connection closure will not be signalled to the peer and the connection will simply disappear from the perspective of the peer. The connection will appear to remain active until the connection's idle timeout (if negotiated) takes effect.

For further discussion of this issue, see SSL_shutdown_ex and the Q&A.

`SSL_set0_rbio`, `SSL_set0_wbio`, `SSL_set_bio`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Changed	Never	No	C

Sets network-side BIO.

The changes to the semantics of these calls are as follows:

The BIO MUST be a BIO with datagram semantics (this is a change relative to TLS, though not to DTLS).
If the BIO is non-pollable (see below), application-level blocking mode will be forced off.

`SSL_set_[rw]fd`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Changed	Never	No	C

Sets network-side socket FD.

Existing behaviour: Instantiates a BIO_s_socket, sets an FD on it, and sets it as the BIO.

New proposed behaviour:

Instantiate a BIO_s_dgram instead for a QUIC connection SSL object.
Fails (no-op) for a QUIC stream SSL object.

`SSL_get_[rw]fd`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	C

Should not require any changes.

`SSL_CTRL_MODE`, `SSL_CTRL_CLEAR_MODE`

Semantics	`SSL_get_error`	Can Tick?	CSHL
Unchanged	Never	No	CS

SSL Modes

SSL_MODE_ENABLE_PARTIAL_WRITE: Implemented. If this mode is set during a non-partial-write SSL_write operation spanning multiple SSL_write calls, this mode does not take effect until the non-partial write operation is completed.
SSL_MODE_ACCEPT_MOVING_WRITE_BUFFER: Implemented.
SSL_MODE_AUTO_RETRY: TBD.
SSL_MODE_RELEASE_BUFFERS: Ignored. This is an optimization and if it has any sensible semantic correspondence to QUIC, this can be considered later.
SSL_MODE_SEND_FALLBACK_SCSV: TBD: Either ignore or fail if the client attempts to set this prior to handshake. The latter is probably safer.

Ignored if set after handshake (existing behaviour).
SSL_MODE_ASYNC: TBD.

New APIs for Single-Stream Operation

`SSL_handle_events`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	Yes	CS

Advances the QUIC state machine to the extent feasible, potentially performing network I/O. Also compatible with DTLSv1 and supercedes DTLSv1_handle_timeout for all use cases.

`SSL_get_event_timeout`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

Gets the time until the QUIC state machine next wants to receive a timeout event, if any.

This is similar to the existing DTLSv1_get_timeout function, but it is not specific to DTLSv1. It is also usable for DTLSv1 and can become a protocol-agnostic API for this purpose, superceding DTLSv1_get_timeout for all use cases.

The design is similar to that of DTLSv1_get_timeout and uses a struct timeval. However, this function can also output an infinite timeout using the is_infinite argument, whereas whereas DTLSv1_get_timeout represents an infinite timeout using a 0 return value, which does not allow a failure condition to be distinguished.

`SSL_set_blocking_mode`, `SSL_get_blocking_mode`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

Turns blocking mode on or off. This is necessary because up until now libssl has operated in blocking or non-blocking mode automatically as an emergent consequence of whether the underlying network socket is blocking. For QUIC, this is no longer viable, thus blocking semantics at the application level must be explicitly configured.

Use on stream objects: It may be feasible to implement this such that different QUIC stream SSL objects can have different settings for this option.

Not supported for non-QUIC SSL objects.

`SSL_get_rpoll_descriptor`, `SSL_get_wpoll_descriptor`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

These functions output poll descriptors which can be used to determine when the QUIC state machine next needs to have events handled. SSL_get_rpoll_descriptor is relevant if SSL_net_read_desired returns 1, and SSL_get_wpoll_descriptor is relevant if SSL_net_write_desired returns 1.

The implementation of these functions is a simple forward to BIO_get_rpoll_descriptor and BIO_get_wpoll_descriptor on the underlying network BIOs.

TODO: Support these for non-QUIC SSL objects

`SSL_net_read_desired`, `SSL_net_write_desired`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

These calls return 1 if the QUIC state machine is interested in receiving further data from the network, or writing to the network, respectively. The return values of these calls should be used to determine which wakeup events should cause an application to call SSL_handle_events. These functions do not mutate any state, and their return values may change after a call to any SSL function other than SSL_net_read_desired, SSL_net_write_desired, SSL_get_rpoll_descriptor, SSL_get_wpoll_descriptor and SSL_get_event_timeout.

TODO: Support these for non-QUIC SSL objects, turning this into a unified replacement for SSL_want

`SSL_want`, `SSL_want_read`, `SSL_want_write`

The existing API SSL_want, and the macros defined in terms of it, are traditionally used to determine if the SSL state machine has exited in non-blocking mode due to a desire to read from or write to the underlying network BIO. However, this API is unsuitable for use with QUIC because the return value of SSL_want can only express one I/O direction at a time (read or write), not both. This call will not be implemented for QUIC (e.g. always returns SSL_NOTHING) and SSL_net_read_desired and SSL_net_write_desired will be used instead.

`SSL_set_initial_peer_addr`, `SSL_get_initial_peer_addr`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

SSL_set_initial_peer_addr sets the initial L4 UDP peer address for an outgoing QUIC connection.

The initial peer address may be autodetected if no peer address has already been set explicitly and the QUIC connection SSL object is provided with a BIO_s_dgram with a peer set.

SSL_set_initial_peer_addr cannot be called after a connection is established.

`SSL_shutdown_ex`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Error	Yes	C

typedef struct  ssl_shutdown_ex_args_st {
    /* These arguments pertain only to QUIC connections. */
    uint64_t    quic_error_code; /* [0, 2**62-1] */
    const char *quic_reason;
} SSL_SHUTDOWN_EX_ARGS;

#define SSL_SHUTDOWN_FLAG_RAPID         (1U << 0)
#define SSL_SHUTDOWN_FLAG_IMMEDIATE     (1U << 1)

int SSL_shutdown_ex(SSL *ssl,
                    uint64_t flags,
                    const SSL_SHUTDOWN_EX_ARGS *args,
                    size_t args_len);

SSL_shutdown_ex is an extended version of SSL_shutdown.

args specifies arguments which control how the SSL object is shut down. args are read only on the first call to SSL_shutdown_ex for a given SSL object and subsequent calls to SSL_shutdown_ex ignore the args argument. args_len should be set to sizeof(*args). This function is idempotent; once the shutdown process for a SSL object is complete, further calls are a no-op and return 1.

Calling SSL_shutdown_ex on a QUIC connection SSL object causes the immediate close of the QUIC connection. “Immediate close” is as defined by RFC 9000.

If no QUIC connection attempt was ever initiated using the given SSL object, the QUIC connection transitions immediately to the Terminated state. Otherwise, the connection closure process is initiated if it has not already begun.

Any application stream data on a non-terminated or normally terminated stream which has yet to be transmitted is flushed to the network before the termination process begins. This ensures that where an application which calls SSL_write and performs a connection closure in a way which is considered normal to the application protocol being used, all of the data written is delivered to the peer. This behaviour may be skipped by setting the SSL_SHUTDOWN_FLAG_IMMEDIATE flag, in which case any data appended to streams via SSL_write (or any end-of-stream conditions) may not be transmitted to the peer. This flag may be useful where a non-normal application condition has occurred and the delivery of data written to streams via SSL_write is no longer relevant. Application stream data on streams which were terminated non-normally (for example via SSL_stream_reset) is not transmitted by this function.

A QUIC connection can be shut down using this function in two different ways:

RFC compliant mode. In this mode, which provides the most robust operation, the shutdown process may take a period of time up to three times the current estimated RTT to the peer. It is possible for the closure process to complete much faster in some circumstances but this cannot be relied upon.

In blocking mode, the function will return once the closure process is complete. In non-blocking mode, SSL_shutdown_ex should be called until it returns 1, indicating the closure process is complete and the connection is now terminated.
Rapid mode. In this mode, a CONNECTION_CLOSE frame is sent in a best-effort manner and the connection is terminated immediately. If the CONNECTION_CLOSE frame sent is lost, the peer will not know that the connection has terminated until the negotiated idle timeout (if any) expires.

This will generally return 0 on success, indicating that the connection has not yet reached the Terminating state (unless it has already done so, in which case it will return 1).

In blocking mode, this blocks until at least one CONNECTION_CLOSE frame is sent but does not otherwise block. In non-blocking mode, this should be called until it returns a non-negative value. A negative value indicates failure or an I/O would-block condition.

It is permissible for an application to implement a hybrid approach, for example by initiating a rapid or non-blocking shutdown and continuing to call SSL_handle_events for a duration it chooses.

If SSL_SHUTDOWN_FLAG_RAPID is specified in flags, a rapid shutdown is performed, otherwise an RFC-compliant shutdown is performed. The principal effect of this flag is to partially disable blocking behaviour in blocking mode, and the QUIC implementation will still attempt to implement the Terminating state semantics if the application happens to call SSL_handle_events, until it reaches the Terminated state or is freed. An application can change its mind about performing a rapid shutdown by making a subsequent call to SSL_shutdown_ex without the flag set.

Calling SSL_shutdown_ex on a QUIC stream SSL object is not valid; such a call will fail and has no effect. The rationale for this is that an application may well want to pass around SSL objects for individual QUIC streams to existing parts of its own code which expect something which behaves like a typical SSL object (i.e., a single bytestream); those components may well already call SSL_shutdown and it is not desired for such calls to affect the whole connection.

TBD: Should we allow SSL_shutdown_ex on a QUIC stream SSL object to act as SSL_stream_conclude? Should we require this behaviour to be explicitly enabled?

The args->quic_error_code and args->reason fields allow the application error code and reason string for the closure of a QUIC connection to be specified. If args or args->reason is NULL, a zero-length string is used for the reason. If args is NULL, an error code of 0 is used. args->quic_error_code must be in the range [0, 2**62-1], else this function fails. These fields are ignored for SSL objects which do not represent QUIC connections.

`SSL_stream_conclude`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Error	Yes	S

int SSL_stream_conclude(SSL *ssl, uint64_t flags);

SSL_stream_conclude signals the normal end-of-stream condition to the send part of a QUIC stream. If called on a QUIC connection SSL object with a default stream, it signals the end of that stream to the peer. If called on a QUIC stream SSL object, it signals the end of that stream to the peer.

This function may only be called for bidirectional streams and for outgoing unidirectional streams. It is a no-op if it has already been called for a given stream, or if either the stream or connection have entered an error state.

Any data already queued for transmission via a call to SSL_write() will still be written in a reliable manner before the end-of-stream is signalled, assuming the connection remains healthy. This function can be thought of as appending a logical end-of-stream marker after any data which has previously been written to the stream via calls to SSL_write. Further attempts to call SSL_write after calling this function will fail.

When calling this on a bidirectional stream, the receive part of the stream remains unaffected, and the peer may continue to send data via it until the peer also signals the end of the stream. Thus, SSL_read() can still be used.

This function is used to conclude the send part of a stream in a normal manner. To perform non-normal termination of both the sending and receiving parts of a stream, see SSL_stream_reset.

flags is reserved and should be set to 0.

`SSL_stream_reset`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Error	Yes	S

typedef struct ssl_stream_reset_args_st {
    uint64_t quic_error_code; /* [0, 2**62-1] */
} SSL_STREAM_RESET_ARGS;

int SSL_stream_reset(SSL *ssl,
                     const SSL_STREAM_RESET_ARGS *args,
                     size_t args_len);

Conducts a non-normal termination of a bidirectional or outgoing unidirectional stream. For QUIC, this corresponds to a stream reset using a RESET_STREAM frame.

It may be called on either a QUIC stream SSL object or a QUIC connection SSL object with a default stream; the given stream is reset. The QUIC connection is not affected.

For bidirectional streams, this terminates both sending and receiving parts of the stream. It may not be called on an incoming unidirectional stream.

If args is NULL, an application error code of 0 is used. Otherwise, the application error code to use is specified in args->quic_error_code, which must be in the range [0, 2**62-1]. args_len must be set to sizeof(*args) if args is non-NULL.

Only the first call to this function has any effect; subsequent calls are no-ops. This is considered a success case.

`SSL_get_stream_state`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	S

/*
 * e.g. Non-QUIC SSL object, or QUIC connection SSL object without a default
 * stream.
 */
#define SSL_STREAM_STATE_NONE                   0

/*
 * The read or write part of the stream is still available and has not been
 * terminated in a normal or non-normal manner.
 */
#define SSL_STREAM_STATE_OK                     1

/*
 * The stream is a unidirectional stream and this direction cannot be used; for
 * example, a remotely initiated unidirectional stream where
 * SSL_get_stream_write_state is called, or a locally initiated unidirectional
 * stream where SSL_get_stream_read_state is
 called.
 */
#define SSL_STREAM_STATE_WRONG_DIR              2

/*
 * The read or write part of the stream has been finished in a normal manner.
 *
 * For SSL_get_stream_read_state, this means that there is no more data to read,
 * and that any future SSL_read calls will return any residual data waiting to
 * be read followed by a SSL_ERROR_ZERO_RETURN condition.
 *
 * For SSL_get_stream_write_state, this means that the local application has
 * already indicated the end of the stream by calling SSL_stream_conclude,
 * and that future calls to SSL_write will fail.
 */
#define SSL_STREAM_STATE_FINISHED               3

/*
 * The stream was reset by the local party.
 *
 * For SSL_get_stream_read_state, this means that the stream was aborted using a
 * locally transmitted STOP_SENDING frame. Attempts to read from the stream via
 * SSL_read will fail, though SSL_read may allow any  residual data waiting to
 * be  read to be  read first.
 *
 * For SSL_get_stream_write_state, this means that the stream was aborted
 * using a locally transmitted RESET_STREAM frame. Attempts to write to
 * the stream will fail.
 */
#define SSL_STREAM_STATE_RESET_LOCAL            4

/*
 * The stream was reset by the remote party.
 *
 * For SSL_get_stream_read_state, this means the peer sent a STREAM_RESET
 * frame for the stream.
 *
 * For SSL_get_stream_write_state, this means the peer sent a STOP_SENDING
 * frame for the stream.
 */
#define SSL_STREAM_STATE_RESET_REMOTE           5

/*
 * The underlying connection supporting the stream has closed or otherwise
 * failed.
 *
 * For SSL_get_stream_read_state, this means that attempts to read from the
 * stream via SSL_read will fail, though SSL_read may allow any residual
 * data waiting to be read to be read first.
 *
 * For SSL_get_stream_write_state, this means that attempts to write to the
 * stream will fail.
 */
#define SSL_STREAM_STATE_CONN_CLOSED            6

int SSL_get_stream_read_state(SSL *ssl);
int SSL_get_stream_write_state(SSL *ssl);

This API allows the current state of a stream to be queried. This allows an application to determine whether a stream is still usable and why a stream has reached an error state.

`SSL_get_stream_read_error_code`, `SSL_get_stream_write_error_code`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	S

int SSL_get_stream_read_error_code(SSL *ssl, uint64_t *app_error_code);
int SSL_get_stream_write_error_code(SSL *ssl, uint64_t *app_error_code);

SSL_get_stream_read_error_code gets the error code for the read part of the stream.

SSL_get_stream_write_error_code gets the error code for the write part of the stream.

If a stream has been terminated normally, returns 0.

If a stream has been terminated non-normally, returns 1 and writes the applicable application error code to *app_error_code.

If a stream is still healthy, or was healthy at the time the connection was closed, or the respective part of the stream does not exist (e.g. for a unidirectional stream), returns -1.

`SSL_get_conn_close_info`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	C

typedef struct ssl_conn_close_info_st {
    uint64_t error_code;
    char     *reason;
    size_t   reason_len;
    int      is_local;
    int      is_transport;
} SSL_CONN_CLOSE_INFO;

int SSL_get_conn_close_info(SSL *ssl,
                            SSL_CONN_CLOSE_INFO *info,
                            size_t info_len);

If a connection is still healthy, returns 0. Otherwise, fills *info with information about the error causing connection termination and returns 1. info_len must be set to sizeof(*info). Returns -1 if called on a non-QUIC SSL object or if the connection status cannot be determined.

info->reason is set to point to a buffer containing a reason string. The buffer is valid for the lifetime of the SSL object. The reason string will always be zero terminated, but since it is received from a potentially untrusted peer, may also contain zero bytes. info->reason_len is the true length of the reason string in bytes.

info->is_local is 1 if the connection closure was locally initiated.

info->is_transport is 1 if the connection closure was initiated by QUIC, and 0 if it was initiated by the application. The namespace of info->error_code is determined by this parameter.

New APIs for Multi-Stream Operation

The above new APIs are built on constructively to facilitate multi-stream operation.

The concept of a QUIC stream SSL object is introduced. A QUIC SSL object is either a QUIC connection SSL object or a QUIC stream SSL object. A QUIC stream SSL object belongs to a QUIC connection SSL object. A QUIC connection SSL object may or may not have an associated default stream. There may only be at most one default stream for a QUIC connection SSL object. Reading or writing application data to a QUIC connection SSL object with a default stream is equivalent to reading or writing to that stream. It is an error to attempt to read or write application data, or perform other stream-specific operations, on a QUIC connection SSL object without a default stream associated.

Notes on Multi-Threaded Operation

Initially these APIs will not be thread safe over the same connection, but in the longer term we intend to support multiple threads using different QUIC stream SSL objects on different threads over the same connection without the application having to do any locking. This is referred to as multi-stream multi-thread (MSMT) operation. Only APIs explicitly denoted below will eventually be MSMT-safe.

Notes on Blocking

The blocking mode can be configured on each SSL object individually. When a QUIC stream SSL object is created it inherits its blocking state from the currently configured blocking state of the QUIC connection SSL object at the time the stream is created. This can be changed independently. For example, a QUIC connection SSL object can be in blocking mode to allow for blocking SSL_accept_stream calls, yet have some or all QUIC stream SSL objects be in non-blocking mode concurrently.

Notes on Application-Level Polling

An API may be added in the future to allow applications to poll multiple QUIC connection SSL objects efficiently for new stream and stream readability events. This is not yet urgent but will be more relevant for concurrent server applications.

`SSL_get0_connection`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

/*
 * Get the SSL object representing the connection associated with this object.
 *
 * If the SSL object represents a non-QUIC method or a QUIC connection, this
 * returns the same object passed.
 *
 * If the SSL object represents a QUIC stream returns the QUIC connection
 * object.
 */
SSL *SSL_get0_connection(SSL *ssl);

`SSL_is_connection`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	CS

/*
 * Returns 1 if the object represents a connection. This always returns 1 for
 * non-QUIC methods, but returns 0 for SSL objects for QUIC streams which are
 * not also the QUIC connection object.
 *
 * This is exactly equivalent to (SSL_get0_connection(ssl) == ssl).
 */
int SSL_is_connection(SSL *ssl);

`SSL_get_stream_type`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	S

/*
 * If the object represents a stream, returns a SSL_STREAM_TYPE value
 * designating whether the stream can be used for transmission, reception,
 * or both.
 *
 * This always returns SSL_STREAM_TYPE_BIDI for non-QUIC methods.
 *
 * It returns SSL_STREAM_TYPE_NONE for a QUIC connection object if it
 * does not have a default stream.
 */
#define SSL_STREAM_TYPE_NONE    0
#define SSL_STREAM_TYPE_READ    1
#define SSL_STREAM_TYPE_WRITE   2
#define SSL_STREAM_TYPE_BIDI    (SSL_STREAM_TYPE_READ | SSL_STREAM_TYPE_WRITE)
__owur int SSL_get_stream_type(SSL *ssl);

`SSL_get_stream_id`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	S

/*
 * QUIC: Returns the unique stream ID for the stream, an integer in range [0, 2**62-1],
 * or UINT64_MAX if the stream ID is not available. If called on a QUIC
 * connection, returns the unique stream ID for the default stream if there is
 * one, and otherwise returns UINT64_MAX.
 *
 * TLS, DTLS: Returns UINT64_MAX.
 */
__owur uint64_t SSL_get_stream_id(SSL *ssl);

`SSL_new_stream`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	C

/*
 * Create a new SSL object representing a single additional stream.
 *
 * There is no need to call SSL_connect on the resulting object, and
 * any such call is a no-op.
 *
 * For QUIC:
 *   Creates a new stream. Must be called only on a QUIC connection SSL object.
 *   Can be used on client or server. If the SSL_STREAM_FLAG_UNI flag is set,
 *   the created stream is unidirectional, otherwise it is bidirectional.
 *
 *   To be MSMT-safe.
 *
 * For TLS and DTLS SSL objects:
 *   Always fails.
 */
#define SSL_STREAM_FLAG_UNI    1

SSL *SSL_new_stream(SSL *ssl, uint64_t flags);

`SSL_accept_stream`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	Yes	C

/*
 * Create a new SSL object representing an additional stream which was created
 * by the peer.
 *
 * There is no need to call SSL_accept on the resulting object, and
 * any such call is a no-op.
 *
 * For QUIC:
 *   Must be called only on a QUIC connection SSL object. Fails if called on a
 *   stream object. Checks if a new stream has been created by the peer. If it
 *   has, creates a new SSL object to represent it and returns it. Otherwise,
 *   returns NULL. If multiple streams are available to be accepted, the oldest
 *   stream (that is, the stream with the lowest stream ID) is accepted.
 *
 * For all other methods:
 *   Returns NULL.
 *
 * The flags argument is unused and should be set to zero.
 *
 * To be MSMT-safe (i.e., can be called from multiple threads).
 *
 * If the QUIC connection SSL object is configured in blocking mode, this
 * function will block unless the SSL_ACCEPT_STREAM_NO_BLOCK flag is passed.
 *
 * This function returns NULL if the effective incoming stream reject policy is
 * `REJECT`.
 */
#define SSL_ACCEPT_STREAM_NO_BLOCK      1

SSL *SSL_accept_stream(SSL *ssl, uint64_t flags);

`SSL_get_accept_stream_queue_len`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	C

/*
 * Determine the number of streams waiting to be returned on a subsequent call
 * to SSL_accept_stream. If this returns a non-zero value, the next call to
 * SSL_accept_stream (on any thread) is guaranteed to work. Returns 0 for
 * non-QUIC objects, or for QUIC stream objects.
 *
 * To be MSMT-safe.
 */
size_t SSL_get_accept_stream_queue_len(SSL *ssl);

`SSL_set_incoming_stream_policy`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	C

/*
 * Sets the policy for incoming streams. If `policy` is `AUTO` (the default):
 *
 *   - if the default stream mode is
 *     `SSL_DEFAULT_STREAM_MODE_AUTO_BIDI` or
 *     `SSL_DEFAULT_STREAM_MODE_AUTO_UNI`, this is equivalent to `REJECT`;
 *
 *   - otherwise, this is equivalent to `ACCEPT`.
 *
 * If configured to `ACCEPT`, incoming streams are placed on the accept queue
 * for application consumption. `aec` is ignored in this case.
 *
 * If configured to `REJECT`, incoming streams automatically have both their
 * receiving and sending parts handled via non-normal termination. `aec` is an
 * application error code used for the `STOP_SENDING` and `RESET_STREAM` frames
 * used for the purposes of this termination. The default AEC value used if this
 * function is never called is 0.
 */
#define SSL_INCOMING_STREAM_POLICY_AUTO      0
#define SSL_INCOMING_STREAM_POLICY_ACCEPT    1
#define SSL_INCOMING_STREAM_POLICY_REJECT    2

int SSL_set_incoming_stream_policy(SSL *ssl, int policy, uint64_t aec);

`SSL_set_default_stream_mode`

Semantics	`SSL_get_error`	Can Tick?	CSHL
New	Never	No	C

/*
 * Used to control single stream operation. Calling this function determines the
 * nature of the default stream which will automatically be created on the QUIC
 * connection SSL object.
 *
 * The default mode is `SSL_DEFAULT_STREAM_MODE_AUTO_BIDI`.
 *
 * The modes are as follows:
 *
 *   - `SSL_DEFAULT_STREAM_MODE_NONE`: No default stream will ever be created.
 *     The application is assumed to understand multi-stream operation.
 *     Remotely-initiated streams are placed in the accept queue for application
 *     consumption. `SSL_read` and `SSL_write` calls must be made on a QUIC
 *     stream SSL object, not the QUIC connection SSL object, as no default
 *     stream will be associated with it.
 *
 *   - `SSL_DEFAULT_STREAM_MODE_AUTO_BIDI`: "First stream wins" mode of
 *     operation for single-stream usage. If `SSL_write` is called before the
 *     peer opens a remotely-initiated stream, a locally-initiated bidirectional
 *     stream is created and bound as the default stream. If the peer opens a
 *     remotely-initiated stream before the local application calls `SSL_write`
 *     (with `len > 0`) for the first time, that stream is bound as the default
 *     stream, which may be bidirectional or unidirectional; if it is
 *     unidirectional, calls to `SSL_write` will fail. Attempts to create
 *     additional streams by the peer are automatically rejected unless
 *     the application opts in (API TBD).
 *
 *   - `SSL_DEFAULT_STREAM_MODE_AUTO_UNI`: "First stream wins" mode of
 *     operation for single-stream usage, with a unidirectional stream. This
 *     functions identically to `SSL_DEFAULT_STREAM_MODE_AUTO_BIDI`, but if the
 *     local application calls `SSL_write` prior to the peer creating a
 *     remotely-initiated stream, a unidirectional TX-only stream is created and
 *     bound as the default stream. Thereafter, calls to `SSL_read` will fail.
 *     If the peer creates a remotely-initiated stream prior to the first call
 *     to `SSL_write` (with `len > 0`), that stream will be bound as the default
 *     stream; note that a bidirectional stream may be bound in this case.
 *     Attempts to create additional streams by the peer are automatically
 *     rejected unless the application opts in (API TBD).
 *
 * This function must be called before a default stream object is created, for
 * example before initiating a connection. If the function is too late to have
 * an effect, this function fails and returns 0.
 */
#define SSL_DEFAULT_STREAM_MODE_NONE                0
#define SSL_DEFAULT_STREAM_MODE_AUTO_BIDI           1
#define SSL_DEFAULT_STREAM_MODE_AUTO_UNI            2

__owur int SSL_set_default_stream_mode(SSL *ssl, uint32_t mode);

Future APIs

A custom poller interface may be provided in the future. For more information, see the QUIC I/O Architecture design document.

BIO Objects

Existing APIs

`BIO_s_connect`, `BIO_new_ssl_connect`, `BIO_set_conn_hostname`

We are aiming to support use of the existing BIO_new_ssl_connect API with only minimal changes. This will require internal changes to BIO_s_connect, which should automatically detect when it is being used with a QUIC SSL_CTX and act accordingly.

`BIO_new_bio_pair`

Unsuitable for use with QUIC on the network side; instead, applications can make use of the new BIO_s_dgram_pair which provides equivalent functionality with datagram semantics.

Interactions with `BIO_f_buffer`

Existing applications sometimes combine a network socket BIO with a BIO_f_buffer. This is problematic because the datagram semantics of writes are not preserved, therefore the BIO provided to libssl is, as provided, unusable for the purposes of implementing QUIC. Moreover, output buffering is not a relevant or desirable performance optimisation for the transmission of UDP datagrams and will actually undermine QUIC performance by causing incorrect calculation of ACK delays and consequently inaccurate RTT calculation.

Options:

Require applications to be changed to not use QUIC with a BIO_f_buffer.
Detect when a BIO_f_buffer is part of a BIO stack and bypass it (yucky and surprising).

MTU Signalling

See also: BIO_s_dgram_pair(3)

BIO_dgram_get_mtu (BIO_CTRL_DGRAM_GET_MTU) and BIO_dgram_set_mtu (BIO_CTRL_DGRAM_SET_MTU) already exist for BIO_s_dgram and are implemented on a BIO_s_dgram_pair to allow the MTU to be determined and configured. One side of a pair can configure the MTU to allow the other side to detect it.

BIO_s_dgram also has pre-existing support for getting the correct MTU value from the OS using BIO_CTRL_DGRAM_QUERY_MTU.

New APIs

`BIO_sendmmsg` and `BIO_recvmmsg`

See also: BIO_sendmmsg(3)

The BIO interface features a new high-performance API for the execution of multiple read or write operations in a single system call, on supported OSes. On other OSes, a compatible fallback implementation is used.

Unlike all other BIO APIs, this API is intended for concurrent threaded use and as such operates in a stateless fashion with regards to a BIO. This means, for example, that retry indications are made using explicit API inputs and outputs rather than setting an internal flag on the BIO.

This new BIO API includes:

Local address support (getting the destination address of an incoming packet; setting the source address of an outgoing packet), where support for this is available;
Peer address support (setting the destination address of an outgoing packet; getting the source address of an incoming packet), where support for this is available.

The following functionality was intentionally left out of this design because not all OSes can provide support:

Iovecs (which have also been determined not to be necessary for a performant QUIC implementation);
Features such as MSG_DONTWAIT, etc.

This BIO API is intended to be extensible. For more information on this API, see BIO_sendmmsg(3) and BIO_recvmmsg(3).

Custom BIO implementers may set their own implementation of these APIs via corresponding BIO_meth getter/setter functions.

Truncation Mode

See also: BIO_s_dgram_pair(3)

The controls BIO_dgram_set_no_trunc (BIO_CTRL_DGRAM_SET_NO_TRUNC) and BIO_dgram_get_no_trunc (BIO_CTRL_DGRAM_GET_NO_TRUNC) are introduced. This is a boolean value which may be implemented by BIOs with datagram semantics. When enabled, attempting to receive a datagram such that the datagram would ordinarily be truncated (as per the design of the Berkeley sockets API) instead results in a failure. This is intended for implementation by BIO_s_dgram_pair. For compatibility, the default behaviour is off.

Capability Negotiation

See also: BIO_s_dgram_pair(3)

Where a BIO_s_dgram_pair is used, there is the potential for such a memory BIO to be used by existing application code which is being adapted for use with QUIC. A problem arises whereby one end of a BIO_s_dgram_pair (for example, the side being used by OpenSSL's QUIC implementation) may assume that the other end supports certain capabilities (for example, specifying a peer address), when in actual fact the opposite end of the BIO_s_dgram_pair does not.

A capability signalling mechanism is introduced which allows one end of a BIO_s_dgram_pair to indicate to the user of the opposite BIO the following capabilities and related information:

Whether source addresses the peer specifies will be processed.
Whether destination addresses the peer specifies will be processed.
Whether source addresses will be provided to the opposite BIO when it receives datagrams.
Whether destination addresses will be provided to the opposite BIO when it receives datagrams.

The usage is as follows:

One side of a BIO pair calls BIO_dgram_set_caps with zero or more of the following flags to advertise its capabilities:
- BIO_DGRAM_CAP_HANDLES_SRC_ADDR
- BIO_DGRAM_CAP_HANDLES_DST_ADDR
- BIO_DGRAM_CAP_PROVIDES_SRC_ADDR
- BIO_DGRAM_CAP_PROVIDES_DST_ADDR
The other side of the BIO pair calls BIO_dgram_get_effective_caps to learn the effective capabilities of the BIO. These are the capabilities set by the opposite BIO.
The above process can also be repeated in the opposite direction.

Local Address Support

See also: BIO_s_dgram_pair(3)

Support for local addressing (the reception of destination addresses for incoming packets, and the specification of source addresses for outgoing packets) varies by OS. Thus, it may not be available in all circumstances. A feature negotiation mechanism is introduced to facilitate this.

BIO_dgram_get_local_addr_cap (BIO_CTRL_DGRAM_GET_LOCAL_ADDR_CAP) determines if a BIO is potentially capable of supporting local addressing on the current platform. If it determines that support is available, local addressing support must then be explicitly enabled via BIO_dgram_set_local_addr_enable (BIO_CTRL_DGRAM_SET_LOCAL_ADDR_ENABLE). If local addressing support has not been enabled, attempts to use local addressing (for example via BIO_sendmmsg or BIO_recvmmsg with a BIO_MSG with a non-NULL local field) fails.

An explicit enablement call is required because setting up local addressing support requires system calls on most operating systems prior to sending or receiving packets and we do not wish to do this automatically inside the BIO_sendmmsg/BIO_recvmmsg fastpaths, particularly since the process of enabling support could fail due to lack of OS support, etc.

BIO_dgram_get_local_addr_enable (BIO_CTRL_DGRAM_GET_LOCAL_ADDR_ENABLE) is also available.

It is important to note that BIO_dgram_get_local_addr_cap is entirely distinct from the application capability negotiation mechanism discussed above. Whereas the capability negotiation mechanism discussed above allows applications to signal what they are capable of handling in their usage of a given BIO, BIO_dgram_local_addr_cap allows a BIO implementation to indicate to the users of that BIO whether it is able to support local addressing (where enabled).

`BIO_s_dgram_pair`

See also: BIO_s_dgram_pair(3)

A new BIO implementation, BIO_s_dgram_pair, is provided. This is similar to the existing BIO pair but provides datagram semantics. It provides full support for the new APIs BIO_sendmmsg, BIO_recvmmsg, the capability negotiation mechanism described above, local address support and the MTU signalling mechanism described above.

It can be instantiated using the new API BIO_new_dgram_pair.

`BIO_POLL_DESCRIPTOR`

The concept of poll descriptors are introduced. A poll descriptor is a tagged union structure which represents an abstraction over some unspecified kind of OS descriptor which can be used for synchronization and waiting.

The most commonly used kind of poll descriptor is one which describes a network socket (i.e., on POSIX-like platforms, a file descriptor), however other kinds of poll descriptor may be defined.

A BIO may be queried for whether it has a poll descriptor for read or write operations respectively:

Where BIO_get_rpoll_descriptor (BIO_CTRL_GET_RPOLL_DESCRIPTOR) is called, the BIO should output a poll descriptor which describes a resource which can be used to determine when the BIO will next become readable via a call to BIO_read or, if supported by the BIO, BIO_recvmmsg.
Where BIO_get_wpoll_descriptor (BIO_CTRL_GET_WPOLL_DESCRIPTOR) is called, the BIO should output a poll descriptor which describes a resource which can be used to determine when the BIO will next become writeable via a call to BIO_write or, if supported by the BIO, BIO_sendmmsg.

A BIO may not necessarily be able to provide a poll descriptor. For example, memory-based BIOs such as BIO_s_dgram_pair do not correspond to any OS synchronisation resource, and thus the BIO_get_rpoll_descriptor and BIO_get_wpoll_descriptor calls are not supported for such BIOs.

A BIO which supports these functions is known as pollable, and a BIO which does not is known as non-pollable. BIO_s_dgram supports these functions.

The implementation of these functions for a BIO_f_ssl forwards to SSL_get_rpoll_descriptor and SSL_get_wpoll_descriptor respectively. The

`BIO_s_dgram_mem`

This is a basic memory buffer BIO with datagram semantics. Unlike BIO_s_dgram_pair, it is unidirectional and does not support peer addressing or local addressing.

`BIO_err_is_non_fatal`

A new predicate function BIO_err_is_non_fatal is defined which determines if an error code represents a non-fatal or transient error. For details, see BIO_sendmmsg(3).

Q & A

To assist in understanding, when a “TBD” listed above is removed, or when a relevant question is raised, the resolution to the question will be placed here.

Q. Should SSL_do_handshake wait until the handshake is completed, or until it is confirmed?

Note: The terms handshake complete and handshake confirmed are defined in RFC 9001 and have specific meanings.

A. SSL_do_handshake should wait until the handshake is completed, because handshake completion represents the completion of the cryptographic authentication of the connection. When a connection's handshake is completed, TLS 1.3 Finished messages have been exchanged by both parties, even if the handshake has not yet been confirmed. Moreover, RFC 9001 s. 4.1.2 states:

Additionally, a client MAY consider the handshake to be confirmed when it receives an acknowledgment for a 1-RTT packet.

This logically implies that it is OK for a client to start transmitting 1-RTT packets prior to handshake confirmation, otherwise there would be no in-flight 1-RTT packets for the client to receive ACKs for.

Q. Does ENABLE_PARTIAL_WRITE interact with blocking mode?

A. No; this mode is only relevant to non-blocking mode. In blocking mode, SSL_write always waits until all data is written unless an error occurs. The semantics of SSL_write are preserved unchanged.

Q. Does SSL_write block until data is written to the network, or simply until it is buffered?

A. SSL_write blocks until it has accepted responsibility for the data passed to it, just like write(2) or send(2). In other words, it blocks until it can buffer the data. This does not necessarily mean that the data has actually been sent.

Q. How should connection closure work?

A. RFC requirements. After we begin terminating the connection by sending a CONNECTION_CLOSE frame, QUIC requires that we continue to process network I/O for a certain period of time so that any further traffic from the peer results in generation of a further CONNECTION_CLOSE frame. This is necessary to handle the possibility that the CONNECTION_CLOSE frame which was initially sent may be lost.

API issues. This creates a complication because it implies that the connection closure process may take a fair amount of time, whereas existing API users will generally expect to be able to call SSL_shutdown and then immediately free the SSL object.

However, if the caller immediately frees the SSL object, this precludes our implementing the applicable logic, at least on the client side. Moreover, existing API users are likely to tear down underlying network BIOs immediately after calling SSL_free anyway. In other words, any implementation based on secretly keeping QUIC state around after a call to SSL_free does not seem particularly workable on the client side.

Server side considerations. There is more of a prospect here on the server side, since multiple connections will share the same socket, which will presumably be associated with some kind of enduring listener object. Thus when server support is implemented in the future connection teardown could be handled internally by maintaining the state of connections undergoing termination inside the listener object. However, similar caveats to those discussed here arise when the listener object itself is to be town down. (It is also possible we could optionally allow use of the server-style API to make multiple outgoing client connections with a non-zero-length client-side CID on the same underlying network BIO.)

There are only really two ways to handle this:

RFC conformant mode. SSL_shutdown only indicates that shutdown is complete once the the entire connection closure process is complete.

This process consists of the Closing and Draining states. In some cases the Closing state may last only briefly, namely if the peer chooses to respond to our CONNECTION_CLOSE frame with a CONNECTION_CLOSE frame of its own. This allows immediate progression to the Draining state. However, a peer is not required to respond with such a frame. Thus in the worst case, this state can be as long as 3*PTO; for example a peer with a high estimated RTT of 300ms would have us wait for 900ms.

In the Draining state we simply ignore all incoming traffic and do not generate outgoing traffic. The purpose of this state is to simply tie up the socket and ensure any data still in flight is discarded. However, RFC 9000 states:

Disposing of connection state prior to exiting the closing or draining state could result in an endpoint generating a Stateless Reset unnecessarily when it receives a late-arriving packet. Endpoints that have some alternative means to ensure that late-arriving packets do not induce a response, such as those that are able to close the UDP socket, MAY end these states earlier to allow for faster resource recovery. Servers that retain an open socket for accepting new connections SHOULD NOT end the closing or draining state early

Because our client mode implementation uses one socket per connection, it appears to be reasonable based on the above text to omit the implementation of the draining state (the same may not be the case for the server role when implemented in the future).

Thus, in general, SSL_shutdown can be expected to take about one round trip's time to complete when dealing with a peer whose QUIC implementation happens to respond to a CONNECTION_CLOSE frame with a CONNECTION_CLOSE frame of its own, and about three round trips otherwise.
Rapid shutdown mode. SSL_shutdown sends a CONNECTION_CLOSE frame once and completes immediately. The Closing and Draining states are not used, and if the CONNECTION_CLOSE frame was lost, the peer will have to wait for idle timeout to determine that the connection is gone (there is also the possibility that, if the socket is closed by the application after teardown, a peer will make something of ICMP Port Unreachable messages, but this is unlikely to be reliable and since this message is not authenticated, QUIC implementations probably shouldn't pay much attention to it anyway.)

There is little problem with SSL_shutdown taking as long as it needs to for some long-running applications, but for others it poses a real issue. For example, a command-line tool which makes one connection, performs one application-specific transaction, and then tears down the connection. In this case an RFC-conformant connection termination would essentially require the process to hang around for a substantial amount of time after the work of the process is done.

For this reason, it is concluded that both of these shutdown modes need to be offered.

Where connection closure is initiated remotely rather than locally, only the draining state is relevant. Since we conclude above that we do not need to implement the draining state on the client side, this means that connection closure can be completed immediately in the case of a remote closure.

Q. Should we just map SSL_handle_events to DTLS_CTRL_HANDLE_TIMEOUT internally?

A. No, since the infinite time representation is different between the two calls.

Q. How should STOP_SENDING be supported?

TODO: Determine how STOP_SENDING should be supported.

Q. Can data be received on a locally initiated bidirectional stream before any data is sent on that stream?

This is an interesting question without a clear answer to be found in the QUIC RFCs. A close reading of RFC 9000 suggests that the answer is, in principle, yes; however the RFC also grants explicit permission to make design choices in implementations which would preclude this:

An implementation might choose to defer allocating a stream ID to a stream until it sends the first STREAM frame and enters this state, which can allow for better stream prioritization.

If an ID has not been allocated to a stream, obviously incoming data cannot be addressed to it. However, supposing that an implementation does not do this, RFC 9000 seems basically clear that it is valid for an application to create a stream locally, then receive data on it before sending anything:

The sending part of a stream that the endpoint initiates (types 0 and 2 for clients, 1 and 3 for servers) is opened by the application. The "Ready" state represents a newly created stream that is able to accept data from the application.

[...]

For a bidirectional stream, the receiving part enters the "Recv" state when the sending part initiated by the endpoint (type 0 for a client, type 1 for a server) enters the "Ready" state.

A peer is not generally notified of the creation of a stream which has not sent any data yet, since the creation of a stream is signalled only implicitly via the transmission of data in STREAM frames. However, a zero-length STREAM frame could presumably be used to effect such a notification. RFC 9000 contains no specific discussion of this possibility but does not preclude it. As such, in order to receive data on a locally-initiated bidirectional stream before sending any data on that stream, it would be necessary to either

Use a QUIC implementation which signals a bidirectional stream which has not yet sent any data via a zero-length stream frame, or
Use an application protocol which can inform the peer of the stream ID of the created stream in some application protocol-specific way. This is somewhat less plausible because it would require an API between the application and its QUIC library to inform the QUIC library that the peer has in fact created a stream with a given ID and to take its word for it. This is unlikely to be commonly available, especially as application errors in usage of such an API would lead to internal inconsistencies in QUIC connection state.

Of course this discussion is somewhat esoteric as it is unclear why an application would want to create a locally-initiated stream and then have the peer transmit on it first, rather than simply use a remotely-initiated stream. Thus this discussion of this edge case is more of a curiosity, however for completeness it needs to be thought about in the API design.

Q. How should single-stream operation support locally and remotely-initiated streams?

Note that the ID of a stream depends on whether it is bidirectional and whether it is initiated by the client or server. Therefore, in single stream operation, it is necessary to know whether single-stream QUIC is being used with client-initiated or server-initiated stream initiation, and whether a bidirectional or unidirectional stream is being used; otherwise, we do not know which stream ID to bind to.

The object of single stream operation is to support simple uses cases for simple applications. There seems no need to support esoteric usage of streams such as receiving first on a locally initiated stream here, thus we avoid supporting this to simplify the API.

As such, an application which calls SSL_write on a QUIC connection SSL object before it calls SSL_read by definition is using a locally-initiated stream, and an application which does the opposite is using a remotely-initiated stream. We can use the ordering of initial calls to SSL_read and SSL_write to infer the desired stream type.

Supporting locally-initiated streams (SSL_write called first) is simple; we automatically create the stream and queue data for transmission.

Supporting remotely-initiated streams (SSL_read called first) is a little stranger. We could create the stream with the correct ID when cued to by the initial call to SSL_read implying use of a remotely-initiated stream. However, this would mean we are creating state tracking a remotely-initiated stream before the peer has signalled it. This would work in the happy case where the client is connected to a compatible server but may result in strange inconsistencies of QUIC internal state if a client is accidentially connected to an incompatible peer. Since the peer ought to be the authority on the streams it creates, this seems like an undesirable approach.

Ergo, creation of a default remotely-initiated stream needs to be deferred until the peer signals such a stream.

This leads naturally to a "first stream wins" model of implementation:

When a QUIC connection SSL object is created, default stream mode is enabled, meaning that a default stream will be bound to the QUIC connection SSL object at the earliest available opportunity. However, no default stream is bound yet.
One of the following events happened — whichever happens first wins:
- The local application calls SSL_write() (len > 0). A locally-initiated stream with ordinal 0 is created. The stream is bidirectional by default but this can be changed. This stream is bound as the default stream.
- The peer creates a stream. This stream is bound as the default stream.

If the local application calls SSL_read() before either of the above occur, SSL_read() fails as though no data is available until one of the above events occurs.

Once one of the above events occurs, any additional stream created by the peer is automatically terminated using both STOP_SENDING and STREAM_RESET frames (to terminate both the receiving and sending parts respectively) and there is no API-visible effect to the local application (unless the application explicitly opts into supporting additional streams).

Files

quic-api.md

Latest commit

History

quic-api.md

File metadata and controls

QUIC API Overview

Overview and Implementation Status

Objectives

SSL Objects

Structure of Documentation

Existing APIs

SSL_set_connect_state

SSL_set_accept_state

SSL_is_server

SSL_connect

SSL_accept

SSL_do_handshake

SSL_read, SSL_read_ex, SSL_peek, SSL_peek_ex

SSL_write, SSL_write_ex

SSL_pending

SSL_has_pending

SSL_shutdown

SSL_clear

SSL_free

SSL_set0_rbio, SSL_set0_wbio, SSL_set_bio

SSL_set_[rw]fd

SSL_get_[rw]fd

SSL_CTRL_MODE, SSL_CTRL_CLEAR_MODE

SSL Modes

New APIs for Single-Stream Operation

SSL_handle_events

SSL_get_event_timeout

SSL_set_blocking_mode, SSL_get_blocking_mode

SSL_get_rpoll_descriptor, SSL_get_wpoll_descriptor

SSL_net_read_desired, SSL_net_write_desired

SSL_want, SSL_want_read, SSL_want_write

SSL_set_initial_peer_addr, SSL_get_initial_peer_addr

SSL_shutdown_ex

SSL_stream_conclude

SSL_stream_reset

SSL_get_stream_state

SSL_get_stream_read_error_code, SSL_get_stream_write_error_code

SSL_get_conn_close_info

New APIs for Multi-Stream Operation

Notes on Multi-Threaded Operation

Notes on Blocking

Notes on Application-Level Polling

SSL_get0_connection

SSL_is_connection

SSL_get_stream_type

SSL_get_stream_id

SSL_new_stream

SSL_accept_stream

SSL_get_accept_stream_queue_len

SSL_set_incoming_stream_policy

SSL_set_default_stream_mode

Future APIs

BIO Objects

Existing APIs

BIO_s_connect, BIO_new_ssl_connect, BIO_set_conn_hostname

BIO_new_bio_pair

Interactions with BIO_f_buffer

MTU Signalling

New APIs

BIO_sendmmsg and BIO_recvmmsg

Truncation Mode

Capability Negotiation

Local Address Support

BIO_s_dgram_pair

BIO_POLL_DESCRIPTOR

BIO_s_dgram_mem

BIO_err_is_non_fatal

Q & A

`SSL_set_connect_state`

`SSL_set_accept_state`

`SSL_is_server`

`SSL_connect`

`SSL_accept`

`SSL_do_handshake`

`SSL_read`, `SSL_read_ex`, `SSL_peek`, `SSL_peek_ex`

`SSL_write`, `SSL_write_ex`

`SSL_pending`

`SSL_has_pending`

`SSL_shutdown`

`SSL_clear`

`SSL_free`

`SSL_set0_rbio`, `SSL_set0_wbio`, `SSL_set_bio`

`SSL_set_[rw]fd`

`SSL_get_[rw]fd`

`SSL_CTRL_MODE`, `SSL_CTRL_CLEAR_MODE`

`SSL_handle_events`

`SSL_get_event_timeout`

`SSL_set_blocking_mode`, `SSL_get_blocking_mode`

`SSL_get_rpoll_descriptor`, `SSL_get_wpoll_descriptor`

`SSL_net_read_desired`, `SSL_net_write_desired`

`SSL_want`, `SSL_want_read`, `SSL_want_write`

`SSL_set_initial_peer_addr`, `SSL_get_initial_peer_addr`

`SSL_shutdown_ex`

`SSL_stream_conclude`

`SSL_stream_reset`

`SSL_get_stream_state`

`SSL_get_stream_read_error_code`, `SSL_get_stream_write_error_code`

`SSL_get_conn_close_info`

`SSL_get0_connection`

`SSL_is_connection`

`SSL_get_stream_type`

`SSL_get_stream_id`

`SSL_new_stream`

`SSL_accept_stream`

`SSL_get_accept_stream_queue_len`

`SSL_set_incoming_stream_policy`

`SSL_set_default_stream_mode`

`BIO_s_connect`, `BIO_new_ssl_connect`, `BIO_set_conn_hostname`

`BIO_new_bio_pair`

Interactions with `BIO_f_buffer`

`BIO_sendmmsg` and `BIO_recvmmsg`

`BIO_s_dgram_pair`

`BIO_POLL_DESCRIPTOR`

`BIO_s_dgram_mem`

`BIO_err_is_non_fatal`