Expose flow control information to transform #206

alvestrand · 2023-09-29T08:22:26Z

Discussion of the late fanout use case at TPAC 2023 led to the realization that for this use case, and also for others such as "additional frame-level metadata", we need to expose congestion control signals from the transport via the packetizer to the transform, and allow them to be modified before (possibly) passing them on to the encoder (or, in the late fanout use case, to the incoming stream that it is doing fanout for).

Similarly, there may be signals that a decoder pipeline wishes to pass on to the depacketizer and to the transport, such as a keyframe request. These signals should also be exposed in the JavaScript API.

A first suggestion would be to expose a series of events (one per feedback type identified) on the downstream interface of the transform, with a corresponding series of methods on the upstream interface of the transform that take the same parameters as the content of the incoming events. This would make "default handling" easy: if the transform does not want to process a signal, it connects the event directly to the corresponding method.
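
A minimal sketch of that event/method pairing, using plain `EventTarget` objects as stand-ins. Everything named here is hypothetical and only for illustration: the `bitrateestimate` event, the `FeedbackEvent` class, the `sendBitrateEstimate()` method, and the `connect()` helper are not part of any specification.

```javascript
// Hypothetical feedback event carrying one congestion control signal.
class FeedbackEvent extends Event {
  constructor(type, detail) {
    super(type, { cancelable: true });
    this.detail = detail;
  }
}

// Stand-in for the upstream interface: one method per feedback type,
// taking the same parameters as the corresponding event carries.
class MockUpstream {
  constructor() {
    this.received = [];
  }
  sendBitrateEstimate(detail) {
    this.received.push(detail);
  }
}

// Wire the downstream event to the upstream method. With the default
// identity `modify`, this is the "default handling" pass-through.
function connect(downstream, upstream, modify = (detail) => detail) {
  downstream.addEventListener("bitrateestimate", (event) => {
    upstream.sendBitrateEstimate(modify(event.detail));
  });
}

const downstream = new EventTarget();
const passThrough = new MockUpstream();
const capped = new MockUpstream();

connect(downstream, passThrough);
// A transform that modifies the signal before passing it on.
connect(downstream, capped, (detail) => ({
  ...detail,
  bitsPerSecond: Math.min(detail.bitsPerSecond, 500000),
}));

downstream.dispatchEvent(
  new FeedbackEvent("bitrateestimate", { bitsPerSecond: 1000000 })
);
```

The pass-through case needs no transform-specific code beyond the wiring, which is the property the suggestion is after.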

alvestrand · 2023-09-29T08:22:59Z

This resembles the proposal outlined in https://github.com/alvestrand/hackathon-encoded-media

youennf · 2023-09-29T09:35:56Z

RTCRtpScriptTransformer seems like an appropriate place for such an API. If using events, Event.preventDefault() could be used to disable the default handling.

For instance, a keyFrameRequested event could be defined and fired when receiving PLI/FIR. Default handling would be to call the generate key frame algorithm. A web app could decide to be smarter by calling preventDefault() on the event and then calling generateKeyFrame() itself whenever appropriate.
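
A rough sketch of how the preventDefault() pattern could behave, assuming a mock transformer built on `EventTarget`. The `MockTransformer` class, the `keyframerequest` event name, and the `receiveKeyFrameRequest()` method are hypothetical, and `generateKeyFrame()` is a synchronous counter here rather than the real promise-returning method.

```javascript
// Hypothetical stand-in for RTCRtpScriptTransformer.
class MockTransformer extends EventTarget {
  constructor() {
    super();
    this.keyFramesGenerated = 0;
  }

  // Simplified: the real method is asynchronous.
  generateKeyFrame() {
    this.keyFramesGenerated += 1;
  }

  // Simulates the UA receiving a PLI/FIR from the network.
  receiveKeyFrameRequest() {
    const event = new Event("keyframerequest", { cancelable: true });
    // dispatchEvent() returns false if a listener called preventDefault().
    const runDefault = this.dispatchEvent(event);
    if (runDefault) {
      this.generateKeyFrame();
    }
  }
}

// No listener: the default handling generates a key frame.
const defaultTransformer = new MockTransformer();
defaultTransformer.receiveKeyFrameRequest();

// The app takes over: default handling is suppressed and the app
// decides when to call generateKeyFrame() itself.
const appTransformer = new MockTransformer();
appTransformer.addEventListener("keyframerequest", (event) => {
  event.preventDefault();
});
appTransformer.receiveKeyFrameRequest();
const generatedBeforeAppDecision = appTransformer.keyFramesGenerated;
appTransformer.generateKeyFrame();
```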

steely-glint · 2023-09-29T11:14:08Z

Side note here on keyFrameRequested:

We found that you can get repeated key frame requests, which you need to ignore when you know a keyframe is still in flight. Likewise, you probably want to remove any frames in an output queue if they predate the keyframe. So keyFrameRequested handling is by no means stateless, which might make it an awkward match for an event.
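
To illustrate the queue-pruning part, here is a small sketch, assuming the app keeps its own output queue as an array of frame-like objects with a `type` field of `"key"` or `"delta"`. The helper name `dropFramesBeforeLastKeyFrame()` and the queue itself are hypothetical.

```javascript
// Drop every queued frame that predates the most recent key frame.
// If the queue holds no key frame, it is returned unchanged.
function dropFramesBeforeLastKeyFrame(queue) {
  let lastKeyIndex = -1;
  queue.forEach((frame, index) => {
    if (frame.type === "key") {
      lastKeyIndex = index;
    }
  });
  return lastKeyIndex === -1 ? queue : queue.slice(lastKeyIndex);
}

const outputQueue = [
  { type: "delta", timestamp: 1 },
  { type: "delta", timestamp: 2 },
  { type: "key", timestamp: 3 },
  { type: "delta", timestamp: 4 },
];
const pruned = dropFramesBeforeLastKeyFrame(outputQueue);

const deltasOnly = [{ type: "delta", timestamp: 5 }];
const untouched = dropFramesBeforeLastKeyFrame(deltasOnly);
```

The state involved (the queue contents and whether a key frame is pending) lives outside any single event, which is the statefulness concern raised above.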

youennf · 2023-09-29T11:36:31Z

By default, it is up to the UA to not trigger too many key frames.
If the web app wants to handle it itself, it could be something like:

let isKeyFrameInflight = false;
const reader = transformer.readable.getReader();
const writer = transformer.writable.getWriter();
transformer.onKeyFrameRequest = async (e) => {
    e.preventDefault();
    // Do not ask for multiple key frames.
    if (isKeyFrameInflight)
        return;
    isKeyFrameInflight = true;
    try {
        await transformer.generateKeyFrame();
    } finally {
        // Reset the flag even if generateKeyFrame() rejects.
        isKeyFrameInflight = false;
    }
};

And skipping of frames could be done like:

async function doTransform()
{
    const chunk = await reader.read();
    if (chunk.done)
        return;
    // Do not write chunks if we are waiting for a key frame.
    if (!isKeyFrameInflight) {
        await transformChunk(chunk.value);
        await writer.write(chunk.value);
    }
    return doTransform();
}

Does this look ok?

steely-glint · 2023-09-29T11:48:10Z

Yep, looks good.

However, you now have to require that transformer.generateKeyFrame() returns and resets isKeyFrameInflight before the first keyframe chunk is given to doTransform(); otherwise you'll drop the keyframe.

youennf · 2023-09-29T11:56:29Z

> However, you now have to require that transformer.generateKeyFrame() returns and resets isKeyFrameInflight before the first keyframe chunk is given to doTransform(); otherwise you'll drop the keyframe.

Right!
And this is the current intent of the spec and WebKit implementation:

> By resolving the promises just before enqueuing the corresponding key frame in a [RTCRtpScriptTransformer](https://w3c.github.io/webrtc-encoded-transform/#rtcrtpscripttransformer)'s readable, the resolution callbacks of the promises are always executed just before the corresponding key frame is exposed.

The intent is also for generateKeyFrame to be a no-op if a previous generateKeyFrame call has not yet resolved (see [[pendingKeyFrameTasks]] management).
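
A rough model of that coalescing behavior, not the spec algorithm itself. The `KeyFrameCoalescer` class, its `encoderRequests` counter, and the `keyFrameProduced()` hook are hypothetical names used to show the idea: calls made while a request is pending do not trigger another encoder request, and all pending promises are resolved when the key frame is produced.

```javascript
class KeyFrameCoalescer {
  constructor() {
    this.pendingResolvers = []; // loosely models [[pendingKeyFrameTasks]]
    this.encoderRequests = 0;   // how often the encoder was actually asked
  }

  generateKeyFrame() {
    // The executor runs synchronously, so the resolver is stored
    // before the length check below.
    const promise = new Promise((resolve) => {
      this.pendingResolvers.push(resolve);
    });
    // Only the first pending call reaches the encoder.
    if (this.pendingResolvers.length === 1) {
      this.encoderRequests += 1;
    }
    return promise;
  }

  // Called by the (mock) encoder just before the key frame would be
  // enqueued on the readable side.
  keyFrameProduced(value) {
    const resolvers = this.pendingResolvers;
    this.pendingResolvers = [];
    for (const resolve of resolvers) {
      resolve(value);
    }
  }
}

const coalescer = new KeyFrameCoalescer();
coalescer.generateKeyFrame();
coalescer.generateKeyFrame();
coalescer.generateKeyFrame();
const requestsWhilePending = coalescer.encoderRequests;

coalescer.keyFrameProduced(1234);
coalescer.generateKeyFrame();
const requestsAfterKeyFrame = coalescer.encoderRequests;
```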

alvestrand · 2023-10-02T13:15:43Z

Sounds good. I'll prepare a PR that adds events and corresponding listeners / action functions, and we can use that as a basis for discussion.

alvestrand mentioned this issue Oct 4, 2023

Congestion control API proposal #207

Merged

alvestrand added the enhancement label Oct 5, 2023

foolip mentioned this issue Dec 1, 2023

WebRTC “end-to-end-encryption” web-platform-tests/interop#533

Closed

3 tasks

alvestrand closed this as completed in #207 Dec 12, 2023
