
Suspending audio context for battery usage reasons #317

Closed
cwilso opened this issue May 5, 2014 · 11 comments

@cwilso
Contributor

cwilso commented May 5, 2014

We've run into a significant request to reduce battery consumption. In short, we need a way to suspend use of the audio hardware device.

Unfortunately, given the current design, we have no way of doing this; the best we can tell users is "drop all references to AudioContexts", but even then, until garbage collection happens there's no guarantee the hardware device context will be freed up (enabling it to suspend and stop consuming power).

Raymond and I have discussed this in some detail, and I think our preferred option is that we provide two methods on the AudioContext:

void suspend();
Promise resume();

...that will put the system into suspension (the AudioContext's currentTime stops progressing, all sound output is paused, and the audio hardware can power down) and asynchronously wake it up again to resume processing (which may take some amount of [real] time before the audio system is processing data again).

This not only removes the unpredictability of when the audio system shuts down, but also lets us account for the latency hiccup of restarting.
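
A minimal sketch of how a page might use the proposed pair. Everything here is hypothetical until specced; `attachPowerSaving` and the `visibilitychange` wiring are just one plausible trigger, not spec text:

```javascript
// Hypothetical usage of the proposed suspend()/resume() pair: suspend
// when the tab is hidden so the audio hardware can power down, and
// resume (asynchronously) when the user returns.
function attachPowerSaving(ctx, doc) {
  doc.addEventListener('visibilitychange', () => {
    if (doc.hidden) {
      ctx.suspend();              // currentTime stops progressing here
    } else {
      // resume() returns a promise because waking the hardware takes
      // real time; audio code should wait for it before scheduling.
      ctx.resume().then(() => {
        console.log('audio running again at', ctx.currentTime);
      });
    }
  });
}
```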

@cwilso cwilso added this to the V1 milestone May 5, 2014
@cwilso cwilso self-assigned this May 5, 2014
@padenot
Member

padenot commented May 5, 2014

FWIW, Gecko already does this (pausing the audio hardware) for the same reasons; for example, some parts of FirefoxOS use the Web Audio API to output system sounds, and battery drain is a real concern in that context.

Our solution works, but has the drawback of not taking latency into account (roughly, we pause the underlying audio stream when we see that only the destination node is present in the graph).

I'd rather, as you say, have an explicit API than rely on the behavior of the engine.

Random things that need to be specced (maybe there are more):

  • MediaStreams should have their output ignored by the graph (they are real time and we don't want to buffer everything).
  • HTMLMediaElements can either be paused or have their output ignored by the graph; we need to decide, since we can do both. Then again, that's tricky, because an HTMLMediaElement can have a source that is a real-time MediaStream.
  • ScriptProcessorNode events: should they drain the last bits of audio to the consumer, or just not fire if suspend() has been called? What would the buffers look like: a shortened buffer for input, or a full buffer filled with silence? If the events are not fired, should they be queued for when the context is resume()d?
    • Should the context stop immediately and throw away the hardware buffers, or drain the remaining audio? Basically, which guarantees do we put on the suspend() call?
  • If we don't drain, would calling suspend() play back queued buffers from last time, or are they lost?
  • Do we assume, for AnalyserNode window functions, that the period before the resume() is silence, or the last audio data played back?

@cwilso
Contributor Author

cwilso commented May 5, 2014

Yeah, the introduced latency or clock-that-doesn't-progress was the problem with trying to do this automatically.

You've correctly found the one piece I didn't elaborate on. :) Real-time source nodes - MediaStreams and HTMLMediaElements - should have their output ignored while the AudioContext is paused.

ScriptProcessors shouldn't be a problem - they should pause just like the destination node, and events should not be fired. They would not get "queued", per se - but once resume() is called, the events would start firing again as time progresses.

I don't think we should drain the buffers, and yes, the playback would continue from the last played frame. And yes, for windowing purposes, the last audio data played back would be retained.
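
The semantics agreed on above (real-time input dropped rather than buffered, ScriptProcessor events not fired, clock frozen) can be modeled as a tiny render-callback sketch. `renderQuantum` and its arguments are made-up names for illustration, not part of any API:

```javascript
// Illustrative model of suspension semantics: while suspended,
// real-time sources keep running in wall time but their output is
// simply discarded, no events fire, and currentTime does not advance.
function renderQuantum(ctx, inputSamples, fireScriptProcessorEvent) {
  if (ctx.suspended) {
    return null;  // real-time input dropped, not buffered; no events
  }
  fireScriptProcessorEvent(inputSamples);
  ctx.currentTime += inputSamples.length / ctx.sampleRate;
  return inputSamples;  // passed on toward the destination node
}
```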

@padenot
Member

padenot commented May 5, 2014

Well, I think we agree, then. Any other thoughts from actual users of the API instead of implementers :-) ?

@cwilso
Contributor Author

cwilso commented May 5, 2014

:) (Note there's an implicit users-of-API ask here from us, since this came from an internal team using Web Audio.)


@jernoble
Member

jernoble commented May 5, 2014

This looks very similar to issue #72, which I opened for similar reasons.

@cwilso
Contributor Author

cwilso commented May 6, 2014

Hmm. There are three related but somewhat orthogonal things wrapped up here:

  1. Exclusive access to sound hardware.
  2. Pausing audio graph/processing for code scenario reasons (e.g. pausing while answering a call)
  3. Avoiding battery drain by enabling the audio hardware to power down, while accounting for the latency inherent in reviving the audio hardware.

Jer, you wrapped the first and second together; I actually would like to explicitly let audio apps ask for exclusive access on systems where exclusive access gives better latency characteristics (e.g. Windows!), and I see how that would tie together with #2. I only list #2 and #3 separately because I think #2 quickly implies pausing PARTS of the audio graph - for example, if a gamer hits Pause, the game sound may pause while the menu that pops up frequently still has active UI feedback audio. That's much harder to explain (separate timelines for each audio pathway), unless we just say that's a multiple-AudioContext scenario. I'd like to explicitly avoid that, and keep one "currentTime" per AudioContext.

Perhaps what we need here is

  1. suspend and resume methods, so the developer can explicitly suspend/resume. If the system has suspended you (e.g. for a phone call), resume()'s promise may not return for a while.
  2. an event that fires when your AudioContext is suspended (since, as in your scenario, this may happen when the system suspends you without your code's involvement).
  3. a way to request "exclusive mode" on systems that have one. My preference would be to only allow this at AudioContext startup, but at the very least, you should only be able to do it while the audio is suspended.
  4. a running/suspended state exposure, as you suggested.

WDYT?
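
Points 1, 2, and 4 of this proposal can be sketched as a small state machine. This is a mock, not spec text; the names (`state`, `onstatechange`, the state strings) are illustrative:

```javascript
// Hypothetical shape of an AudioContext with explicit suspend/resume,
// a state change notification, and a readable state.
class MockAudioContext {
  constructor() {
    this.state = 'running';
    this.onstatechange = null;
  }
  _setState(next) {
    this.state = next;                               // point 4: exposure
    if (this.onstatechange) this.onstatechange(next); // point 2: event
  }
  suspend() {
    this._setState('suspended');  // point 1: clock stops, hardware idles
  }
  resume() {
    // Resuming is asynchronous: the hardware may take real time to wake.
    return Promise.resolve().then(() => this._setState('running'));
  }
}
```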

@padenot
Member

padenot commented Jun 16, 2014

(1) is necessary.
(2) It might be nice to have an event for when the stream is restarted as well, because restarting can be super long. We put probes in Firefox to measure this, and we now have numbers: one for the first time an audio stream is created for the process, and one for creations after the first (we were pretty sure that the initial stream creation for a given process would set up a bunch of IPC machinery between the mixer process and the Firefox process, and so would be longer). Numbers are in milliseconds, and if you look at the histogram at the bottom, the tail is depressingly long. This is across all platforms (someone on the WebRTC team at Mozilla has a MacBook Pro where he regularly sees an 8.5-second stream creation time; that's what made us actually measure it in the field).
(3) has already been requested by some people I've met. This would allow apps to use ASIO drivers on Windows, for example, or WASAPI exclusive mode (although the same people have told me that the support is pretty buggy most of the time).
(4) I agree: it's cheap to do, and the developer would likely implement something like it anyway, so it makes sense to have it directly on the context.

@jernoble
Member

jernoble commented Jun 16, 2014

On Jun 16, 2014, at 6:31 AM, Paul ADENOT notifications@github.com wrote:

(1) is necessary.

Agreed.

(2) it might be nice to have an event for when the stream is restarted as well, because it can be super long. […]

So a “suspended” and a “resumed” event (or some equivalent name)? Sounds fine to me.

(3) has already been requested by some people I met. This would allow apps to use ASIO drivers on Windows, for example, or WASAPI exclusive mode (although the same people have told me that the support is pretty buggy most of the time)

Why should this particular use case be exposed through the API? Would a web page try "regular mode" and, if that fails, retry with "exclusive mode"? Or would web developers just start always requesting "exclusive mode" because it "works better on Windows"?

Maybe I’m missing something about what an “exclusive mode” to an AudioContext would allow UAs on Windows to provide.

(4) I agree, it's cheap to do, and it's likely that the developer would implement something like that anyways, so it makes sense to directly have it on the context.

Agreed.

-Jer

@padenot
Member

padenot commented Jun 17, 2014

On 16/06/2014 19:38, Jer Noble wrote:

Why should this particular use case be exposed through API? Would a web page try “regular mode”, and if that fails retry with “exclusive mode”? Or would web developers start to always request “exclusive mode” because that “works better on windows”?

Maybe I’m missing something about what an “exclusive mode” to an AudioContext would allow UAs on Windows to provide.

It's a mode where an application takes exclusive control of the audio hardware to get lower latency (by bypassing the mixing stage, for example). It's not really clear to me that the people requesting it have actually measured the latency: we can get between 10 and 30 ms on desktop OSes using the built-in audio stacks these days, and the Android situation is improving, so I think this can wait until there is a clear need.

@cwilso
Contributor Author

cwilso commented Aug 25, 2014

We should additionally provide a "release" method to forcibly release unneeded AudioContexts.
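
A sketch of what such a "release" could look like (illustrative names; in the Web Audio API as eventually shipped, this role is played by AudioContext.close()). Once released, the context frees its hardware resources immediately, without waiting for garbage collection, and cannot be resumed:

```javascript
// Hypothetical model of an explicit release on an AudioContext-like
// object: deterministic teardown instead of GC-dependent cleanup.
class ReleasableContext {
  constructor() { this.state = 'running'; }
  release() {
    this.state = 'closed';  // hardware device freed right away
    return Promise.resolve();
  }
  resume() {
    if (this.state === 'closed') {
      // A released context is gone for good; resuming must fail.
      return Promise.reject(new Error('context has been released'));
    }
    this.state = 'running';
    return Promise.resolve();
  }
}
```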

@bjornm

bjornm commented Aug 26, 2014

Will these methods be available on OfflineAudioContexts, too? Please see my comment on #21.

Nephyrin pushed a commit to Nephyrin/mozilla-git that referenced this issue Aug 27, 2014
This has a race somewhere, so we disable it for now.

The real fix will come from the Web Audio API Suspend API [0]

[0]: WebAudio/web-audio-api#317
@cwilso cwilso modified the milestones: V1, Web Audio v.1, Web Audio Last Call 1, Web Audio v 1.0 Oct 17, 2014
@cwilso cwilso added the "Editorial/Documentation" and "Needs Edits" labels Oct 18, 2014
cwilso added a commit to cwilso/web-audio-api that referenced this issue Oct 20, 2014
@cwilso cwilso added the "Ready for Review" label and removed the "Needs Edits" label Oct 20, 2014
rogerwang pushed a commit to nwjs/blink that referenced this issue Jan 28, 2015