AnalyserNode design issues #86

olivierthereaux · 2013-09-11T14:28:12Z

Originally reported on W3C Bugzilla ISSUE-17361 Tue, 05 Jun 2012 11:49:46 GMT
Reported by Philip Jägenstedt
Assigned to

Audio-ISSUE-74 (RealtimeAnalyserNode): RealtimeAnalyserNode design [Web Audio API]

http://www.w3.org/2011/audio/track/issues/74

Raised by: Philip Jägenstedt
On product: Web Audio API

The RealtimeAnalyserNode has so many problems that we will put them into a single issue.

The use case appears to be probing/polling the signal for visualization, where it does not matter if all of the signal is available. It also doesn't modify the output, so it need not delay the processing at all, like JavaScriptAudioNode would.

The problems identified with the current spec are:

It is undefined how multi-channel input maps to time/frequency data, which are both single arrays.
The layout/order of the frequency bins is undefined. Are the negative frequencies included?
It is undefined what happens if the array to the getters has more elements than frequencyBinCount.
smoothingTimeConstant is defined only as "A value from 0 -> 1 where 0 represents no time averaging with the last analysis frame." How does it affect time/frequency domain data? What is an analysis frame and which is the last?
If frequencyBinCount == fftSize / 2, why is it exposed at all?
minDecibels/maxDecibels are undefined. Do minDecibels/maxDecibels control the output of getByteFrequencyData, or do they describe it? Are the parameters only used for getByteFrequencyData()? If so, why are they not arguments to that method? How does the Uint8 range (0-255) map to the decibel range (minDecibels-maxDecibels)?
What happens if fftSize is set to something that is not a power of two? Are there any limits? Are 1 and 2^32 both valid values?
If the fftSize is initially set to 1, then changed to 2^32, what should the getters do? To not restrict this requires unbounded buffering to handle an arbitrarily large fftSize.
It's not at all clear why there are Uint8Array getters, instead of simply a frequency domain and time domain getter, both as Float32Array.

For the use case we're aware of, this can be simplified greatly. We'd prefer an interface that just allows probing the most recent time domain data as an AudioBuffer and leave it up to the Web developer to perform the FFT by other means. A fast, generic FFT function can be very useful not only for visualization, but also for synthesis, filters etc. In the absence of a native FFT implementation (which could be part of another specification - perhaps add it to the Math object), a custom JavaScript FFT implementation will most likely suffice for most applications.

For example:

interface AudioProbe : AudioNode {
    // get the most recent data available.
    AudioBuffer getData();
}

// in AudioContext, the size must be given up-front and cannot change
AudioProbe createAudioProbe(in unsigned long bufferSize);

Depending on how https://www.w3.org/2011/audio/track/issues/28 is resolved, we could simply have an attribute "AudioBuffer data" that is guaranteed to be stable while the script is executing, to avoid the use of a getter function altogether.

The text was updated successfully, but these errors were encountered:

joeberkovitz · 2014-10-23T15:42:47Z

It seems that the clarifications suggested by Philip represent a set of non-breaking-change edits to the spec that would be useful.

cwilso · 2014-10-23T16:25:54Z

Related: #377

joeberkovitz · 2015-10-14T16:32:37Z

Fix plan for this issue:

"It is undefined how multi-channel input maps to time/frequency data, which are both single arrays.": Added explicit down-mixing operation to FFT computation.

"The layout/order of the frequency bins is undefined. Are the negative frequencies included?":
"It is undefined what happens if the array to the getters has more elements than frequencyBinCount."
"smoothingTimeConstant is defined only as "A value from 0 -> 1 where 0 represents no time averaging with the last analysis frame." How does it affect time/frequency domain data? What is an analysis frame and which is the last?"
Previously addressed.

"If frequencyBinCount == fftSize / 2, why is it exposed at all?"
I don't see a problem retaining frequencyBinCount although it seems unnecessary

"minDecibels/maxDecibels are undefined. Do minDecibels/maxDecibels control the output of getByteFrequencyData, or do they describe it? Are the parameters only used for getByteFrequencyData()? If so, why are they not arguments to that method? How does the Uint8 range (0-255) map to the decibel range (minDecibels-maxDecibels)?"

"What happens if fftSize is set to something that is not a power of two? Are there any limits? Are 1 and 2^32 both valid values?"

"If the fftSize is initially set to 1, then changed to 2^32, what should the getters do? To not restrict this requires unbounded buffering to handle an arbitrarily large fftSize."

Previously addressed.

"It's not at all clear why there are Uint8Array getters, instead of simply a frequency domain and time domain getter, both as Float32Array."
I don't see a strong case for removing the Uint8 version at this point although I also fail to understand why it exists.

joeberkovitz · 2015-10-14T16:33:42Z

If anyone feels strongly that getByteFrequencyData() or frequencyBinCount should be removed from the AnalyserNode spec (see original comments by Philip) please say so. Otherwise I plan to retain.

rtoy · 2015-10-14T17:55:42Z

On Wed, Oct 14, 2015 at 9:32 AM, Joe Berkovitz notifications@github.com
wrote:

I don't see a strong case for removing the Uint8 version at this point
although I also fail to understand why it exists.

I'm guessing Chris Rogers added this to make visualizations using the time
and frequency data easier since the user doesn't have to do scaling of the
float values himself.

—
Reply to this email directly or view it on GitHub
#86 (comment)
.

Ray

joeberkovitz · 2015-10-14T20:02:49Z

@rtoy Thanks. Please let me know on #629 if this is good to merge.

Fix #86 by clarifying channel down-mixing.

rtoy · 2015-12-07T17:01:22Z

Just wanted to note that this has memory and/or cpu implications because you either have to keep all the channels in memory just in case someone calls getFoo (at which point you downmix), or you always have to downmix just in case.

mdjp added the Architectural/Fundamental (Breaking change) label Jun 25, 2014

joeberkovitz added the V1 (TPAC 2014) label Oct 23, 2014

cwilso changed the title ~~(RealtimeAnalyserNode): RealtimeAnalyserNode design~~ AnalyserNode design issues Oct 23, 2014

joeberkovitz mentioned this issue Oct 23, 2014

Specify what AnalyserNode should do #28

Closed

cwilso added Clarification (Requires change) Needs Edits Decision has been made, the issue can be fixed. https://speced.github.io/spec-maintenance/about/ and removed Architectural/Fundamental (Breaking change) labels Oct 30, 2014

cwilso added this to the Web Audio Last Call 1 milestone Oct 30, 2014

joeberkovitz self-assigned this Oct 14, 2015

joeberkovitz mentioned this issue Oct 14, 2015

Fix #86 by clarifying channel down-mixing. #629

Merged

joeberkovitz closed this as completed in dabb1da Dec 3, 2015

rtoy added a commit that referenced this issue Dec 3, 2015

Merge pull request #629 from WebAudio/86-analyzer-node-issues

ba06f71

Fix #86 by clarifying channel down-mixing.

rtoy mentioned this issue Feb 12, 2016

Analyser downmixing unclear #719

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AnalyserNode design issues #86

AnalyserNode design issues #86

olivierthereaux commented Sep 11, 2013

joeberkovitz commented Oct 23, 2014

cwilso commented Oct 23, 2014

joeberkovitz commented Oct 14, 2015

joeberkovitz commented Oct 14, 2015

rtoy commented Oct 14, 2015

joeberkovitz commented Oct 14, 2015

rtoy commented Dec 7, 2015

AnalyserNode design issues #86

AnalyserNode design issues #86

Comments

olivierthereaux commented Sep 11, 2013

joeberkovitz commented Oct 23, 2014

cwilso commented Oct 23, 2014

joeberkovitz commented Oct 14, 2015

joeberkovitz commented Oct 14, 2015

rtoy commented Oct 14, 2015

joeberkovitz commented Oct 14, 2015

rtoy commented Dec 7, 2015