ExtremaSpectrum

ExtremaSpectrum is a .NET library for audio signal analysis based on an extrema-driven oscillation decomposition algorithm.

This is not an FFT spectrum. The output describes detected local oscillations by frequency, not orthogonal sinusoidal components. See Algorithm.

Repository: github.com/DimonSmart/ExtremaSpectrum

Visual overview

Visual frequency inspection for sound.
ExtremaSpectrum decomposes a waveform into local oscillations, making dominant frequency bands and each reduction pass easy to inspect.

Try the demo with a low-frequency focused view:

dotnet run --project src\DimonSmart.ExtremaSpectrum.Demo -- --min-frequency 0 --input ".\data\demo-low-010hz-plus-high-160hz.wav" --bins 180 --to-bin 20

AnalyzeDetailed(...) returns an ExtremaAnalysisReport with per-pass spectra, accepted oscillations, support ranges, and waveform snapshots.
The demo app can also export step-by-step SVG frames:

dotnet run --project src/DimonSmart.ExtremaSpectrum.Demo -- --input data/demo-low-010hz-plus-high-160hz.wav --export-step-images temp

Example frames generated from data/demo-low-010hz-plus-high-160hz.wav:

Install

dotnet add package DimonSmart.ExtremaSpectrum

The current package targets .NET 10.

Quick start

using DimonSmart.ExtremaSpectrum;

var analyzer = new ExtremaSpectrumAnalyzer(new ExtremaSpectrumOptions
{
    BinCount = 128,
    MinFrequencyHz = 500f,
    MaxFrequencyHz = 8000f,
    MaxPasses = 12
});

AnalysisResult result = analyzer.Analyze(samples, sampleRate);
float[] spectrum = result.Spectrum;

Streaming

var analyzer = new StreamingExtremaSpectrumAnalyzer(
    options,
    analysisWindowSamples: 2048,
    hopSamples: 512);

if (analyzer.PushPcm16(buffer, format, out var result))
{
    float[] spectrum = result!.Spectrum;
}

if (analyzer.PushDetailedPcm16(buffer, format, out var report))
{
    IReadOnlyList<ExtremaPassSnapshot> passes = report!.Passes;
}

Algorithm

The analyzer performs iterative decomposition of a discrete signal into local oscillations.

One pass

Find all local extrema using strict neighbour comparison. Boundary points are excluded.
Scan left to right for consecutive triples in the form min -> max -> min or max -> min -> max.
For each accepted triple (left, mid, right):
- periodSamples = right - left
- L = floor(midpoint(left, mid))
- R = ceil(midpoint(mid, right))
- when the next accepted triple reuses the same slope, its left boundary starts from the already chosen R
- baseline = (signal[left] + signal[right]) / 2
- amplitude = abs(signal[mid] - baseline)
- frequencyHz = sampleRate / periodSamples
If the oscillation passes the period and amplitude filters and maps into a valid bin, its contribution is added to the spectrum.
Samples strictly between L and R are removed from later passes, while the midpoint boundary samples are preserved.
The scan advances by one extremum, so adjacent accepted oscillations can share the same preserved midpoint boundary.

Multiple passes are performed until no valid triple is found or MaxPasses is reached.

Extrema detection

local max: signal[i - 1] < signal[i] && signal[i] >= signal[i + 1]
local min: signal[i - 1] > signal[i] && signal[i] <= signal[i + 1]

Flat plateaus use the first point that satisfies the comparison.

Frequency binning

binWidth = (MaxFrequencyHz - MinFrequencyHz) / BinCount
binIndex = floor((frequencyHz - MinFrequencyHz) / binWidth)

Input formats

All analysis overloads need audio samples plus a sample rate in Hz. The difference is how decoded the input already is.

Method	What you pass	Where the sample rate comes from	Notes
`Analyze(ReadOnlySpan<float> samples, int sampleRate)`	Mono float samples, one `float` per sample, typically normalized to `[-1, +1]`	`sampleRate` argument	Use when audio is already decoded to mono PCM values.
`AnalyzePcm16(ReadOnlySpan<byte> buffer, AudioBufferFormat format)`	Raw signed 16-bit PCM sample bytes, little-endian	`format.SampleRate`	`format` also defines channel count, layout, and mono downmix. `buffer` must contain whole sample frames, not a WAV file header.
`AnalyzeFloat32(ReadOnlySpan<byte> buffer, AudioBufferFormat format)`	Raw 32-bit IEEE float PCM sample bytes, little-endian	`format.SampleRate`	Same as `AnalyzePcm16`, but each sample uses 4 bytes instead of 2.

AudioBufferFormat describes how byte buffers are interpreted:

var format = new AudioBufferFormat
{
    SampleRate = 48000,
    Channels = 2,
    BitsPerSample = 16,
    Interleaved = true,
    ChannelMixMode = ChannelMixMode.AverageAllChannels
};

AnalysisResult result = analyzer.AnalyzePcm16(buffer, format);

Key AudioBufferFormat fields:

SampleRate: samples per second, for example 44100 or 48000
Channels: total channel count, for example 1 for mono or 2 for stereo
BitsPerSample: 16 for AnalyzePcm16(...), 32 for AnalyzeFloat32(...)
Interleaved = true: samples are laid out as L0 R0 L1 R1 ...
Interleaved = false: planar layout, for example all left samples followed by all right samples
PreferredChannel: zero-based channel index used only when ChannelMixMode is PreferredChannel

AnalyzeDetailed(...) follows the same input rules as Analyze(...). The streaming analyzer uses the same PCM buffer rules for PushPcm16(...) and PushFloat32(...).

Multi-channel buffers are mixed to mono according to ChannelMixMode:

Mode	Behavior
`FirstChannel`	Use channel 0
`PreferredChannel`	Use `AudioBufferFormat.PreferredChannel`
`AverageAllChannels`	Arithmetic mean of all channels

Configuration

new ExtremaSpectrumOptions
{
    BinCount = 128,
    MinFrequencyHz = 100f,
    MaxFrequencyHz = 8000f,
    MaxPasses = 16,
    MinPeriodSamples = 2,
    MaxPeriodSamples = 0,
    MinAmplitude = 0f,
    AccumulationMode = AccumulationMode.Amplitude
}

Limitations

Not FFT. Bin values are accumulated oscillation amplitudes or energies, not DFT coefficients.
There is no windowing, zero-padding, or spectral leakage correction.
Frequency resolution improves with longer input buffers.
The greedy left-to-right pass does not resolve overlap inside the same pass.
Results depend on signal amplitude. Normalize input if absolute comparison matters.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github		.github
data		data
docs		docs
src		src
tests/DimonSmart.ExtremaSpectrum.Tests		tests/DimonSmart.ExtremaSpectrum.Tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
ExtremaSpectrum.slnx		ExtremaSpectrum.slnx
LICENSE		LICENSE
README.md		README.md
test-on-test-samples.bat		test-on-test-samples.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ExtremaSpectrum

Visual overview

Install

Quick start

Streaming

Algorithm

One pass

Extrema detection

Frequency binning

Input formats

Configuration

Limitations

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ExtremaSpectrum

Visual overview

Install

Quick start

Streaming

Algorithm

One pass

Extrema detection

Frequency binning

Input formats

Configuration

Limitations

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages