Why does `transforms.TimeStretch` return of type `complex64`? #3688

kuraga · 2023-11-05T12:02:57Z

🐛 Describe the bug

Good day!

https://pytorch.org/audio/2.1.0/generated/torchaudio.transforms.TimeStretch.html#torchaudio.transforms.TimeStretch.forward:

Stretched spectrogram. The resulting tensor is of the same dtype as the input spectrogram, but the number of frames is changed to ceil(num_frame / rate).

But:

s = torchaudio.transforms.Spectrogram()(x)
s.dtype  # => torch.float32

t = torchaudio.transforms.TimeStretch(fixed_rate=0.9)(s)
t.dtype  # =>  torch.complex64

Should I collect a bug report or don't I understand time stretching?

(previously posted at the forum)

Versions

torchaudio 2.1.1 from Google Colab

The text was updated successfully, but these errors were encountered:

mthrok · 2023-11-08T15:57:34Z

TimeStretch (or underlying phase_vocoder) expects input to be raw spectrogram (the one with power=None) because it manipulates the input signal in complex plane based on the phase information. It alters both phase and magnitude, then returns the complex spectrogram.

audio/src/torchaudio/functional/functional.py

Line 735 in c5b6933

    
           def phase_vocoder(complex_specgrams: Tensor, rate: float, phase_advance: Tensor) -> Tensor:

torchaudio.transforms.Spectrogram has argument power with default value of 2, which produces real-valued power spectrogram. It discards phase information. In this case, TimeStretch interprets the input signal as having zero phase everywhere.

I feel like it is more user friendly to warn or reject real-valued spectrogram input in TimeStretch.

kuraga · 2023-11-09T01:02:54Z

@mthrok , thanks!

https://pytorch.org/audio/2.1.0/generated/torchaudio.transforms.TimeStretch.html:

Seems like we need to show the way of getting the picture in the Example.
And fix the statement:

Stretched spectrogram. The resulting tensor is of the same dtype as the input spectrogram, but the number of frames is changed to ceil(num_frame / rate).

Also:

hop_length (int) or None, optional) – Length of hop between STFT windows. (Default: win_length // 2)

But there is no win_length argument.

Your idea about the warning.

mthrok · 2023-11-10T02:36:22Z

@kuraga

#3694 will fix the documentation and #3695 will add the warning if real-valued tensor is passed.

Seems like we need to show the way of getting the picture in the Example.

It is found in Feature Extraction tutorial found in the same documentation, so I will defer to it.

Addresses #3688

kuraga · 2023-11-10T10:25:26Z

@mthrok

#3694 will fix the documentation and #3695 will add the warning if real-valued tensor is passed.

Wow-wow, thanks!!

Seems like we need to show the way of getting the picture in the Example.

It is found in Feature Extraction tutorial found in the same documentation, so I will defer to it.

I meant librosa.amplitude_to_db call (or .abs().pow(2) etc. call) isn't reflected.
But now I see visualisation details are not reflected at methods' documentation.

mthrok added a commit that referenced this issue Nov 10, 2023

Warn if the input dtype to TimeStretch is not complex (#3695)

ccd78ff

Addresses #3688

kuraga closed this as completed Nov 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why does `transforms.TimeStretch` return of type `complex64`? #3688

Why does `transforms.TimeStretch` return of type `complex64`? #3688

kuraga commented Nov 5, 2023

mthrok commented Nov 8, 2023

kuraga commented Nov 9, 2023 •

edited

Loading

mthrok commented Nov 10, 2023

kuraga commented Nov 10, 2023 •

edited

Loading

Why does transforms.TimeStretch return of type complex64? #3688

Why does transforms.TimeStretch return of type complex64? #3688

Comments

kuraga commented Nov 5, 2023

🐛 Describe the bug

Versions

mthrok commented Nov 8, 2023

kuraga commented Nov 9, 2023 • edited Loading

mthrok commented Nov 10, 2023

kuraga commented Nov 10, 2023 • edited Loading

Why does `transforms.TimeStretch` return of type `complex64`? #3688

Why does `transforms.TimeStretch` return of type `complex64`? #3688

kuraga commented Nov 9, 2023 •

edited

Loading

kuraga commented Nov 10, 2023 •

edited

Loading