Equally Divided Subband and Complex Spectrogram #6

sophia1488 · 2021-12-07T08:33:50Z

Hi, I've read your paper and have a few questions,

As lower frequency contains more information, will the result be better if the subband is not equally divided? Perhaps log-scale? (and maybe add another transformation so that they have the same shape to concatenate.)
I'd like to try it, yet I'm not sure how the filters (models/filters/*.mat) are generated.
If I understand it correctly, the U-Net cannot see the phase information,

2021-ISMIR-MSS-Challenge-CWS-PResUNet/models/resunet_conv1_vocals/model.py

Line 195 in 2f84db8

sp, cos_in, sin_in = self.f_helper.wav_to_mag_phase_subband_spectrogram(input)

since only sp is forwarded to U-Net.
I've tried adding phase on other channels, so that the input to the U-Net will be (batch, channel*2, time, frequency), and the rest of the code is the same. But the result is worse. Do you have any thoughts on this?

Thanks a million!

The text was updated successfully, but these errors were encountered:

haoheliu · 2021-12-08T02:56:19Z

@sophia1488 Hi, you made very good points! Below is my understanding.

Yes definitely. I divide the band equally so that it can be easily modeled by CNN and simple concatenation. If you divide the band unequally, like the filter below, you can use some projection layers to transform them into the same shape for concatenation. Alternatively, you can use other architecture instead of pure CNN to fit with subbands with different dimensions. The filter can be built using this tool.
Yes. We only forward sp to U-Net because it only needs to predict phase variations instead of phase information itself. In this case, original phase information becomes less important. You can try to add phase information on each channel. But personally, I suppose original phase information is quite complex and highly nonstructural, making it hard to learn from.

Welcome more discussions!

sophia1488 · 2021-12-09T02:07:18Z

Hello, thanks a lot for your quick reply and detailed comments! I'll dig into this.
Just one more question, is it okay for you to provide the command for the filter above? Since I'm not familiar with filters and I'd like to reproduce it.

Thank you 😊

haoheliu · 2021-12-09T02:53:24Z

@sophia1488 I've already lost the original code. But you can start with trying the following command:

[h,f,distortion,total_aliasing]=Filter_Bank_Design([1,2,3,4],[0.1,0.2,0.3,0.4],[0.02,0.04,0.08,0.16],64,32,100,20000,[1 1],[]);

sophia1488 closed this as completed Dec 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Equally Divided Subband and Complex Spectrogram #6

Equally Divided Subband and Complex Spectrogram #6

sophia1488 commented Dec 7, 2021

haoheliu commented Dec 8, 2021 •

edited

sophia1488 commented Dec 9, 2021

haoheliu commented Dec 9, 2021

Equally Divided Subband and Complex Spectrogram #6

Equally Divided Subband and Complex Spectrogram #6

Comments

sophia1488 commented Dec 7, 2021

haoheliu commented Dec 8, 2021 • edited

sophia1488 commented Dec 9, 2021

haoheliu commented Dec 9, 2021

haoheliu commented Dec 8, 2021 •

edited