Can I feed the 22050 sr wav to the pre-trained rawnet3 model ? #31

predawnang · 2023-04-26T15:20:14Z

Hi
I want to use the RawNet3 model in my project to compute the speaker similarity of a pair of wavs. All the audio in my dataset is sampled at 22050 Hz; I could downsample it to 16000 Hz if needed. I wonder whether the pretrained model is also suitable for 22050 Hz audio.
Thanks

Jungjee · 2023-05-03T02:14:51Z

Hi @predawnang, yes I believe RawNet3 should work on downsampled 16 kHz waveforms.
First downsample your waveforms to 16 kHz and then feed them to the model.
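
As an illustration of the downsampling step, here is a minimal sketch that assumes the audio is already loaded as a 1-D NumPy array. The helper name `resample_to_16k` is hypothetical and not part of the RawNet3 code; it uses SciPy's polyphase resampler, which is only one of several reasonable choices.

```python
from math import gcd

import numpy as np
from scipy.signal import resample_poly

TARGET_SR = 16000  # sample rate recommended in this thread


def resample_to_16k(wav: np.ndarray, orig_sr: int, target_sr: int = TARGET_SR) -> np.ndarray:
    """Resample a 1-D waveform to target_sr (hypothetical helper, not from RawNet3)."""
    if orig_sr == target_sr:
        return wav.astype(np.float32)
    # Reduce the rate ratio to lowest terms: 22050 -> 16000 becomes up=320, down=441.
    g = gcd(orig_sr, target_sr)
    up, down = target_sr // g, orig_sr // g
    # resample_poly applies an anti-aliasing low-pass filter internally.
    return resample_poly(wav, up, down).astype(np.float32)


# Usage: one second of a synthetic 440 Hz tone stands in for a real recording.
orig_sr = 22050
t = np.arange(orig_sr) / orig_sr
wav_22k = np.sin(2 * np.pi * 440.0 * t).astype(np.float32)
wav_16k = resample_to_16k(wav_22k, orig_sr)
print(wav_16k.shape, wav_16k.dtype)
```

The resampled array can then be passed to the model in place of the original 22050 Hz waveform.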

predawnang · 2023-05-07T07:13:32Z

Dear author, can I feed the 22 kHz waveforms to the model directly? Would that degrade the model's performance?

Jungjee · 2023-06-01T23:31:16Z

I haven't tested that case, but the output representations would likely not be representative of the speaker.

In short, I recommend that you don't.
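
One plausible reason for this, which was not tested in this thread, is that a model expecting 16 kHz input treats every sample as 1/16000 s apart, so all frequencies in a 22050 Hz recording appear scaled by 16000/22050. The sketch below shows that effect on a synthetic tone only; `dominant_freq` is a hypothetical helper and no RawNet3 code is involved.

```python
import numpy as np


def dominant_freq(wav: np.ndarray, assumed_sr: int) -> float:
    """Return the strongest frequency in wav, interpreting it at assumed_sr."""
    spectrum = np.abs(np.fft.rfft(wav))
    freqs = np.fft.rfftfreq(len(wav), d=1.0 / assumed_sr)
    return float(freqs[np.argmax(spectrum)])


true_sr = 22050
t = np.arange(true_sr) / true_sr
tone = np.sin(2 * np.pi * 440.0 * t)  # a 440 Hz tone recorded at 22050 Hz

f_true = dominant_freq(tone, true_sr)  # interpreted at the correct rate
f_as_16k = dominant_freq(tone, 16000)  # interpreted as if it were 16 kHz audio
print(f_true, f_as_16k)
```

Read at the wrong rate, the 440 Hz tone shows up at roughly 319 Hz, so pitch and formant cues would be shifted relative to what the model saw during training.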

Jungjee closed this as completed Jun 28, 2023
