Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can I feed the 22050 sr wav to the pre-trained rawnet3 model ? #31

Closed
predawnang opened this issue Apr 26, 2023 · 3 comments
Closed

Can I feed the 22050 sr wav to the pre-trained rawnet3 model ? #31

predawnang opened this issue Apr 26, 2023 · 3 comments

Comments

@predawnang
Copy link

predawnang commented Apr 26, 2023

Hi
I want to use rawnet3 model in my project to compute the speaker similarity of a pair of wavs. All the audio in my dataset is 22050Hz, for some reason I could down sample those audio to 16000 kHz. I wonder if the pretrained model is suitable to the 22050Hz audio.
Thanks

@Jungjee
Copy link
Owner

Jungjee commented May 3, 2023

Hi @predawnang, yes I believe RawNet3 should work on downsampled 16 kHz waveforms.
First downsample your waveforms to 16 kHz and then feed them to the model.

@predawnang
Copy link
Author

Dear author, can I directly feed the 22khz wavforms to the model, will it cause performance decrease of the model?

@Jungjee
Copy link
Owner

Jungjee commented Jun 1, 2023

I haven't tested that case, but it would be likely that the output representations aren't representative.

In short, I recommend you not to.

@Jungjee Jungjee closed this as completed Jun 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants