-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Discussion] Confusion about different sample rates #503
Comments
Hi @expectopatronum
Using "mask_extension":"average", you should have separated signals that sum up to the mix. For older versions of spleeter (i.e. <=1.5.3), you may have differences at the very beginning or the very end of the signal due to the way STFT/iSTFT were managed, but that should be fixed from 1.5.4. Let us know if you still have troubles with recent versions. |
Thanks for your reply! I just checked, I am using version 1.5.4 which I installed not too long ago using
After
I get the following:
How would it work anyways? There are not '-16khz' configs available in the config folder?
Regarding you last comment I was trying with spleeter 1.5.4 and spleeter 2.0, but I still get differences. Should I take a delta into account when comparing? |
I tried different configurations but I haven't been able to make the output files actually be 48kHz audio files, am I missing something? or is it not currently possible to make the whole chain work at 48kHz or even potentially 88.2 or 96kHz? (I understand that given that the models were trained @44.1 the performance might be lower at a different SR but I'd still like to be able to produce 48kHz files) Also, Is there a reason for the output to be 16bits on wav files? is there a way to make this 24 or potentially 32FP? |
Oh! I figured out the 48kHz :) |
Hi!
Until quite recently I assumed that the models were trained using 44100 Hz, since this is also the parameter in the config. I noticed that for some of the tracks I used, there is actually a difference between the mix and the summed sources. I read all the issues related to this (#2, #15, #106) and tried to follow what's written there and also the section in the FAQ but as you can read in my comment the outputs still don't sum up to the mix.
I have a few question:
separator = Separator('spleeter:5stems-16khz')
fails withspleeter.SpleeterError: No embedded configuration 5stems-16khz found
. Should this work or does it only work with the command line interface?separator.separate(...)
(and maybe it's still wrong - see below)In
5stems_average.json
I set "mask_extension":"average", and tried "F" with 1536 and 1024.For some input audios the diff is 0 but for most tracks it is not.
Thanks a lot for the great package and pretrained models! I hope there is an easy fix for my problem.
Thanks and best regards
Verena
The text was updated successfully, but these errors were encountered: