New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AxisError when signal contains silence #21
Comments
Thanks for raising the issue. |
I used version 0.2, a fresh install at least doesn't crash, thank you! But still, I doubt that returning a small number is the right thing to do. In the above example, doing, |
There is not enough frames to built a intermediate intelligibility index, so we cannot asses intelligibility with STOI in this case. In wsj0-2mix, there is one for which is always happens for me, but only one. Do you have more than one? |
This problem occurs for one example in the test (tt min) data, two in the training (tr min) data, and one in the cross-validation (cv min) data. I think it is not a big deal to ignore them. |
Yes, I meant for testing, but you're right that there are example in train and val. |
The stoi function produces an error if a reference signal only contains a short piece of speech. This seems to be caused by the removal of silent frames.
This is a minimal example using WSJ0-2mix data. Replace
wsj0_2mix_root
with the root to the WSJ0-2mix data. You might have to remove the suffix_2
if you have a newer version of the WJ0-2mix database:Is this a bug in the implementation or a general flaw of the STOI metric? Do you have a suggestion on how to handle this issue?
The text was updated successfully, but these errors were encountered: