Audio input requirements #36
Comments
And also which is the best sample rate for input audio? :))
Hi, the model is trained to predict the quality of speech that was transmitted via telecommunication systems. It wasn't trained to predict the quality of enhanced speech, so the correlations might not be that high if you apply the model to your samples. Regarding the other question, I can give the following recommendations. Let me know if you have more questions.
(Figures taken from Deep Learning Based Speech Quality Prediction)
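For anyone who wants to check the basic properties of a recording before running it through the model, here is a minimal sketch. It is not part of NISQA: the helper name `describe_wav` is hypothetical, it uses only the Python standard library, and it assumes uncompressed PCM WAV input. It only reports what is in the file; it does not encode any particular requirement.

```python
import math
import struct
import wave


def describe_wav(path):
    """Return basic properties of a PCM WAV file.

    Hypothetical helper for illustration only; not part of NISQA.
    """
    with wave.open(path, "rb") as wf:
        channels = wf.getnchannels()
        sample_width = wf.getsampwidth()  # bytes per sample
        sample_rate = wf.getframerate()
        n_frames = wf.getnframes()
        raw = wf.readframes(n_frames)

    info = {
        "sample_rate_hz": sample_rate,
        "channels": channels,
        "bit_depth": 8 * sample_width,
        "duration_s": n_frames / sample_rate,
    }

    # Peak level is only computed for 16-bit PCM to keep the sketch short.
    if sample_width == 2 and raw:
        samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
        peak = max(abs(s) for s in samples)
        info["peak_dbfs"] = 20 * math.log10(peak / 32768) if peak else float("-inf")
    return info
```

Calling `describe_wav("sample.wav")` (the file name is a placeholder) returns a dictionary with the sample rate, channel count, bit depth, duration, and, for 16-bit files, the peak level in dBFS, which can then be compared against the recommendations.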
Are there any specific requirements for audio files to make the results of NISQA valid?
I couldn't find any documentation in this repo or the original paper describing the audio requirements, but I was hoping to use home-made recordings to evaluate the performance of speech enhancement algorithms. Can any audio be used and still give valid results?
I've been running NISQA on some local files and have found that the MOS scores don't always correlate with my subjective impression when listening to the files. Is there anything I should be doing to make these files valid for use in NISQA?
For example, are there requirements/recommendations on: