-
Notifications
You must be signed in to change notification settings - Fork 640
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revise parameters for Kaldi mfcc compatibility test #689
Comments
I would like to work on this. |
Thanks. Do you know what are good parameters for |
Not off top of my head. Let me study it first for a day and come up with a proposal. |
Hi @mthrok, I would like to follow this approach with some questions:
Question 1: How should we store and keep the default values for fbank and mfcc up to date? Recommendation - cache the default fbank values and the override values in json. In future, revise manually if kaldi default argument values or example datasets default argument values change. Question 2: How should we handle when some datasets don't have fbank config or don't have mfcc config? Recommendation - we should only use configs from datasets for testing fbank or mfcc if they have the respective config. Example: Switchboard has both fbank and mfcc config, thus we will use both for testing. Example: librispeech only stores mfcc config, thus we will not use librispeech for testing fbank Question 3: What should we do with Currently |
I am not quite sure what you mean by cache, but in terms of JSON data, I think providing empty arguments BTW: Currently kaldi used in test CI is updated manually and I do it from time to time by building a new Docker file and pushing it. Although we plan to update it automatically, we do not know when that will happen. Also, note that there are some parameter discrepancies on parameters due to inconsistent design. Kaldi expects full range wave form where as typical
Yes, that makes sense.
Also making tests depend on external resource (networking, files stored elsewhere) increase maintenance cost, so we would like to refrain from doing it. Parsing help message of executables is plausible because it's available but let's defer on that one. We can discuss the extra value of doing that once we have a good set of values to test. |
Similar to #679
We should also revise the parameters for mfcc test.
See also #681
The text was updated successfully, but these errors were encountered: