-
Notifications
You must be signed in to change notification settings - Fork 292
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increasing accuracy on Local Build #88
Comments
Are you running the code locally from source, or from a DMG release? Mac or
|
Linux. Yes, Locally from Source. |
@strob : ping! I tried to understand the codebase of gentle, and a lot of things make sense now. But still, the quality of locally done alignments don't match up to the alignments done on gentle server. I still don't know if building kaldi with cuda enabled will help. Can you guide me on this? |
There's no reason I can think of that alignment would be more accurate on On Sun, Aug 14, 2016, 5:30 PM Utkarsh Saxena notifications@github.com
|
It is reasonable that on my local machine (4GB RAM) the accuracy of alignment is somewhat jittered. There's an offset of 0.02 - 0.04 seconds between the gentle server and my local build.
Compare the CSV generated on gentle server with the CSV generated on my local build.
An example with 20 - 160 ms offset.
Another example with an offset of 2 seconds
I am sorry, for the naive requests that follow, I just started exploring Kaldi as a tool. I have no prior experience with ASR systems.
What can I do to increase the accuracy on my local build?
Now, I need these timestamps to do a research project. Specifically, I need to segment the audio on the basis of word boundaries. And gentle was the best available tool, from a developer's perspective. As I am not even a beginner in ASR and other such tools.
I believe that if I hire an amazon instance, this will not be a problem. But they are quite expensive.
Also, can anyone direct me if there is any other language model that might work better for English?
Meanwhile I will dive into the code, to understand it better.
Thanks
The text was updated successfully, but these errors were encountered: