-
-
Notifications
You must be signed in to change notification settings - Fork 238
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rework - IMPORTANT #64
Comments
Some updates:
|
|
Ideas for better propagation https://guide.esciencecenter.nl/best_practices/communication.html
I am also thinking about adding tests and setting up some continuous integration like travis CI |
Hi, I'm having trouble understanding the readme files. Any Youtube video that can explain how to get the datasets and creating the envs. Most of the packages are unavailable for installation. |
Hi @SRK-returns, which branch do you use? The |
Currently, the project is undergoing big reorganization. The new code is in
rework
branch and once this issue is closed it will be merged withmaster
. What will be new:This brings some breaking changes. I recommend moving to the new code because I will no longer fix the issues from the old versions.
Model retraining
With the new version, some old models may become incompatible. Also, the old models were trained only on a small dataset. This requires large retraining. I would appreciate any help with this task because I have only limited access to some computation clouds.
Dropping support of Czech accents
The Czech accents will be removed from the words. Keeping only some text files which allow recovery of them. This solves some compatibility issues with different OS. Also, models trained on this dataset weren't very accurate.
However, as a school project, I will be creating software which automatically adds Czech accents to sentences. This is an only partial solution of the problem, but I don't have enough data for successful recognition of them anyways.
The text was updated successfully, but these errors were encountered: