Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language pack request: Accented Belarusian #299

Closed
tryzniak opened this issue Nov 22, 2022 · 2 comments
Closed

Language pack request: Accented Belarusian #299

tryzniak opened this issue Nov 22, 2022 · 2 comments

Comments

@tryzniak
Copy link

Hello. I'd like to do it, at first, on my own, but a bit unsure how to do it. The idea is similar how you had one #8, but I want the same for Belarusian. I have a list of accented words, what to do next? Thank you for any help.

@stweil
Copy link
Contributor

stweil commented Nov 25, 2022

The langdata repository is for legacy models which are rarely used nowadays. Training of such models basically requires a minimal amount of training text which contains all desired glyphs (characters) and fonts to render images from that text.

For "modern" models which use the neural network engine, I suggest using tesstrain with text scanned from printed books. That requires much more work.

@tryzniak
Copy link
Author

Thank you for the response! I'll try to do it as you suggested. GL

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants