-
Notifications
You must be signed in to change notification settings - Fork 888
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Romanian Cyrillic #145
Comments
If you can provide unicode fonts that support the alphabet and text corpus for the same, I will be happy to do a training run . |
To be honest, I don;t know if they exists. Romania stopped using that in
the 19th century ( https://en.wikipedia.org/wiki/Romanian_Cyrillic_alphabet )
. I would say 60-70% of the characters are similar to regular Cyrillic.
This would be of use for historical documents.
Makes sense?
…On Sat, 6 Jul 2019 at 12:45, Shreeshrii ***@***.***> wrote:
If you can provide unicode fonts that support the alphabet and text corpus
for the same, I will be happy to do a training run .
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#145?email_source=notifications&email_token=AEAPZ5X5KENS264XTXO3TTLP6CAW7A5CNFSM4H6TDMA2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODZKYBAA#issuecomment-508919936>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEAPZ5XX7GDBAU6XTMAEG7DP6CAW7ANCNFSM4H6TDMAQ>
.
|
You'd want something like that one (public domain, digitisation by Gallica; grab PDF off the menu there): |
Yes, a good one: bilingual Romanian-French book
…On Tue, 4 Feb 2020 at 18:36, yurytch ***@***.***> wrote:
You'd want something like that one (public domain, digitisation by
Gallica; grab PDF off the menu there):
https://gallica.bnf.fr/ark:/12148/bpt6k5440993t
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#145?email_source=notifications&email_token=AEAPZ5RV2SHPBF233SBPOZ3RBGYRLA5CNFSM4H6TDMA2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEKYWXKI#issuecomment-582052777>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEAPZ5R2VOK4PZ2LU5ZXGJTRBGYRLANCNFSM4H6TDMAQ>
.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi there,
There was little digitization effort for old Romanian labgugae documents. I think that is partly due to the lack of support for OCR for Romanian Cyrillic alphabet. Can this be added to the backlog?
The text was updated successfully, but these errors were encountered: