Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bulgarian #9

Closed
thisismattmiller opened this issue Jan 17, 2023 · 5 comments
Closed

Bulgarian #9

thisismattmiller opened this issue Jan 17, 2023 · 5 comments
Milestone

Comments

@thisismattmiller
Copy link
Member

Bulgarian letter "Ъ" and "'ь" do not Romanize, which makes the text unreadable. Amazingly, the following letters were Romanized perfectly: Ѣ ѫ ѧ ю я


Bulgarian Er-malak (small yer) with ALA-LOC sign " ʹ " (soft sign) missing after ScriptShift romanization

Romanized Ŭ for Bulgarian Ъ in the beginning and middle of words does not appear when using ScriptShift. The same is true for the romanized ʺ (hard sign) for Bulgarian Ъ at the end of the words.

image


Bulgarian "Ъ, ъ", known as Er-golyam (large yer), phonetic transcription “ă”, ALA-LC Romanization “ŭ” or ʺ - not present after ScriptShifter romanization

Romanized Ŭ for Bulgarian Ъ in the beginning and middle of words does not appear when using ScriptShift. The same is true for the romanized ʺ (hard sign) for Bulgarian Ъ at the end of the words.

image

@scossu
Copy link
Collaborator

scossu commented Mar 16, 2023

Can you provide text examples (rather than screenshots) including the expected output string?

@paulfrank7
Copy link
Collaborator

Script in resource:

Българският народъ между европейскитѣ раси и народи : сказка, държана предъ учредителното събрание на Съюза на естественицитѣ въ България на 12. януарии 1938 г. / отъ д-ръ Методий Поповъ, професоръ по обща биология при Софийския университетъ

Expected romanization:

Blgarskii︠a︡t narodʺ mezhdu evropeĭskiti︠e︡ rasi i narodi : skazka, drzhana predʺ uchreditelnoto sŭbranie na Sŭi︠u︡za na estestvenit︠s︡iti︠e︡ vʺ Bŭlgarii︠a︡ na 12. i︠a︡nuarii 1938 g. / otʺ d-rʺ Metodiĭ Popovʺ, profesorʺ po obshta biologii︠a︡ pri Sofiĭskii︠a︡ universitetʺ.

@thisismattmiller
Copy link
Member Author

This would require hooks. But also:

"For Bulgarian, at least, I'd suggest that this isn't even necessary. The hard sign at the end of words is only in pre-1945 texts, of which we get very few. Yes, it comes up occasionally, but we could just be expected to pay attention and make manual fixes in those rare cases."

@scossu
Copy link
Collaborator

scossu commented May 25, 2023

If you decide to implement this, I can assist. Even if it's rarely useful, it could be a good exercise for building hooks.

@scossu
Copy link
Collaborator

scossu commented Dec 4, 2023

Closing as there seems to be no request to implement this at the moment.

@scossu scossu closed this as completed Dec 4, 2023
@scossu scossu added this to the Phase 2 milestone Feb 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants