Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Upper and Lower Sorbian, aka Wendish #1994

Open
alerque opened this issue Feb 7, 2024 · 2 comments
Open

Add support for Upper and Lower Sorbian, aka Wendish #1994

alerque opened this issue Feb 7, 2024 · 2 comments
Labels
enhancement Software improvement or feature request
Milestone

Comments

@alerque
Copy link
Member

alerque commented Feb 7, 2024

Interesting feedback here typst/typst#3235 (comment) adding (lower) Sorbian and Croatian to the list, and confirming Czech and Slovak.

Sorbian is a minority language (< 50000 people), it doesn't have a 2-letter language codes. Unless mistaken the 3-letter codes are hsb (Upper Sorbian), dsb (Lower Sorbian) and wen (Sorbian or "Wendish" collectively)

Originally posted by @Omikhleia in #1963 (comment)


While dealing with the explicit hyphen repetition handling I skipped Sorbian (which we now know can use that alternative code) because we don't have a language support file or hyphenation patterns for it at all. Since I'm guessing we can probably apply some other language's patters to it this shouldn't be too hard to add. Maybe after BCP-47?

@alerque alerque added the enhancement Software improvement or feature request label Feb 7, 2024
@alerque alerque added this to the v0.x.y milestone Feb 7, 2024
@Omikhleia
Copy link
Member

Since I'm guessing we can probably apply some other language's patters

You mean porting https://github.com/hyphenation/tex-hyphen/blob/master/hyph-utf8/tex/generic/hyph-utf8/patterns/tex/hyph-hsb.tex for Upper Sorbian, I assume.

@Omikhleia
Copy link
Member

This would be an occasion to split hyph patterns from the segmenter in different files, add their origin details and the scripts that were used to build them into a Lua table -- so we could check they are up-to-date and have the tooling for re-generating them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Software improvement or feature request
Projects
Development

No branches or pull requests

2 participants