Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for UAX#14 line breaking classes #7

Merged
merged 2 commits into from
Mar 7, 2016
Merged

Conversation

sebras
Copy link
Contributor

@sebras sebras commented Mar 6, 2016

Hi!

I have recently been implementing support for the UAX#14 linebreaking algorithm
in mupdf. While doing so I needed to access the linebreaking
class for each code point and thought that this would make sense to have in ucdn.
This enlarges each UCDRecord by a single byte. I have also implemented support
in the python script to extract the information from the unicode-files and have
re-generated the database.

I hope this passes your code review, if not let me know how to help out in getting
this into ucdn proper. :)

/ Sebastian

@sebras
Copy link
Contributor Author

sebras commented Mar 6, 2016

In case you want to read up on the specifcation it is located here.

Extract the linebreak class for each code point from the Unicode text
files and embed them in the database. Add a function for retrieving the
linebreak class given a code point. Add a second function resolving the
linebreak class according to the fallback outlined in LB1 of UAX#14.

This takes UAX#14 revision 35 into account as revised for Unicode 8.0.0.
@grigorig
Copy link
Owner

grigorig commented Mar 7, 2016

OK, looks good to me.

grigorig added a commit that referenced this pull request Mar 7, 2016
Support for UAX#14 line breaking classes
@grigorig grigorig merged commit c8e5e8c into grigorig:master Mar 7, 2016
@sebras sebras deleted the uax14 branch July 4, 2021 00:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants