Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update Unicode tables #1

Closed
stevengj opened this issue Jul 16, 2014 · 1 comment · Fixed by #9
Closed

update Unicode tables #1

stevengj opened this issue Jul 16, 2014 · 1 comment · Fixed by #9

Comments

@stevengj
Copy link
Member

As discussed in JuliaLang/julia#7582, utf8proc currently has the Unicode 5.0 tables. It would be good to import the database from Unicode 7.

The file data_generator.rb is a Ruby script that outlines how the Unicode 5 tables were imported, though looks like it is not fully automated. The first step would be to figure out how to re-run that on the Unicode 5 table in order to reproduce the current utf8proc_data.c. This will verify that we are importing the data correctly before we move to the new Unicode 7 data tables.

@stevengj
Copy link
Member Author

I'm tempted to re-write the scripts in Julia, as I don't know Ruby; they don't look very complicated. But we should see how difficult they are to run on the Unicode 5 data first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant