Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode Database and Related APIs #140

Open
my2iu opened this issue May 3, 2021 · 1 comment
Open

Unicode Database and Related APIs #140

my2iu opened this issue May 3, 2021 · 1 comment

Comments

@my2iu
Copy link

my2iu commented May 3, 2021

This is not directly related to segmentation, but it would also be useful if you added an API that provides access to the Unicode database. When a web app uses a custom text renderer to lay out text vertically or when implementing the BIDI algorithm, the code needs to look up character classes in the Unicode databases. This information is also needed when doing text shaping. It's annoying to package up a Unicode database to include in web pages, especially when the web browser already knows all that information.

Having an API for the BIDI algorithm might also be nice.

While I'm rambling, it would also be great if someone could hire an intern to figure out how to incrementally download CJK webfonts on-demand. Right now, CJK users are at a disadvantage to Western users in the use of fonts because CJK webfonts are so big that it makes for a slow download, so it would be nice if someone figured out whether it's possible to chop up these fonts into smaller chunks to make for a smaller, on-demand download.

@sffc
Copy link

sffc commented May 8, 2021

Hi @my2iu, there is an upstream issue for Unicode properties: tc39/ecma402#90. Please upvote and comment on that issue to help get it prioritized. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants