Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Are you planning on doing anything with this? #1

Open
simoncozens opened this issue Jan 24, 2020 · 3 comments
Open

Are you planning on doing anything with this? #1

simoncozens opened this issue Jan 24, 2020 · 3 comments

Comments

@simoncozens
Copy link

I recently started a similar thing - almost the same name, too! I looked around and found this. In my version I use the flat files instead of the XML. (It turns out to be much faster to parse.):

$ PYTHONPATH=Lib python3 -m youseedee ద
Downloading Unicode Character Database...
[==================================================]
{'Age': '1.1',
 'Block': 'Telugu',
 'Canonical_Combining_Class': '0',
 'East_Asian_Width': 'N',
 'General_Category': 'Lo',
 'Indic_Syllabic_Category': 'Consonant',
 'Line_Break': 'AL',
 'Name': 'TELUGU LETTER DA',
 ...

$ time PYTHONPATH=Lib python3 -m youseedee ద > /dev/null
PYTHONPATH=Lib python3 -m youseedee ద > /dev/null  0.67s user 0.10s system 96% cpu 0.801 total

$ time PYTHONPATH=Lib python3 -m youseedy ucd.nounihan.grouped.xml ద > /dev/null
PYTHONPATH=Lib python3 -m youseedy ucd.nounihan.grouped.xml ద > /dev/null  6.61s user 0.56s system 98% cpu 7.287 total

I don't know whether it's better to keep working on mine or contribute to this.

@ebraminio
Copy link

Guess is related harfbuzz/harfbuzz@771712b

@behdad
Copy link
Member

behdad commented Jan 28, 2020

Hey...

I read the XML because it contains all data. My goal with this package was to read those and use packTab and other tricks to generate fast optimized libraries for C and w Python bindings to access data fast. In that model doesn't matter if loading the XML is slow.

But I don't see myself working on it. So, do what you need to. I'd be happy if you take over this module and discuss your needs and do your work here. Or doesn't matter, we can kill this / archive it.

PackTab is nice though. The code is horrible, but works for now.

@simoncozens
Copy link
Author

Here's my thing: https://github.com/simoncozens/youseedee

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants