This simple Python script classifies Unicode characters for bidi text processing
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.gitignore
BidiData.txt
README.md
classify_unicode_chars.py

README.md

unicode-classifier

This simple Python script classifies Unicode characters for bidi text processing. To use it, download http://www.unicode.org/Public/UNIDATA/UnicodeData.txt to same directory and run:

python classify_unicode_chars.py

The script will generate a file called BidiData.txt with a C-compatible array containing weak, netutral and strong RTL characters. Feel free to generate your own structures.