New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Definition of a "character" is wrong #44150
Comments
Python's definition of a character does not match that Unicode's definition of a character can be found here: Python seems to use the Code Units option given here: |
Logged In: YES The Python string type is not at all Unicode compliant, so I |
Logged In: YES Sorry, I wasn't clear. I only intended this to be about the |
Logged In: YES Ok. Can you come up with a patch? |
Logged In: YES Not at the moment. |
Anyone brave enough can find the mentioned definitions in the thread http://mail.python.org/pipermail/python-dev/2008-July/080886.html |
See this talk for an explanation of the various Unicode terms and how http://www.egenix.com/library/presentations/#PythonAndUnicode Also note that the Unicode standard has evolved a lot since Unicode And finally: don't forget that Python provides ways of *working* with |
Without patch, I don't see how this issue can be moved forward. Adding a list of such Unicode term definitions would at best cause additional confusion and only address people knowledgable in the Unicode field. Note that Python's use of code units and code points matches those of the Unicode standard in most respects. Glyphs and all higher-level definitions are out-of-scope for Python. |
Can this be tied in with the work being done on the unicode howto bpo-20906? |
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
bugs.python.org fields:
The text was updated successfully, but these errors were encountered: