Charinfo fails to link newline characters (and several others)

Currently, the !charinfo command runs `unicodedata.name()` to figure out what name a character should be given. However, `unicodedata.name()` does not support control characters like the newline characters such as \n, the null character, or a variety of other ones that have unicode names as aliases but not official "recognized" names which are often "<control>" for some reason. 

There's an open bug on cpython for a few years now, but it doesn't look like there's much activity (https://github.com/python/cpython/issues/71683)

Unfortunately it doesn't look like there's an easy standard library hack either (please correct me if I'm wrong!) because `unicodedata` is generated C, then compiled, and the only interface to the data is through lookup functions (and we'd want to go the other direction, from character to aliases). I've also had no luck in looking at additional libraries, so any actual solution to this might just fall down to a hardcoded list. 

At the very least, we should consider "cleaning up" the output so that instead of injecting empty space we use the character id perhaps like `\0x0a`, for newline, for the link name so it doesn't visually break?

(ex: https://discord.com/channels/267624335836053506/635950537262759947/1111690082441769011)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Charinfo fails to link newline characters (and several others) #2611

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Charinfo fails to link newline characters (and several others) #2611

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions