Clarify character set constraints on name (Add a link to the Unicode spec) #273

Closed
ndeloof opened this issue Sep 6, 2019 · 5 comments


@ndeloof

ndeloof commented Sep 6, 2019

The spec says:

name MUST contain only characters from the Unicode graphic characters
The Unicode graphic character set allowed for name includes letters, numbers, punctuation, symbols, and spaces; it does not include newlines or tabs.

I'm not sure this definition is clear enough about which characters are supported. For instance, what does "symbols" mean? Can I use Klingon characters? Emojis? Unicode Character 'BACKSPACE' (U+0008)?

Maybe clarify by using the explicit Unicode General Categories, Letter (L), Number (N), etc., and explicitly exclude Control et al. (C).

@technosophos
Member

"Unicode Graphics Characters" is a defined set of Unicode characters.

In Unicode, Graphic characters are those with General Category Letter, Mark, Number, Punctuation, Symbol or Zs=space. Other code points (General categories Control, Zl=line separator, Zp=paragraph separator) are Format, Control, Private Use, Surrogate, Noncharacter or Reserved (unassigned)

https://www.unicode.org/versions/Unicode5.2.0/ch02.pdf#G25564
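
To make the definition above concrete, here is a minimal Python sketch that tests characters against those General Categories. It is not taken from the spec or any reference implementation; the helper names `is_graphic` and `invalid_name_chars` are hypothetical, and the results depend on the Unicode database version bundled with the Python interpreter.

```python
import unicodedata
from typing import List

# Major General Category classes treated as "graphic" per the quoted definition:
# Letter (L), Mark (M), Number (N), Punctuation (P), Symbol (S).
# The only separator that counts is Zs (space); Zl and Zp are excluded.
GRAPHIC_MAJOR_CLASSES = {"L", "M", "N", "P", "S"}


def is_graphic(ch: str) -> bool:
    """Return True if the single character ch is a Unicode graphic character."""
    category = unicodedata.category(ch)  # two-letter code, e.g. "Lu", "Cc", "Zs"
    return category[0] in GRAPHIC_MAJOR_CLASSES or category == "Zs"


def invalid_name_chars(name: str) -> List[str]:
    """Hypothetical helper: list the code points in name that are not graphic."""
    return [f"U+{ord(ch):04X}" for ch in name if not is_graphic(ch)]
```

Under this check, a space and an emoji such as U+1F600 pass, while BACKSPACE (U+0008), tab, and newline are rejected as Control (Cc) characters. Private Use code points, where Klingon is conventionally mapped, are category Co and are rejected as well.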

@technosophos
Member

Oops... the link was to the spec, and the quote was from the Wikipedia entry on graphic characters.

https://en.wikipedia.org/wiki/Graphic_character

@ndeloof
Author

ndeloof commented Sep 6, 2019

Oh, OK. I googled for it but didn't find anything obvious.
A link would then keep fools like me from making such a mistake ;)

@technosophos changed the title from "Clarify character set constraints on name" to "Clarify character set constraints on name (Add a link to the Unicode spec)" on Sep 6, 2019
@technosophos
Member

Thanks! I updated the title and will do a PR next week.

@technosophos
Member

The PR was merged a long time ago, so I am closing this issue.
