Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

definition of character #263

Closed
r12a opened this issue Sep 9, 2016 · 8 comments
Closed

definition of character #263

r12a opened this issue Sep 9, 2016 · 8 comments
Labels
i18n-needs-resolution Issue the Internationalization Group has raised and looks for a response on. SVG Core Text chapter

Comments

@r12a
Copy link

r12a commented Sep 9, 2016

11.1.1. Definitions
https://svgwg.org/svg2-draft/text.html#Definitions

character

A character is an atomic unit of text as defined in XML [XML].

Essentially, a Unicode character. ...

The wording here is still a little vague. We generally recommend referring to a character as a Unicode code point.

We also recommend using specific terms, where available, rather than the general term 'character'.

@r12a r12a added the i18n-needs-resolution Issue the Internationalization Group has raised and looks for a response on. label Sep 9, 2016
@AmeliaBR AmeliaBR added this to the SVG Core clean-up milestone Sep 9, 2016
@nikosandronikos
Copy link
Member

I think it's important that we maintain consistent terminology with CSS, so where they use 'character', so should we.
Though I suspect that generally we should be using 'typographic character' in most places where 'character' is used.

@nikosandronikos
Copy link
Member

Ok my understanding has improved somewhat since I last commented.

Curious @r12a why you recommend referring to a character as a unicode code point?
I agree that 'character' is vague and should be avoided, but a 'user perceived character', which is what grapheme clusters approximate can require multiple code points. I don't see how relating 'character' to 'code point' helps anything.

It looks like we need to go through the text chapter and ensure we only use the following terms:

  • code point
  • typographic character
  • what else do we need?

@Tavmjong
Copy link
Contributor

I agree "character" is a bit vague (so is "typographic character" but maybe bit less). The definition of "character" is from SVG 1.1. I believe it is meant to correspond to a Unicode point. In terms of input, a 'u' with a combining '`' would be two points while using the preformed 'ù' is one point. This has mostly to do with how the 'x', 'y', ... attributes are matched to the input.

@nikosandronikos
Copy link
Member

If it's meant to correspond to an XML character as the description says then it would be equivalent to a 'typographic character' rather than a code point.
My concern is that in the text that came from SVG 1.1 it might be used to mean either.

@AmeliaBR
Copy link
Contributor

"character" in SVG 1.1 was used in a multiple senses. SVG 2 distinguishes "typographic character" (an indivisible unit, used for rendering) versus "addressable character" (a 2-byte block when normalizing the string to UTF-16, after adjusting for whitespace and display, used for counting). Both are distinct from "Unicode code point", which is what the general definition is referring to.

I agree that we should make sure any normative text refers to either typographic character, addressable character, or Unicode code point. The general definition for "character" could also emphasize that the term is a little imprecise, and cross-link to those more specific definitions later in the terms list.

@fantasai
Copy link

I think this means I should take an action item to update the extremely outdated /TR publication of CSS3 Text so that you can refer to its definitions. =) (You'll still need your own for addressable character... we don't have that in CSS3 Text.)

@Tavmjong
Copy link
Contributor

@fantasai Please do update CSS3 Text! That would be very helpful!

@dirkschulze
Copy link
Contributor

Discussion moved to #537

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
i18n-needs-resolution Issue the Internationalization Group has raised and looks for a response on. SVG Core Text chapter
Projects
None yet
Development

No branches or pull requests

7 participants