Do we need to discuss regular expressions? #54

aphillips · 2016-02-02T00:00:40Z

There is an issue comment in the current text that says:

Following requirements added 2013-10-29. Needs discussion of regular expressions.

Here is the proposed requirement (also in the current text):

[S][I] Specifications that define a regular expression syntax MUST provide at least Basic Unicode
Level 1 support per [UTS18] and SHOULD provide Extended or Tailored (Levels 2 and 3) support.

Should we add more discussion of regular expressions? Is that beyond the scope of our document? Should we keep the above requirement? Or should we do something different?

aphillips · 2017-01-27T22:16:41Z

We need to consider this question in WG.

aphillips · 2017-06-04T17:59:56Z

Discussed in teleconference https://www.w3.org/2017/06/01-i18n-minutes.html

The notes there are not very helpful, since we didn't record the conversation, but changes to spec to follow.

aphillips · 2017-11-25T21:02:50Z

Currently we have this text in 5.1:

Regular expression syntaxes are sometimes useful in defining a format or protocol, since they allow users to specify values that are only partially known or which can vary. The definition or use of regular expression syntaxes or wildcards when considered over the range of Unicode encoding variations, and particularly when considering character or grapheme boundaries brings with it additional considerations.

[S][I] Specifications that define a regular expression syntax MUST provide at least Basic Unicode Level 1 support per [UTS18] and SHOULD provide Extended or Tailored (Levels 2 and 3) support.

Is that enough to close? Do we need a section discussing regex?

asmusf · 2017-11-26T17:43:17Z

"Unicode encoding variations" is not a defined term, and if people look up variation, they will only find standardized variation sequences, which I am sure were not of uppermost concerns here.

I can't tell (even after looking at the original in more detail) whether the concern about "variation" was about encoding forms or normalization forms. The header of the section mentions normalization, but encoding forms are also discussed.

It may be worth noting that in some cases comparisons should be preferably done in NFD - this is the case for comparing domain names against confusable variants, to give one example.

aphillips · 2017-11-26T17:53:23Z

It's about the various and sundry different ways text can be encoded. It's not meant to be a term. That paragraph should be made clearer.

@asmusf

Address #54. Edited text to better address @asmusf's comment.

aphillips · 2017-11-27T22:28:55Z

@asmusf check the above edit and see if that works better. Suggest edits as needed.

asmusf · 2017-11-28T01:01:14Z

Definitely better.

Source code for the text is now a single very long line per paragraph, just pointing that out if it matters.

aphillips · 2018-07-14T19:44:13Z

Closing this issue. Reopen if needed

aphillips added the question label Jan 27, 2017

aphillips removed the question label Jun 4, 2017

aphillips self-assigned this Jun 4, 2017

aphillips added a commit to aphillips/charmod-norm that referenced this issue Nov 27, 2017

Address w3c#54. Edited text to better address @asmusf's comment.

7e1f463

aphillips added a commit that referenced this issue Nov 27, 2017

Merge pull request #148 from aphillips/gh-pages

86b1f31

Address #54. Edited text to better address @asmusf's comment.

aphillips added the close? label Feb 15, 2018

aphillips closed this as completed Jul 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do we need to discuss regular expressions? #54

Do we need to discuss regular expressions? #54

aphillips commented Feb 2, 2016

aphillips commented Jan 27, 2017

aphillips commented Jun 4, 2017

aphillips commented Nov 25, 2017

asmusf commented Nov 26, 2017 •

edited

Loading

aphillips commented Nov 26, 2017

aphillips commented Nov 27, 2017

asmusf commented Nov 28, 2017

aphillips commented Jul 14, 2018

Do we need to discuss regular expressions? #54

Do we need to discuss regular expressions? #54

Comments

aphillips commented Feb 2, 2016

aphillips commented Jan 27, 2017

aphillips commented Jun 4, 2017

aphillips commented Nov 25, 2017

asmusf commented Nov 26, 2017 • edited Loading

aphillips commented Nov 26, 2017

aphillips commented Nov 27, 2017

asmusf commented Nov 28, 2017

aphillips commented Jul 14, 2018

asmusf commented Nov 26, 2017 •

edited

Loading