feat(compiler): allow unicode characters for component name as described in #8564 #8666

youngrok · 2018-08-17T04:13:23Z

resolve #8564

What kind of change does this PR introduce? (check at least one)

Does this PR introduce a breaking change? (check one)

Yes
No

If yes, please describe the impact and migration path for existing applications:

The PR fulfills these requirements:

It's submitted to the dev branch for v2.x (or to a previous version branch), not the master branch
When resolving a specific issue, it's referenced in the PR's title (e.g. fix #xxx[,#xxx], where "xxx" is the issue number)
All tests are passing: https://github.com/vuejs/vue/blob/dev/.github/CONTRIBUTING.md#development-setup
New/updated tests are included

If adding a new feature, the PR's description includes:

A convincing reason for adding this feature (to avoid wasting your time, it's best to open a suggestion issue first and wait for approval before working on it)

Other information:

resolve vuejs#8564

- use unicode letters when parsing path for watcher instead of only ascii letters - extract const `unicodeLetters` from html-parser to lang

youngrok · 2018-08-17T06:41:30Z

In addition to #8564 , I added support for unicode property path in watcher.

src/core/util/options.js

Justineo · 2018-12-10T09:12:58Z

src/core/util/lang.js

+/**
+ * unicode letters used for parsing html tags, component names and property paths.
+ * use https://www.w3.org/TR/html53/semantics-scripting.html#potentialcustomelementname
+ * except \u10000-\uEFFFF because of performance problem


I made a benchmark for this and it turns out that in most modern browsers except Safari, it runs fastest if we include \u10000-\uEFFFF:

https://jsperf.com/unicode-regex-test

Well, in my computer, npm test failed because of timeout when including \u10000-\uEFFFF. It passed without the extra characters. I don't know why the performance difference exists, yet.

Co-Authored-By: youngrok <pak.youngrok@gmail.com>

… watch paths (vuejs#8666) close vuejs#8564

cutPicturesMan · 2020-09-14T08:39:48Z

src/compiler/parser/html-parser.js

-// except \u10000-\uEFFFF because of performance problem
-export const pcenchars = '[\\-\\.0-9_a-zA-Z\\u00B7\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u037D\u037F-\u1FFF\u200C-\u200D\u203F-\u2040\u2070-\u218F\u2C00-\u2FEF\u3001-\uD7FF\uF900-\uFDCF\uFDF0-\uFFFD]'
-const ncname = `[a-zA-Z_]${pcenchars}*`
+const ncname = `[a-zA-Z_][\\-\\.0-9_a-zA-Z${unicodeLetters}]*`


'a-zA-Z' is repeat in unicodeLetters

youngrok added 2 commits August 17, 2018 13:04

feat(compiler): allow unicode characters for component name

a350926

resolve vuejs#8564

feat(core): allow unicode characters for property path in watcher

ee393db

- use unicode letters when parsing path for watcher instead of only ascii letters - extract const `unicodeLetters` from html-parser to lang

posva added the improvement label Aug 28, 2018

yyx990803 added the semver:minor label Oct 23, 2018

yyx990803 mentioned this pull request Oct 23, 2018

fix #8862: cannot watch unicode properties like 'a.中文' #8925

Closed

13 tasks

Justineo reviewed Dec 10, 2018

View reviewed changes

Update src/core/util/options.js

9ef245e

Co-Authored-By: youngrok <pak.youngrok@gmail.com>

yyx990803 added 2 commits December 26, 2018 09:35

Update lang.js

dff7849

Update lang.js

d845cf1

yyx990803 changed the base branch from dev to 2.6 December 26, 2018 14:53

yyx990803 added 2 commits December 26, 2018 09:55

Merge branch '2.6' into dev

31d5059

Update html-parser.js

dfc51bf

yyx990803 merged commit 9c71852 into vuejs:2.6 Dec 26, 2018

yyx990803 mentioned this pull request Dec 26, 2018

Invalid component name error for valid name that uses unicode characters #8564

Closed

willvincent mentioned this pull request Jan 25, 2019

cannot watch unicode properties like 'a.中文' #8862

Closed

f2009 pushed a commit to f2009/vue that referenced this pull request Jan 25, 2019

feat(compiler/watch): allow unicode characters in component names and…

1f69066

… watch paths (vuejs#8666) close vuejs#8564

cutPicturesMan reviewed Sep 14, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(compiler): allow unicode characters for component name as described in #8564 #8666

feat(compiler): allow unicode characters for component name as described in #8564 #8666

youngrok commented Aug 17, 2018

youngrok commented Aug 17, 2018

Justineo Dec 10, 2018

youngrok Dec 20, 2018

cutPicturesMan Sep 14, 2020

feat(compiler): allow unicode characters for component name as described in #8564 #8666

feat(compiler): allow unicode characters for component name as described in #8564 #8666

Conversation

youngrok commented Aug 17, 2018

youngrok commented Aug 17, 2018

Justineo Dec 10, 2018

Choose a reason for hiding this comment

youngrok Dec 20, 2018

Choose a reason for hiding this comment

cutPicturesMan Sep 14, 2020

Choose a reason for hiding this comment