Added a test for identifier support across all languages #2371
Motivated by this comment, I added a test that goes through all languages and checks that identifiers aren't broken.
What does "identifiers aren't broken" mean?
It means that any identifier (`/[_a-zA-Z][_a-zA-Z0-9]*/`) will be tokenized as either one token or not at all. I.e. the identifier `foo123` would be broken if the language tokenized the `123` part as a number. The test will see how the languages handle identifiers like this and others. It will also check for numbers.

Why do we need this?
As pointed out in the comment, Markup templating (MT) assumes that its placeholders (which are identifiers) aren't broken up. If they are, MT will stop working. In the past, it caused this issue.
How is this implemented?
The test is quite simple. It has a list of identifiers (actually 3 lists) and just tests that those identifiers aren't broken for any given language. Because some languages don't have identifiers, you can selectively disable the test for a certain class of identifiers, or for all of them.
The error message of this test includes an explanation of what broken identifiers are and how to fix them. Instructions on how to disable the test are also included.
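A minimal sketch of the core check (this is not the PR's actual test code; `isBroken` is a hypothetical helper). It assumes the Prism-style token-stream shape, where tokenizing returns an array of plain strings and `{ type, content }` objects: an identifier is unbroken if it comes back as a single string (nothing matched) or as exactly one token.

```javascript
// Hypothetical helper, assuming Prism-style token streams
// (arrays of strings and { type, content } objects).
function isBroken(tokenStream) {
  // The identifier survives if the whole input comes back as
  // a single string or exactly one token.
  return tokenStream.length > 1;
}

// Simulated results of tokenizing the identifier "foo123":
const untouched = ['foo123'];                                  // fine
const oneToken  = [{ type: 'constant', content: 'foo123' }];   // fine
const broken    = ['foo', { type: 'number', content: '123' }]; // broken!

console.log(isBroken(untouched)); // false
console.log(isBroken(oneToken));  // false
console.log(isBroken(broken));    // true
```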
(The problem with the current implementation of this test is that I only do a `Prism.tokenize` on every identifier. I don't test `inside` grammars because these are usually very specific to the parent pattern, so there are almost only false positives.)

The actual changes to the languages are just boundary assertions. (I didn't just blindly throw some `\b` in there, though. I went and looked up the spec/doc of every language I didn't know.)

In some cases, I even had to change some test cases because they were wrong. Markdown changed the most because I didn't know at the time that `foo_italic_` won't make anything italic. That's fixed now. For languages that had a faulty number pattern, I didn't create any new test files because we now have this test.
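To illustrate why a boundary assertion is the fix (these patterns are illustrative only, not any specific language's actual grammar): a number pattern without `\b` matches the `123` inside `foo123` and splits the identifier, while the bounded version can't, because there is no word boundary between `o` and `1`.

```javascript
const greedyNumber  = /\d+/;     // faulty: matches inside identifiers
const boundedNumber = /\b\d+\b/; // fixed: \b fails between two word chars

console.log('foo123'.match(greedyNumber)[0]); // '123' → would break foo123
console.log('foo123'.match(boundedNumber));   // null  → identifier intact
console.log('42'.match(boundedNumber)[0]);    // '42'  → real numbers still match
```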