Apply `escape-case` to regex literal and remove transformation of `\c` escape on string literal #294

JLHwung · 2019-05-02T21:23:32Z

This PR addresses both the issue and the feature request in #273, each in a separate commit.

Although the owner suggested using https://github.com/mysticatea/regexpp, I decided not to use it for performance reasons: a mature codebase may contain many regular expressions, and we cannot afford an extra parse + replace cycle per regular expression only to find out whether it contains an escapable sequence.

The original escapeWithLowercase pattern should work well enough on regular expressions in most situations. I have minimally modified the code to support the slightly different escape syntax of string literals and RegExp literals/objects.

Fixes #273

`\c` is not related to control characters in strings.
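
To illustrate the distinction, here is a minimal sketch (not code from this PR). In a string literal `\c` is not an escape sequence, so the backslash is dropped and the letter is kept unchanged. In a regular expression without the `u` flag, `\cA` and `\ca` both denote the control character U+0001, so changing the case there is safe. The variable names are illustrative only.

```javascript
// In a string literal, `\c` is not an escape sequence: the backslash is
// dropped and the following letter keeps its case.
const fromStringUpper = '\cA';
const fromStringLower = '\ca';

// In a regular expression, `\cA` and `\ca` are both control escapes
// for U+0001, so the case of the letter does not change the meaning.
const controlUpper = /\cA/;
const controlLower = /\ca/;

console.log(fromStringUpper === 'cA');
console.log(fromStringLower === 'ca');
console.log(controlUpper.test('\u0001'));
console.log(controlLower.test('\u0001'));
```

Uppercasing the letter after `\c` in a string would therefore change the string's value, which is why the string-literal transformation is removed.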

futpib · 2019-05-03T00:16:27Z

A slightly off-topic nitpick.

> Although the owner suggested using mysticatea/regexpp, I decided not to use it for performance reasons: a mature codebase may contain many regular expressions, and we cannot afford an extra parse + replace cycle per regular expression only to find out whether it contains an escapable sequence.

Would it really be that slow? We could eventually cache the result and reuse it in other regexp-related rules (or maybe even across projects). Even if it were substantially slower, we may value correctness and maintainability over speed, especially for a linter (unless it was really slow).

Also a mature codebase with a lot of unique regular expressions sounds like a nightmare.

I'm not saying this PR should not be accepted, only that the argument against regexpp is not convincing.
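
The caching idea mentioned above could look roughly like the following. This is a hedged sketch, not something from the PR: `createCachedParser` is a hypothetical helper, and the stand-in `parse` function only imitates a real parser so the example runs on its own.

```javascript
// Hypothetical helper: wraps any `parse(pattern, flags)` function so that
// each distinct pattern/flags pair is parsed only once.
function createCachedParser(parse) {
	const cache = new Map();
	return (pattern, flags = '') => {
		// Flags never contain `/`, so this key is unambiguous.
		const key = `${flags}/${pattern}`;
		if (!cache.has(key)) {
			cache.set(key, parse(pattern, flags));
		}

		return cache.get(key);
	};
}

// Stand-in parser (an assumption for illustration) that counts its calls.
let parseCalls = 0;
const fakeParse = (pattern, flags) => {
	parseCalls += 1;
	return {pattern, flags};
};

const cachedParse = createCachedParser(fakeParse);
const first = cachedParse('\\xa9', 'u');
const second = cachedParse('\\xa9', 'u');
console.log(first === second, parseCalls);
```

Several rules could share one such cached function, so a regular expression that appears repeatedly is parsed once per process.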

MrHen · 2019-05-03T13:28:11Z

I agree with @futpib. Performance issues should be addressed once they have been confirmed. Does the slower approach add 0.5 seconds for every 1,000 regex statements? 1 minute for every 10 regex statements?

JLHwung · 2019-05-04T20:45:15Z

> we may value correctness and maintainability over speed, especially for a linter (unless it was really slow).

Makes sense. I can spare some time to try the regexpp approach.

\{\} is redundantly escaped, {1,} is equivalent to +

JLHwung · 2019-05-08T17:32:26Z

@futpib Updated the branch. The new lowercase detection approach is:

1. Use regexpp to parse the regular expression literal.
2. Hook into the `onCharacterLeave` handler and check whether the character is the first occurrence of an escape sequence containing lowercase.
3. Report if it is invalid.
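
As a rough illustration of the detection step, the sketch below scans the raw source by hand instead of walking a regexpp AST, so that it runs without dependencies. It is a simplified stand-in, not the PR's implementation: `findLowercaseEscape` and `escapeAt` are hypothetical names, and only the `\x`, `\u`, `\u{…}` and `\c` forms are considered.

```javascript
// Matches one escape of interest, anchored at `lastIndex` (sticky flag).
const escapeAt = /\\(?:x[\dA-Fa-f]{2}|u[\dA-Fa-f]{4}|u\{[\dA-Fa-f]+\}|c[A-Za-z])/y;

// Returns the `[start, end]` offsets of the first escape sequence that
// contains a lowercase character, or `undefined` if there is none.
function findLowercaseEscape(source) {
	let index = 0;
	while (index < source.length) {
		if (source[index] !== '\\') {
			index += 1;
			continue;
		}

		escapeAt.lastIndex = index;
		const match = escapeAt.exec(source);
		if (!match) {
			// Some other escape such as `\\` or `\d`: skip both characters.
			index += 2;
			continue;
		}

		// Ignore the backslash and the kind letter (`x`, `u`, `c`).
		if (/[a-z]/.test(match[0].slice(2))) {
			return [index, index + match[0].length];
		}

		index += match[0].length;
	}

	return undefined;
}

console.log(findLowercaseEscape('/\\xa9/'));
console.log(findLowercaseEscape('/\\\\xab/'));
```

The second call shows why a plain search is not enough: there the backslash is itself escaped, so `xab` is literal text and nothing is reported.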

The fixer approach is:

1. Use regexpp to parse the regular expression literal.
2. Hook into the `onCharacterLeave` handler, check whether the character is the first occurrence of an escape sequence containing lowercase, and record the position of that escape sequence in the regular expression.
3. Replace the original escape sequence with the result of applying the string escape fixer to it.
4. Return the resulting string as the fixed regular expression literal.
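
The replacement step can be sketched as below, assuming the `[start, end]` offsets of the escape sequence are already known. `fixEscapeAt` is a hypothetical name, and uppercasing everything after the first two characters is a simplification of what the string escape fixer does.

```javascript
// Rewrites the escape sequence located at `[start, end]` in `source`,
// keeping the backslash and kind letter and uppercasing the rest.
function fixEscapeAt(source, [start, end]) {
	const escape = source.slice(start, end);
	const fixed = escape.slice(0, 2) + escape.slice(2).toUpperCase();
	return source.slice(0, start) + fixed + source.slice(end);
}

console.log(fixEscapeAt('/\\xa9/', [1, 5]));
console.log(fixEscapeAt('/\\u{1d306}/u', [1, 10]));
```

Only the recorded span is touched, so the rest of the literal, including its flags, is returned unchanged.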

sindresorhus · 2019-05-22T16:20:05Z

@JLHwung Can you give the PR a proper title that describes what it fixes?

sindresorhus · 2019-05-22T16:28:49Z

I pushed some minor formatting tweaks: 38f91bb

sindresorhus · 2019-05-22T16:21:10Z

rules/escape-case.js

+			const matches = node.raw.match(escapePatternWithLowercase);
+
+			if (matches && matches[2].slice(1).match(hasLowercaseCharacter)) {
+				escapeNodePosition = [node.start, node.end];


You cannot use .start and .end. See: #272

This case differs from `fixer.replaceTextRange([node.start, node.end])`: the node here is the AST node from regexpp, while the other node is the AST node from the ESLint parser.

Since `start` and `end` are official properties of an AST node in regexpp, I think we can leave it as-is, but I can add a comment here as a reminder.

Ah ok. I missed that. It's ok to leave it. My mistake.
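
The difference between the two kinds of node can be sketched as follows. The assumption is that regexpp offsets are relative to the regex source that was parsed, while an ESLint node carries an absolute `range` in the file. `toAbsoluteRange` and the sample node are hypothetical and not taken from the PR.

```javascript
// Converts offsets relative to the literal's raw text into offsets
// relative to the whole file, using the ESLint node's `range`.
function toAbsoluteRange(literalNode, [start, end]) {
	const offset = literalNode.range[0];
	return [offset + start, offset + end];
}

// Sample ESLint-style node: the literal `/\xa9/` begins at file offset 10.
const literalNode = {type: 'Literal', raw: '/\\xa9/', range: [10, 16]};

// `[1, 5]` is where `\xa9` sits inside the literal's raw text.
const absoluteRange = toAbsoluteRange(literalNode, [1, 5]);
console.log(absoluteRange);
```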

sindresorhus · 2019-05-22T16:36:07Z

rules/escape-case.js

@@ -30,7 +81,18 @@ const create = context => {
 				context.report({
 					node,
 					message,
-					fix: fixer => fixer.replaceTextRange([node.start, node.end], fix(node.raw))
+					fix: fixer => fixer.replaceTextRange([node.start, node.end], fix(node.raw, escapeWithLowercase))


Use `replaceText(node, …)` instead of `replaceTextRange([node.start, node.end], …)`.
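
A small sketch of the suggested change follows. The `fixer` object here is a stub that merely records the edit so the example runs without ESLint; `replaceText` and `replaceTextRange` are the real fixer method names, while `uppercaseHexEscape` and the sample node are hypothetical.

```javascript
// Stub standing in for ESLint's fixer object; it only records the edit.
const fixer = {
	replaceTextRange: (range, text) => ({range, text}),
	replaceText(node, text) {
		return this.replaceTextRange(node.range, text);
	},
};

// Simplified helper: uppercases `\xHH` escapes only and does not handle
// an escaped backslash in front of `x`.
const uppercaseHexEscape = raw =>
	raw.replace(/\\x[\da-f]{2}/gi, match => '\\x' + match.slice(2).toUpperCase());

// Sample ESLint-style node for the string literal '\xa9'.
const node = {type: 'Literal', raw: "'\\xa9'", range: [4, 10]};

// Passing the node itself avoids reading `node.start` and `node.end`.
const fix = fixer => fixer.replaceText(node, uppercaseHexEscape(node.raw));
const edit = fix(fixer);
console.log(edit);
```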

sindresorhus · 2019-05-22T16:59:51Z

rules/escape-case.js

+Find the `[start, end]` position of the lowercase escape sequence in a regular expression literal ASTNode.
+
+@param {string} value - String representation of a literal ASTNode.
+@returns {number[] | null} The `[start, end]` pair if found, or null if not.


I would prefer returning undefined instead of null.

JLHwung · 2019-05-23T22:35:46Z

Thanks for the review comments. Sorry, I am on vacation and away from my laptop for a couple of days. I will come back and revise this next week. Thank you.

JLHwung · 2019-05-30T16:08:03Z

Regarding the feedback, I have pushed fixes and they are ready for review.

sindresorhus · 2019-06-10T14:16:54Z

Thanks for working on this, @JLHwung 🙌

Fix escape-case transforming \c in string literal

ebe7bfd

`\c` is not related to control characters in strings.

JLHwung added 3 commits May 7, 2019 19:14

chore: add regexpp

8c6b40f

feat: check and fix lowercase escape sequence in regexp literal

72b70c8

refactor: tweak escape checker

5e690ea

\{\} is redundantly escaped, {1,} is equivalent to +

JLHwung force-pushed the fix-273 branch from c739302 to 5e690ea Compare May 8, 2019 17:25

Update escape-case.js

38f91bb

sindresorhus requested changes May 22, 2019

sindresorhus reviewed May 22, 2019

JLHwung changed the title ~~Fix 273~~ Apply escape-case to regex literal and remove transformation of \c escape on string literal May 22, 2019

JLHwung added 2 commits May 30, 2019 10:56

fix: follow up review comments

115d25b

fix: avoid usage of node.start/node.end

5836fc2

JLHwung force-pushed the fix-273 branch from 66c6e11 to 5836fc2 Compare May 30, 2019 15:53

MrHen approved these changes Jun 3, 2019

sindresorhus merged commit 79748e1 into sindresorhus:master Jun 10, 2019

JLHwung deleted the fix-273 branch June 10, 2019 15:23

fisker mentioned this pull request Feb 12, 2020

Simplify escape-case #531

Merged
