Use custom word boundary matchers #35

mkhl · 2016-03-08T12:20:20Z

The regexp engine and the shell grammar don't agree on what comprises a “word boundary”. This leads to problems where a command or path ending in a shell keyword would be interpreted as that keyword.

This change replaces \b word boundary matchers with lookahead/-behind matchers on whitespace, line breaks, and command separators (; and &).

It also eliminates ad-hoc custom word boundaries, where subsets of [-=/] were pre- or appended to some word boundaries, which fixed some similar problems.

This is basically a cherry-pick of textmate/shellscript.tmbundle@cb6e72e, modified for atom.

winstliu · 2016-03-08T20:40:59Z

Can you please add some specs to this to ensure that nothing regressed or will regress?

The regexp engine and the shell grammar don't agree on what comprises a “word boundary”. This leads to problems where a command or path ending in a shell keyword would be interpreted as that keyword. This change replaces `\b` word boundary matchers with lookahead/-behind matchers on whitespace, line breaks, and command separators (";" and "&"). It also eliminates ad-hoc custom word boundaries, where subsets of [-=/] were pre- or appended to some word boundaries, which fixed some similar problems.

mkhl · 2016-03-15T15:23:10Z

I've added a bunch of specs for cases that weren't handled before.

for…in loops and case blocks seem covered well enough already (except at least one spec is wrong, but that's the subject of #36), which leaves while and until loops, and function declarations.

Do you want me to add specs for their "happy paths" as well?

winstliu · 2016-09-12T15:19:05Z

spec/shell-unix-bash-spec.coffee


-    expect(tokens[0]).toEqual value: 'iffy', scopes: ['source.shell']
+    for string of strings


Sorry for the long delay...but shouldn't this be for string in strings?

Um, yes, I think it should be. Not sure what I got mixed up there 😅

winstliu · 2016-09-12T15:19:46Z

Do you want me to add specs for their "happy paths" as well?

Can you elaborate on this?

mkhl · 2016-09-13T17:35:08Z

Do you want me to add specs for their "happy paths" as well?

Can you elaborate on this?

I was talking about while and until loops, and function declarations. Those don’t have proper tests yet, and I only added tests that ensure that many strange characters are not considered to be word boundaries by the shell.

What’s missing are tests that tokenize a whole block, like the one for for…in loops.

(Tests for for…in loops without the in and for C-style for loops are also missing.)

winstliu · 2016-09-13T17:51:39Z

Ok, since this is only testing for word boundaries, I think it's fine that we don't add new tests for the other loops.

Followup from #35 Fixes #71

mkhl mentioned this pull request Mar 8, 2016

Fix for…in loop handling #36

Closed

winstliu added the needs-review label Mar 8, 2016

mkhl force-pushed the words branch from f98e580 to 401095e Compare March 8, 2016 21:53

winstliu reviewed Sep 12, 2016
View reviewed changes

winstliu mentioned this pull request Jan 9, 2017

Use custom word boundary matchers #65

Merged

winstliu closed this in #65 Jan 9, 2017

winstliu pushed a commit that referenced this pull request Aug 22, 2017

Properly delimit heredoc identifiers

64a0e4d

Followup from #35 Fixes #71

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use custom word boundary matchers #35

Use custom word boundary matchers #35

Uh oh!

mkhl commented Mar 8, 2016

Uh oh!

winstliu commented Mar 8, 2016

Uh oh!

mkhl commented Mar 15, 2016

Uh oh!

winstliu Sep 12, 2016

Uh oh!

mkhl Sep 13, 2016

Uh oh!

winstliu commented Sep 12, 2016

Uh oh!

mkhl commented Sep 13, 2016

Uh oh!

winstliu commented Sep 13, 2016

Uh oh!

Uh oh!


		expect(tokens[0]).toEqual value: 'iffy', scopes: ['source.shell']
		for string of strings

Use custom word boundary matchers #35

Use custom word boundary matchers #35

Uh oh!

Conversation

mkhl commented Mar 8, 2016

Uh oh!

winstliu commented Mar 8, 2016

Uh oh!

mkhl commented Mar 15, 2016

Uh oh!

winstliu Sep 12, 2016

Choose a reason for hiding this comment

Uh oh!

mkhl Sep 13, 2016

Choose a reason for hiding this comment

Uh oh!

winstliu commented Sep 12, 2016

Uh oh!

mkhl commented Sep 13, 2016

Uh oh!

winstliu commented Sep 13, 2016

Uh oh!

Uh oh!