Regression: "\8", "\9" - Unexpected token ILLEGAL #1226

zlobz · 2015-07-08T12:00:00Z

2.4.1
See #1106
bf5c615

// Error: Line 1: Unexpected token ILLEGAL
var esprima = require('esprima');
esprima.parse('"\\8"');

Some valid strings that esprima currently does not parse:

"\8"
"\9"

The text was updated successfully, but these errors were encountered:

michaelficarra · 2015-07-08T15:03:23Z

Those are not valid. If you feel otherwise, show me the grammar production that allows them.

zlobz · 2015-07-08T17:39:52Z

No, it is valid. "\8\9" works in the wild.

> "\8\9"
'89'

https://es5.github.io/#x16
An implementation may extend program syntax and regular expression pattern or flag syntax.

dmethvin · 2015-07-08T18:14:31Z

Seems like the \ DecimalDigit production would apply? Things are complicated a bit because in non-strict mode there's also LegacyOctalEscapeSequence but none of those productions apply since 8 is not an octal digit. In strict a valid octal sequence generates a syntax error but an invalid one like \8 silently converts to 8.

zlobz · 2015-07-08T18:19:56Z

Standard ECMA-262 June 1997 http://tecfa.unige.ch/guides/js/e262-pdf.pdf
7.7.4 String Literals

EscapeSequence ::
CharacterEscapeSequence
OctalEscapeSequence
HexEscapeSequence
UnicodeEscapeSequence

CharacterEscapeSequence ::
\ SingleEscapeCharacter
\ NonEscapeCharacter

NonEscapeCharacter::
SourceCharacter but not EscapeCharacter or LineTerminator

EscapeCharacter ::
SingleEscapeCharacter
OctalDigit
x
u

SingleEscapeCharacter :: one of
'"\ b f n r t

mathiasbynens · 2015-07-08T18:21:52Z

See whatwg/javascript#12 and especially https://bugs.ecmascript.org/show_bug.cgi?id=3477.

dmethvin · 2015-07-08T18:33:19Z

This is what i saw in ES2015 RC1:

SingleStringCharacter ::
\ EscapeSequence

EscapeSequence ::
CharacterEscapeSequence

CharacterEscapeSequence ::
NonEscapeCharacter

NonEscapeCharacter ::
SourceCharacter but not one of EscapeCharacter or LineTerminator

EscapeCharacter ::
SingleEscapeCharacter
DecimalDigit
x
u

ariya · 2015-07-08T18:49:42Z

@dmethvin Good analysis! That means \8 does not fulfill any production since 8 is a DecimalDigit and NonEscapeCharacter can't be it.

zlobz · 2015-07-08T19:25:23Z

@dmethvin

but an invalid one like \8 silently converts to 8.

because it is valid in old spec - ECMA-262 1997. since
8 is SourceCharacter
8 is not EscapeCharacter because it is not OctalDigit

NonEscapeCharacter ::
SourceCharacter but not one of EscapeCharacter or LineTerminator

EscapeCharacter ::
OctalDigit

dmethvin · 2015-07-08T19:36:39Z

Yeah, I meant \8 was invalid octal.

ariya · 2015-07-08T20:04:27Z

@xzo That's very old. ES 5.1 already specified EscapeCharacter to be exactly like above (see 7.8.4)

I don't think Esprima needs to support ECMAScript < 5 at all.

zlobz · 2015-07-08T21:51:56Z

Why parser doesn't need to support it, If modern ECMAScript engines support it? How to parse obfuscated code, for example?

ariya · 2015-07-09T04:39:29Z

Do we have some information as to how many scripts out there still utilizing such an escape sequence?

In all cases, even if we want to support, at best this is an error that should be tolerated, aka when tolerant is set to true in the parsing options.

mathiasbynens · 2015-07-09T06:25:02Z

This is on the table for ES7: https://bugs.ecmascript.org/show_bug.cgi?id=3477

Fixes jquery#1226

zlobz changed the title ~~Regression: "\812" - Unexpected token ILLEGAL~~ Regression: "Hello\812World" - Unexpected token ILLEGAL Jul 8, 2015

zlobz changed the title ~~Regression: "Hello\812World" - Unexpected token ILLEGAL~~ Regression: "\\8" - Unexpected token ILLEGAL Jul 8, 2015

zlobz changed the title ~~Regression: "\\8" - Unexpected token ILLEGAL~~ Regression: "\\8", "\\9" - Unexpected token ILLEGAL Jul 8, 2015

zlobz pushed a commit to zlobz/esprima that referenced this issue Jul 8, 2015

fixes jquery#1226: accept string escape sequences: \8, \9

896e175

zlobz changed the title ~~Regression: "\\8", "\\9" - Unexpected token ILLEGAL~~ Regression: "\8", "\9" - Unexpected token ILLEGAL Jul 8, 2015

ariya added a commit to ariya/esprima that referenced this issue Jul 11, 2015

In tolerant mode, tolerate invalid escape sequences.

0c5a20c

Fixes jquery#1226

ariya mentioned this issue Jul 11, 2015

In tolerant mode, tolerate invalid escape sequences. #1231

Closed

ariya closed this as completed in 5917ae4 Jul 14, 2015

ariya mentioned this issue Oct 25, 2016

string literal with "\9" fails with ILLEGAL though it works in browser #1601

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regression: "\8", "\9" - Unexpected token ILLEGAL #1226

Regression: "\8", "\9" - Unexpected token ILLEGAL #1226

zlobz commented Jul 8, 2015

michaelficarra commented Jul 8, 2015

zlobz commented Jul 8, 2015

dmethvin commented Jul 8, 2015

zlobz commented Jul 8, 2015

mathiasbynens commented Jul 8, 2015

dmethvin commented Jul 8, 2015

ariya commented Jul 8, 2015

zlobz commented Jul 8, 2015

dmethvin commented Jul 8, 2015

ariya commented Jul 8, 2015

zlobz commented Jul 8, 2015

ariya commented Jul 9, 2015

mathiasbynens commented Jul 9, 2015

Regression: "\8", "\9" - Unexpected token ILLEGAL #1226

Regression: "\8", "\9" - Unexpected token ILLEGAL #1226

Comments

zlobz commented Jul 8, 2015

michaelficarra commented Jul 8, 2015

zlobz commented Jul 8, 2015

dmethvin commented Jul 8, 2015

zlobz commented Jul 8, 2015

mathiasbynens commented Jul 8, 2015

dmethvin commented Jul 8, 2015

ariya commented Jul 8, 2015

zlobz commented Jul 8, 2015

dmethvin commented Jul 8, 2015

ariya commented Jul 8, 2015

zlobz commented Jul 8, 2015

ariya commented Jul 9, 2015

mathiasbynens commented Jul 9, 2015