No viable alternative can be incorrectly thrown for start rules without explicit EOF #118

sharwell · 2013-01-02T15:17:01Z

If the start rule does not contain an explicit EOF transition, ParserATNSimulator.adaptivePredict may fail to return a viable alternative.

Grammar:

start : ID | ID INT ID;
ID : [a-z]+;
INT : [0-9]+;
WS : [ \t]+ -> skip;

Input: x 1

The text was updated successfully, but these errors were encountered:

sharwell · 2013-01-02T15:32:57Z

I think the key insight here is we can create a "pseudo-rule" during ATN deserialization with the following form, where rule1, rule2, and rule3 are all the rules in the grammar. This can be reduced a bit by excluding rules from the set which end with an explicit EOF.

During SLL prediction, a configuration which steps out of the decision rule will have at least one configuration which ends up in the wildcard loop in the implicit follow rule.

Special care must be taken for the following:

Only choose a configuration in the implicit follow rule when no other alternatives are viable.
When all viable configurations are in the implicit follow rule, choose the alternative corresponding to the last alternative to enter the implicit follow rule.

… behavior of antlr#118, but the performance overhead is extreme)

sharwell · 2013-01-02T20:03:32Z

When the parser is configured to fall back to LL prediction in the event of SLL conflict, the "Special care" conditions listed above are implicitly handled by the full-context parsing algorithm.

sathya2311 · 2014-07-18T08:11:15Z

I looked into the source code and found that the above is happening because of new error handling strategy introduced in ANTLR 4 as part of org.antlr.v4.runtime.DefaultErrorStrategy class.
Also the javadoc of the 'sync()' methods suggests it may have performance overhead and can be overridden. After overriding the sync() method EOF exception is not occurring and parser terminates as soon as it doesn't find a corresponding match. I think for people who wants to migrate to ANTLRv4 and doesn't need this feature then this is the best option.

sharwell · 2014-07-18T12:21:58Z

I believe you may have misinterpreted the problem described above. To make sure I understand what you are saying, here are the two parts to your comment that led me to this conclusion.

I looked into the source code and found that the above is happening because of new error handling strategy introduced in ANTLR 4 as part of DefaultErrorStrategy class.

By the time the error handler identifies a problem and attempts to report and/or recover from it, the prediction mechanism has already failed to return a correct prediction for a previous input.

parser terminates as soon as it doesn't find a corresponding match

This issue describes a case where the parser should match the input, but fails to do so. Suppressing the error message but failing to find a match would still be the wrong result.

sathya2311 · 2014-07-21T09:30:12Z

@sharwell, We used Antlr3 for file parsing and java objects created out of it. Parsing is done partially so that one object is created at a time. Example grammar is as below

file : metadata record* endsummary

Object is created once a record is identified. After upgrading to Antlr4, it doesnt stop after finding one a record rather continues till EOF and prints error for every record from second instance of it. The error is thrown from DefaultErrorStrategy.sync() method.

I posted it because others may find it useful as well. Do you see any issue with this approach?

sharwell · 2014-07-21T12:08:58Z

Do you have a copy of the grammar and example input demonstrating the problem?

According to antlr/antlr4#118, it is better to add EOF for the first rule in grammar. This can solve the potential incorrect error message.

According to antlr/antlr4#118, incorrect error message might be returned if start rule doesn't contains EOF.

Arpit2506 · 2018-01-09T13:55:01Z

Not able to pass double datatype getting error:
found: '40.715', expected: '}}'

ghost assigned sharwell Jan 2, 2013

sharwell mentioned this issue Jan 2, 2013

EOF handling is not correct #110

Closed

sharwell added a commit to sharwell/antlr4 that referenced this issue Jan 2, 2013

Use implicit follow pseudo-rule to ensure correct SLL handling (fixes…

4c187ba

… behavior of antlr#118, but the performance overhead is extreme)

sharwell mentioned this issue Jan 15, 2014

problem with missing/wrong alts from stop states #412

Closed

sharwell mentioned this issue Mar 22, 2014

Incorrect rule chosen in unambiguous grammar #509

Closed

sharwell mentioned this issue Apr 13, 2014

x: x x | 'y'; sometimes causes "no viable alternative" #545

Closed

This was referenced Jun 8, 2014

no viable alternative at input '<EOF>' #606

Closed

Whitespace skips accept "3 3" as (expr 3) #605

Closed

parrt added type:bug and removed type:bug:2 labels Nov 16, 2014

sharwell removed their assignment Feb 20, 2015

sharwell mentioned this issue Feb 27, 2015

Using a rule in another one change parsing behavior for this rule #826

Closed

wiztigers mentioned this issue Jul 17, 2015

ParserTests failures TypeCobolTeam/TypeCobol#17

Closed

sharwell mentioned this issue Sep 1, 2015

Unrelated rules change matching output #985

Open

This was referenced Jan 19, 2016

C# backtrack won't work. #1095

Closed

Context sensitive parsing bug #1097

Closed

sharwell mentioned this issue Dec 23, 2016

Parentheses without quantifier in parser rule lead to syntax error on non-root rule parsing #1545

Closed

lys0716 added a commit to lys0716/hive that referenced this issue Apr 29, 2017

Add EOF for the first rule in Hplsql.g4

adc2d01

According to antlr/antlr4#118, it is better to add EOF for the first rule in grammar. This can solve the potential incorrect error message.

lys0716 mentioned this issue Apr 29, 2017

HIVE-16595: fix syntax in Hplsql.g4 apache/hive#174

Closed

lys0716 mentioned this issue May 17, 2017

fix the syntax of TypeCalculation prestodb/presto#8042

Closed

martint pushed a commit to martint/presto-facebook that referenced this issue May 17, 2017

Add explicit EOF match to TypeCalculation rule

2867d44

According to antlr/antlr4#118, incorrect error message might be returned if start rule doesn't contains EOF.

sharwell mentioned this issue Jul 11, 2017

Single-token deletion regression between 4.5.3 and 4.7 #1931

Closed

sharwell mentioned this issue Jul 26, 2017

Premature end of parsing #1971

Open

sharwell mentioned this issue Sep 7, 2017

Possible breaking change between versions 4.5.3 and 4.6.4 tunnelvisionlabs/antlr4cs#248

Closed

sharwell mentioned this issue Oct 10, 2019

Upgrade to 4.7.2 lead to change in parser behavior. #2650

Open

This was referenced Nov 25, 2019

Superfluous rule needed to make parser work (4.7.2, Java target) #2689

Open

Failing to report error with incomplete input #2695

Closed

jihoonson mentioned this issue Mar 13, 2021

Caching parsed expressions in sql planner apache/druid#10987

Closed

9 tasks

jihoonson mentioned this issue Mar 28, 2021

Add explicit EOF for expression parser and use assert instead of exception in sql planner apache/druid#11041

Merged

1 task

nrmancuso mentioned this issue Aug 19, 2021

Issue #3095: Add COMPILATION_UNIT token in Ast Tree, remove EOF token checkstyle/checkstyle#10574

Merged

mjw99 mentioned this issue Oct 25, 2021

‘antlr’ exception when parsing Tags BlueObelisk/chemicaltagger#8

Open

kaby76 mentioned this issue Dec 1, 2021

Malformed start rules in grammars antlr/grammars-v4#2405

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

No viable alternative can be incorrectly thrown for start rules without explicit EOF #118

No viable alternative can be incorrectly thrown for start rules without explicit EOF #118

sharwell commented Jan 2, 2013

sharwell commented Jan 2, 2013

sharwell commented Jan 2, 2013

sathya2311 commented Jul 18, 2014

sharwell commented Jul 18, 2014

sathya2311 commented Jul 21, 2014

sharwell commented Jul 21, 2014

Arpit2506 commented Jan 9, 2018

No viable alternative can be incorrectly thrown for start rules without explicit EOF #118

No viable alternative can be incorrectly thrown for start rules without explicit EOF #118

Comments

sharwell commented Jan 2, 2013

sharwell commented Jan 2, 2013

sharwell commented Jan 2, 2013

sathya2311 commented Jul 18, 2014

sharwell commented Jul 18, 2014

sathya2311 commented Jul 21, 2014

sharwell commented Jul 21, 2014

Arpit2506 commented Jan 9, 2018